Gene Smed_3821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3821 
Symbol 
ID5318013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp275504 
End bp276505 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content60% 
IMG OID640775633 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001312566 
Protein GI150375970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTCT ACAAGAAGAT TGCTACCTGC ACGGTCGCTT TGGGTGCGCT TTGCGCTGCC 
CAGATTGCCG TCGCGCAGGA TGCGCCCCCG GTCGTGACGG TCGTCAAGGT GACCGGCGAA
AACTGGTTCA CGCGCATGGA GGAAGGCGTT GTTGCCTACG GCAAGGACAA TACGGGCGTT
TCGACGAGCC AGATCGGTCC CGCCAAGGCG GACGCCGCCC AGCAGCTGCG CCTCATAGAA
GACCTCGTCG CGAAGAATGT CAGCGCCATC GCTGTCGTGC CGATGGACCC CTCTGCTCTC
GAAGGCGTCT TCAAGCGCGC GATGAACCGC GGCATCAAGA TCGTCACGCA CGAAGCCGAC
AGCTTGAAGA ATACGCAGGT CGATATCGAA GCCTTCGACA ACAAGGTCTT CGGCGCGCGC
TTCAACGAGA AACTGGCCGA GTGCATGGGC AAGTCCGGCA AGTGGACGTC ATTCGTCGGG
TCGCTCGGCA GCCTGACGCA CGTACAATGG GCTGACGGCG GCGCGGAGAA CGCCAAGAAA
TATCCGGAAA TGGAACTCGT CTCCGAGAAG AACGAGTCCT TCAACGACGC CAACAAGGCC
TACGAAAAGG CGCGCGAGAT CCTTCGCAAG TATCCTGACA TCAAAGGCTT CCAGGGCGGT
TCGGCCATTG ACGTCATCGG AATCGGCCGC GCCGTCGAGG AAGCCGGCCT TGTGGGGAAG
GTTTGCGTCG TCGGCCTCGG GCTGCCGAAG GACACCGCCA AGTACCTCGA ATCCGGTGCG
GTCCAGAGCA TTTCCTTCTG GGACCCGAAG GATGCGGGTT ATGTGATGAA CAAGGTTGCT
CAGCTCGTGA TCGAGGGCAA GGAAATCACC GATGGTATGG ATCTCGGAGT CCCGGGCTAC
AACAAGGTGT CCGTGAAGCA GGGTCCCGGC GAAGGCATCA TCGTCGTCGG CGAAGCCTGG
GTTGACGTCG ATAAGTCCAA CTACAGCCAG TATCCGTTCT GA
 
Protein sequence
MSVYKKIATC TVALGALCAA QIAVAQDAPP VVTVVKVTGE NWFTRMEEGV VAYGKDNTGV 
STSQIGPAKA DAAQQLRLIE DLVAKNVSAI AVVPMDPSAL EGVFKRAMNR GIKIVTHEAD
SLKNTQVDIE AFDNKVFGAR FNEKLAECMG KSGKWTSFVG SLGSLTHVQW ADGGAENAKK
YPEMELVSEK NESFNDANKA YEKAREILRK YPDIKGFQGG SAIDVIGIGR AVEEAGLVGK
VCVVGLGLPK DTAKYLESGA VQSISFWDPK DAGYVMNKVA QLVIEGKEIT DGMDLGVPGY
NKVSVKQGPG EGIIVVGEAW VDVDKSNYSQ YPF