Gene TM1040_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3112 
Symbol 
ID4075559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp83029 
End bp84546 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content57% 
IMG OID638004614 
Productextracellular solute-binding protein 
Protein accessionYP_611348 
Protein GI99078090 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.817588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCA GATCCATGCT GCTTGCCTCA GCGCTGGCGC TTTCCTCCGC GCTTCCCAGT 
TTTGCCGACA AGGCGAATGA CACGCTGGTC GCAGCTTTCA ACAAGGAAGT TCAGACGCTT
GACGGTCTCT ACTCAACGTC GCGCGAGAAC CTGATCCTGT CCTATCTGAC CTCGGATCAG
CTGGTCGAGC TGAACCTCGA CACCGGAGAG TATGAAGGGG CACTAGCGGA AAGCTATACT
TGGGTCGATG ATCGCACCAT CGATTTCACA CTCCGCGAAG GCCTGACATT CCATGACGGT
TCGCCTGTAA TGGTCGAAGA CATCGTCTAT TCCTTTGACT GGATCGCCAA TGCGGACTCC
AAGACCAAAC GCGGCGCCTT TATCCGTGGT TGGTTCGAGA GCGCTGTCGC GATTGATGAT
CGCACAGTGC GCGTCACCGC CAAACAACCC TATCCTTTGA TGCTGCGTGA CATCGCCGTC
TTCGTTCTCA CTCGCAAGGC AGGCAGCTAT GGCGATGGCA ACCCTGATGC GCTGACGCAG
AACTTCGTCG GCACCGGCCC TTACAAGATT TCTGAATTTG CCATGGGCGC CGGTGTGCAG
CTGGAGCGCT ATGATGGATA CTACACTGGC GGACCCAAGG CGGCCGGTTC GATTGAGAAA
ATCGTCCTGC GCCCGATTCC CGACTGGGGC ACCGTGACGG CGGAATTGCT GTCGGGGGGC
GTAAACTGGT CTTTCAACGT GCCTGACGAT ACGGCCAAAG ATCTGGGCGG ATTGCCCATG
GTGGATCATG TATCCGGCGT GTCCACGCGC GTTGCCTTCC TGGTGCTGGA CGCCGCAGGT
GTCAGCGATG CGGAAGGCCC AATGACCAAC AAGCTGGTGC GTCAGGCGCT CAACCATGCG
GTAAACCGGA AAGAAATCGT TGAATTTCTC GTCGGCGGTT CGGGCCGCGT TGTTCACTCG
ACCTGTAACG CGGGCATGTT CGGCTGTGAT GTCGAGATCA CGGAATACGA TTATGACCCC
GAAAAGGCCA AGGCTTTGCT GGCTGAAGCG GGCTACCCGG ACGGCTTTGA GTTCGACCTG
ACCGCCTATC GCGAACGCCC CATCATGGAA GCAGTTGCCG CGGATCTGGC CGAAATCGGC
GTGATCGCGA ATATCAACTT CGTAAAGCTT TCCGCGTTGT CCAAATCCCG CGCCGAAGGT
CAGCTTGAAG CATTCCAGAA CGCCTGGGGC TTTTATGCGA CGCCGGATCT GGGCGCGATT
TCCAACTACT ATGTCGAAGG ATCCAACCGC AACCTACACC AAGACGCAGA GGTTCAAGGT
TGGTTCAAGG CTGCGCTGGA AACTGTCGAT CAAGACGAGC GCGCAGATCT CTATGCGAAG
GCCCTGCAGA AGATCGCCGA TGAGGCTTAC CTGCTTCCGA TCTTCCAGTA TTCGCAAAAC
TACGTGAAGA GCGTGGATGT GAATTTTGCG GCACCGGCCG ACGGCCTGCC ACGGCTCAAT
GAGCTGAGCT GGAAGTAA
 
Protein sequence
MSVRSMLLAS ALALSSALPS FADKANDTLV AAFNKEVQTL DGLYSTSREN LILSYLTSDQ 
LVELNLDTGE YEGALAESYT WVDDRTIDFT LREGLTFHDG SPVMVEDIVY SFDWIANADS
KTKRGAFIRG WFESAVAIDD RTVRVTAKQP YPLMLRDIAV FVLTRKAGSY GDGNPDALTQ
NFVGTGPYKI SEFAMGAGVQ LERYDGYYTG GPKAAGSIEK IVLRPIPDWG TVTAELLSGG
VNWSFNVPDD TAKDLGGLPM VDHVSGVSTR VAFLVLDAAG VSDAEGPMTN KLVRQALNHA
VNRKEIVEFL VGGSGRVVHS TCNAGMFGCD VEITEYDYDP EKAKALLAEA GYPDGFEFDL
TAYRERPIME AVAADLAEIG VIANINFVKL SALSKSRAEG QLEAFQNAWG FYATPDLGAI
SNYYVEGSNR NLHQDAEVQG WFKAALETVD QDERADLYAK ALQKIADEAY LLPIFQYSQN
YVKSVDVNFA APADGLPRLN ELSWK