Gene TM1040_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0218 
Symbol 
ID4076251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp233213 
End bp234322 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content64% 
IMG OID638005512 
Productmajor facilitator transporter 
Protein accessionYP_612213 
Protein GI99080059 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.851402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTTCT GTGTCGGGGC CTTTGCGGCC TATGTGCCCC AACTGAAAAG TCAGGCGGGG 
CTGAGTGATG CCGAATTTGG CCTTGCCCTG TTGATCGGTG CCGCCGGAGC CGTGGGCGCG
ATGTGGCTCG CGCCGCGCGT GGATCGCCTG CTGGGACATG CGGCGATGAC CATCTGCGCG
CTCTTGCTGG CGACGGCCTT TCTGCTGCCG GGCCTTGCGC AGAGCTGGGC CGGTTTTGCC
GCGGCGATGT TTCTGGCCTC CGGGGCGGGA GGGCTTCTGG ATGTGGTGAT GAATGCACGT
CTGTCCGGGC TCGAGGCACG TTCTGGCCGC TCCTTGATGA ACCTCAACCA TGGTCTGTTT
TCGCTTGCCT ACGCCCTGGC TGCCCTGGTC GCCGGGCTCG TTCGCGAAGC AGGTGTGCCG
CCCGTCTGGT GTTTTGCGGG CATCCTTTTG GTGTCGGGCC TGTTGGCATT GGGAATGCGG
GATGAGGTGC CGCCTGCGCC CGCCGAAGAT CCGCAAGGGA ACGCGTCGCT GCCTCTGCCT
GGGGCAATGA TTGTGATTGC CGGGCTGATC GTGCTCATTG CCTTTACGGC TGAGCAGGCA
ACGGAACATT GGTCTGCACT GCATCTGGAG CGCGCCTTTG GCGCGAATGC GGCGGAAGGG
GCTTTGGGGC CTGCGATTCT CGGTTTCACG ATGGGTATCG GGCGTCTCTC CGGACAGGAG
CTGGTGCGCC GCGTGGCCGA GGGCAGGCTC ATGCAGATCG CCGCGGCGCT CGCGGCATCG
GGGTTGATCC TTGCGGCGTT TGCCCCCGTG CAGGCTCTGG CCTACGCGGG CTTTGCGATC
CTTGGATTGG GCGTTTCTAC AGTCGGGCCG ACGGCGCTCG CCTGGGTGGG CAAAACAATC
CCATCGCGCC TGCGCGCGGC GGCCATTTCG CGGCTGGTGA TGATTGGCTA TTGCGGGTTC
TTTGTGGGGC CGCCAGTGAT CGGCTTTATT GCAGAGGTAT TTGGTCTGCG GCTGGCATTG
GCCCTGATGG GGGCAATGCT GCTCTGTATC ACGGTACTTT TGGTGCCCGC CCTGCGCGCC
AGTGCGCGGA AATCCGAGCT GATTGCCTGA
 
Protein sequence
MGFCVGAFAA YVPQLKSQAG LSDAEFGLAL LIGAAGAVGA MWLAPRVDRL LGHAAMTICA 
LLLATAFLLP GLAQSWAGFA AAMFLASGAG GLLDVVMNAR LSGLEARSGR SLMNLNHGLF
SLAYALAALV AGLVREAGVP PVWCFAGILL VSGLLALGMR DEVPPAPAED PQGNASLPLP
GAMIVIAGLI VLIAFTAEQA TEHWSALHLE RAFGANAAEG ALGPAILGFT MGIGRLSGQE
LVRRVAEGRL MQIAAALAAS GLILAAFAPV QALAYAGFAI LGLGVSTVGP TALAWVGKTI
PSRLRAAAIS RLVMIGYCGF FVGPPVIGFI AEVFGLRLAL ALMGAMLLCI TVLLVPALRA
SARKSELIA