Gene TM1040_2672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2672 
Symbol 
ID4077583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2807277 
End bp2808515 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content64% 
IMG OID638007996 
Productmajor facilitator transporter 
Protein accessionYP_614666 
Protein GI99082512 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGAT CCACCCCTCT GTTTACCCCC GTCCTGCTGG TGGGCTGCCT GATCATCATG 
GTGAGCTTTG CCGTGCGCGC CTCCTTTGGG GTGTTCCAGA TCCCGATCGC CGATGATTTT
GGCTGGCTCC GGAGCGAGTT CTCCCTCGCC ATCGCGATCC AGAACCTCGC CTGGGGGATC
GGGCAGCCGA TCTTTGGCGC CATTGCCGAG AAGATCGGAG ACCGCAAGGC GATCATCATC
GGGGCCGTGG TCTATGCGGC GGGGCTTGTG CTGAGTGCCG GAGCCACCAC CCCCTTCGAG
ATGCAGGCCT ATGAGTGGCT GGTGGGCTTT GGCGTTGCGG GCACGGGCTT TGGCGTTGTG
CTTGCGGTGG TCGGCCGGGC GAGCTCGGAC GAGAACCGGT CCATGTCACT GGCGATTGTC
ACCGCGGCAG GCTCTGCGGG GCAGATCTTC GGCGCGCCGA CGGCGGAATA TATGCTTGGC
CTGATGTCCT GGCAGTCGGT GTTCCTGGTC TTTGCCGGCG TGGTGCTGGC GCTGATCCTG
TCGCTGCCCC TGATGCGCGC GCCGGTCTCT GCGGGCAAGG CGGAGCTTGA GGAAAGCATG
GGCGCGATCC TCAAAAAAGC CTTCCGCGAC CCGTCCTATA CGCTGATATT CCTCGGGTTT
TTCAGCTGTG GCTATCAGCT GGCCTTTGTG ACGGCGCATT TTCCGGCCTT TGTGACCGAG
ATGTGCGGGC CGATCATGCC CGGCGGTGTG CTGCATGGGA TGGGGATCAC CACCACCTCG
GCGCTGGGTG CGGTGTCGAT TTCGCTCATC GGTCTGGCGA ATGTGGCAGG CACGCTGCTC
GCGGGCTGGG CGGGCAAGCA TTACTCCAAG AAATATCTGC TGGCGGGGAT CTACACCGCG
CGGACCATCG TGGCCGGGGC CTTTATCCTG CTGCCGATCA CGCCTTTGTC GGTGATCCTC
TTTTCGGTGG CGATGGGCTC GCTCTGGCTC GCGACCGTGC CGCTTACTTC CGGGCTGGTC
GCGCATATCT ACGGGCTGCG CTACATGGGG ACGCTCTATG GGATCGTGTT CCTGAGCCAC
CAGATCGGCG GGTTCCTCGG CGTGTGGCTC GGTGGGCGGA TGTATGACAT CTATGGCGAC
TACACGATGG TCTGGTGGAT CGGTGTGGGC GTCGGAGCCT TCAGCGCGAT TGTGCATCTG
CCGGTGCGCG AGCGTCCGTT GCAGGCGGCT GCGGCCTGA
 
Protein sequence
MDRSTPLFTP VLLVGCLIIM VSFAVRASFG VFQIPIADDF GWLRSEFSLA IAIQNLAWGI 
GQPIFGAIAE KIGDRKAIII GAVVYAAGLV LSAGATTPFE MQAYEWLVGF GVAGTGFGVV
LAVVGRASSD ENRSMSLAIV TAAGSAGQIF GAPTAEYMLG LMSWQSVFLV FAGVVLALIL
SLPLMRAPVS AGKAELEESM GAILKKAFRD PSYTLIFLGF FSCGYQLAFV TAHFPAFVTE
MCGPIMPGGV LHGMGITTTS ALGAVSISLI GLANVAGTLL AGWAGKHYSK KYLLAGIYTA
RTIVAGAFIL LPITPLSVIL FSVAMGSLWL ATVPLTSGLV AHIYGLRYMG TLYGIVFLSH
QIGGFLGVWL GGRMYDIYGD YTMVWWIGVG VGAFSAIVHL PVRERPLQAA AA