Gene TM1040_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2191 
Symbol 
ID4078182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2301194 
End bp2302420 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content63% 
IMG OID638007513 
Productmajor facilitator transporter 
Protein accessionYP_614185 
Protein GI99082031 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACGA TCCTCTCTTT TGCCGCCCTC TTTCTTTCCG TGATTTTGCT TCAGCTCTCG 
ACCGGAGGTG TGGGGCCGCT GGATGCCATT TCCGGCGCTG CGATGGGGTT TGACAACCGC
CAGATCGGCC TTCTGGGCTC GGCGCATTTC TTTGGCTTCT TTATCGGCTG CTGGTGGACC
CCGCGCCTTA TGGGTCGTGT CGGACATTCG CGCGCTTTTG CGGTTTGCAC GGCACTTGGC
GCGATGGGCC TCTTGGCGCA TACGCTCACG GACGATCCCT ACGCATGGGC CGCCATGCGC
ATCGCATCTG GCCTCTGTGT CGCGGGCTGC TACACGGTCA TCGAGGCCTG GATCAACGCA
AAGGTCACCA ATGAGAACCG GGGCCGCACC TCCGGTCTCT ATCGTATTGC CGACACCAGT
GCGTCTCTGG CGGCGCAGCT GTTGATCGCG GTGCTGCCGC CTGCGCATTA CATCTCGTAT
AACCTGCTGG CGATCCTGTG CTGCGCAACG CTGCTGCCGC TGATGATGAC CCGCGCGAGC
CAGCCCGAGA TGCCCGCCGC CCCGCGCCTG AGACCCCGTC TGGCCTGGCG TTGTTCACCC
CTCGGGGTGG CCGGCGTGAT CGTCTCCGCC CTGAGTGGCG CCTCCTTCAG GATGGTTGGC
CCGATCTACG GTCAGGAAGT CGGACTGGTG ATCGACCAGA TCGCCTTTTT CCTTGCCGCC
TTCGTTCTGG GCGGCGCGCT CGCGCAGTAC CCCGTGGGCT GGCTTGCGGA TAAATTCGAT
CGTCGCTGGG TGCTCATCTG GCTCTCAGGC GTCTCGATGA TTGCCTGCGC CGTGACCCTG
CCTGCAAGCG GTGGCGGCAC GATTGCCATC ATGGCGTCGG CGGCTTTCTT CGGCCTGACG
ACAGTGCCGA TCTTTTCTGT CTCTGCCGCG CATGCCAATG ACTTTGCCAC CTCCGAAGAG
CGGGTGGAAC TCTCAGCGGC CTTGATGTTC TTCTACGCCA CGGGCGCGAT TGCCGCACCC
TTTATCGCCT CGGCCCTCAT TGAGGCCTTC GGGCCAGGGG CGCTATTTGT CTTTATCGCA
GTGGGGCATG CGGGGTTGAT CGTCTTTGGC CTTGCAAGGA TGCGCAGCCG CGCCACCCCA
ACCGAACGCA CCCGCTACGT CTATGCGCCG CGCACGTCAT TCACGATCGG CAAGATGCTG
AAACGCGCCC GTGAGGGGCG GCAATGA
 
Protein sequence
MRTILSFAAL FLSVILLQLS TGGVGPLDAI SGAAMGFDNR QIGLLGSAHF FGFFIGCWWT 
PRLMGRVGHS RAFAVCTALG AMGLLAHTLT DDPYAWAAMR IASGLCVAGC YTVIEAWINA
KVTNENRGRT SGLYRIADTS ASLAAQLLIA VLPPAHYISY NLLAILCCAT LLPLMMTRAS
QPEMPAAPRL RPRLAWRCSP LGVAGVIVSA LSGASFRMVG PIYGQEVGLV IDQIAFFLAA
FVLGGALAQY PVGWLADKFD RRWVLIWLSG VSMIACAVTL PASGGGTIAI MASAAFFGLT
TVPIFSVSAA HANDFATSEE RVELSAALMF FYATGAIAAP FIASALIEAF GPGALFVFIA
VGHAGLIVFG LARMRSRATP TERTRYVYAP RTSFTIGKML KRAREGRQ