Gene TM1040_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1094 
Symbol 
ID4076327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1172453 
End bp1173727 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content61% 
IMG OID638006398 
Productmajor facilitator transporter 
Protein accessionYP_613089 
Protein GI99080935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAAG TTCTGTCGAG CGCCTGGGCG CTTTTGCTGG GCATTGGCCT TTTGATGTTG 
GGCAACGGCC TGCAAGGCAC GCTGCTGGGT GTGCGCGGTG GCATCGAGGG CTATTCCGCG
CTCACCATGT CGCTGGTGAT GTCCACCTAT TTTGTGGGCT TGCTCCTGGG GTCCTGGGTG
GCGCCGGGGA TGATCCGGCG TGTGGGCCAT GTGCGGGTCT TTGCGGCGCT GGCCTCGCTG
ATTTCGGCGG TGATGGTGAT CTATCCCGCG CTGCCCAACC CGATTGTCTG GATGCTGGGC
CGTCTGGTCG TGGGGTTCTG CTTTTCTGGC GTGTATGTCA CCGCCGAAAG CTGGCTCAAC
AATGCGGCTG ACAACCAGAA CCGAGGCAAG GCGCTCTCTC TCTATATGGT CGTGATGACA
CTGGGGCTGG TGGCCGCACA GGGTTTCATC CTGATCGGGG ACCCGGCGGG CTATCTGCCG
TTTGTGATCG CCTCCATCGC GGTTTCGATC TCCTTCGCCC CGATTTTGCT GTCGATTTCG
CCGACCCCCG CCTTTGATAC GGCCAAGCGG ATGACGCTTC GCGAATTGAT GCATGCCTCG
CCCCTTGGGT GTGTGGGGAT GTTCCTCATT GGCGGCGTGT TCTCGGCGCA GTTTGGTATG
TCTGCGGTCT ATGCCACCGA GGCTGGGATG GAGCTGCATC AGGTTTCGCT CTTTGCGGCG
AGCTTCTATG TCGGTGCGCT CTTTATGCAG TTCCCGCTGG GGCTGCTGTC CGACCGGATG
GATCGACGGG TGCTGATCAT GATCGTGGCA GGGGTCGCCG GGGTGACCTC GGTGCTGGCG
ATGCTGCTTG GGGGCACGTT CAGCCTGCTT CTGCTGGCCG CCTTTGTGAT CGGCGGGCTG
ATCAACCCGC TCTATTCCTT GCTCCTGGCC CACACCAATG ACTTTCTCGA TCACGACGAT
ATGGCCTCTG CCTCGGGCGG GCTGATCTTT ATCAACGGGC TTGGCGCCTG TAGCGGGCCG
GTCATCATCG GCTGGCTGAT GTCGGACGCG ATGTTCGGAC CCAACGGGTT CTTCCTCTTC
ATGGCGATAT TGCTGGGCGT GTTGGTTCTT TATGCCGGGT ATCGCGCAAC GCAGCGTGCG
ACCATTCCAG TCGAAGAGAC CGGTGTCATG CCCGCCATGA GCCCGACCGC GACCTCGGTC
GCGGTAGAGG TGGCGCAGGA ATACGCCATC GAAACCGAGC TCGAAGAGCA AGACAGCGCC
ACAACCACGG GCTGA
 
Protein sequence
MLQVLSSAWA LLLGIGLLML GNGLQGTLLG VRGGIEGYSA LTMSLVMSTY FVGLLLGSWV 
APGMIRRVGH VRVFAALASL ISAVMVIYPA LPNPIVWMLG RLVVGFCFSG VYVTAESWLN
NAADNQNRGK ALSLYMVVMT LGLVAAQGFI LIGDPAGYLP FVIASIAVSI SFAPILLSIS
PTPAFDTAKR MTLRELMHAS PLGCVGMFLI GGVFSAQFGM SAVYATEAGM ELHQVSLFAA
SFYVGALFMQ FPLGLLSDRM DRRVLIMIVA GVAGVTSVLA MLLGGTFSLL LLAAFVIGGL
INPLYSLLLA HTNDFLDHDD MASASGGLIF INGLGACSGP VIIGWLMSDA MFGPNGFFLF
MAILLGVLVL YAGYRATQRA TIPVEETGVM PAMSPTATSV AVEVAQEYAI ETELEEQDSA
TTTG