Gene TM1040_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1097 
Symbol 
ID4077804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1176282 
End bp1177466 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content66% 
IMG OID638006401 
Productmajor facilitator transporter 
Protein accessionYP_613092 
Protein GI99080938 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGC AGGGGCAGGG GACGAACTGG GTCATGGTGC TCTTGATCTG GGCCGCGGGC 
CTCGGCGCGG CGGCGCAATA CGGCAAGATC GCCGTGATCT TTGACCAGCT GCCTGCGCTT
TATCCCGGTG TGGGCGCGGC GATGGGCTGG ACGGTGTCGC TGGTGGGGGT GCTGGGGATC
ATTTTTGGCG TTGTGGCGGG GCTTTATGTG TCGGCCATCG GTTTTCGGCG TACGCTTGTT
CTGTCGCTGG TGCTTGGCGC GGGGGTCTCG GGCCTGCAGG CGCTGCATCT GCCCTTTGGC
CTGTTTCTCA TCACGCGCAT GGTCGAGGGG ATCTCCCATC TGGGCGTGGT TGTTGCGGCG
CCGACGCTGA TGGCGATCCT CGCGCGTGGC CCGGCGCGTG GGGTGGCGCT GACGATCTGG
AGTACGTTCT TTGGCGTTGC CTTTGCGCTC TTGACGTGGT TCGGGCTGCC GCTCGTCGAG
GCACGGGGCA TCCCTGCGCT TTTTGCGGTG CATGCGGGGA TGATGGGCCT TCTGGCGCTC
ATCCTGCATT GGGGGCTGCG CGACTTGCCG GTGCCGCCGC GCGCCAGCTA TCCTGATCTG
CGCGCCTTGC CATCGCTGCA TCTGAATATC TACCGATCGC CGCACAAGCT GGCGCCTGCC
GCGGGCTGGC TCTTTTATAC CTGCTGCTTT GTGGCGGTGC TGACGGTGCT GCCGCCCTAT
ATCGCCGAGA GCCAGCGTGC GCATGTGATG GGGGCGATGC CTCTGGTGTC GATCGTGGTC
TCTCTGACGC TTGGGGCTGG CCTGCTGCGC GTGACCTCGG GGGTCAAAGT GGTGCAGCTT
GGGTTCCTCA TCGGCACGGT GGCGATGCTC TGGCTTTGGG CGATGCCGGG GTACTGGCTG
GCCTGCATGG TCTTGGCGGC GGGGTTCGGG CTGGTGCAGG GGGCCAGCTT TGCTTCTGTG
CCGCAGCTCA ATGACACGCC TTCGACGCAA TCAGAGGCCA ATGGCGCCAT GGCGCAGGCG
GGCAACATGG GCAATGCCAT CGGCACACCG CTGTTTGTCG CCGTGCTGAC CTATGGGGGC
TATGGCTCGC TGGTGCTGAC CGTGGCGCTG CTGCTTTTGG CCGGGGCCGT GGTGCATCAG
GCGCTTGCGC TGCACCGGCA ACGGGTGGCG CGGGGGGCGG TCTGA
 
Protein sequence
MQQQGQGTNW VMVLLIWAAG LGAAAQYGKI AVIFDQLPAL YPGVGAAMGW TVSLVGVLGI 
IFGVVAGLYV SAIGFRRTLV LSLVLGAGVS GLQALHLPFG LFLITRMVEG ISHLGVVVAA
PTLMAILARG PARGVALTIW STFFGVAFAL LTWFGLPLVE ARGIPALFAV HAGMMGLLAL
ILHWGLRDLP VPPRASYPDL RALPSLHLNI YRSPHKLAPA AGWLFYTCCF VAVLTVLPPY
IAESQRAHVM GAMPLVSIVV SLTLGAGLLR VTSGVKVVQL GFLIGTVAML WLWAMPGYWL
ACMVLAAGFG LVQGASFASV PQLNDTPSTQ SEANGAMAQA GNMGNAIGTP LFVAVLTYGG
YGSLVLTVAL LLLAGAVVHQ ALALHRQRVA RGAV