Gene TM1040_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0336 
Symbol 
ID4076037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp339939 
End bp341264 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content64% 
IMG OID638005631 
Productmajor facilitator transporter 
Protein accessionYP_612331 
Protein GI99080177 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00900] H+ Antiporter protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCGA TTTTGGCAGA CCGCACCTAC CGACATCTTT TCCTCGCGCA GGTCGTCGCC 
CTGTTGGGCA CGGGGCTGTC GACGGTCGCG CTTGGGCTAT TGGCCTATGA CCTGGCGGGC
AACAGCGCCG CGATGGTGCT CGGCACTGTC TTTACCCTGA AGATGGTCGC TTATGTGGGG
ATTGCGCCGG TGGCCGGGGC CTTTGCCGAC AGTGTGAACC GGCGCCAGTT GCTCGTGATC
CTCGATCTGG TTCGCGCCGG GGTTGCCATC GCCCTGCCCT TCGTGACCGA GATCTGGCAG
GTCTATGTGC TGATCTTCCT GCTGCAATCG GCCTCCGCCG CCTTTACGCC CACCTTTCAG
GCAACCCTTC CGGATGTGTT GCCCGAAGAG GAGCGCTACA CCCGCGCGCT GTCACTGTCG
CGTCTCGCCT ATGATCTCGA AAATATCATC AGCCCGACGC TGGCCGCGCT GCTGTTGACC
GTGATGGCCT ACAACACGCT GTTCCTCGGC ACGGCGATCG GGTTTCTGGG ATCTGCGCTG
CTTGTGGTCT CGGTGCTCCT GCCCAGCCCC AAGCCAAGCA CGCGGCGCGG GATCTATGCC
CGCACCACAC GCGGTATTCG GATCTACCTT GCCACGCCTC GGCTGCGCGG GCTCTTGTGC
CTGAACCTCG CGGTCGCATC TGCCGGAGCC ATGGTGCTGG TGAACTCGGT TGTGCTGATA
CGCGGAGAGC TTGGCCTCTC CGAGAGCGCG CTTGCCTGGA CCCTCTTTGC CTTCGGGGCC
GGGTCGATGC TGGCGTCTTT GGCGCTGCCG CGGGTGCTCG ACACGCTCAA GGACCGTCCG
GTGATGATCG CCGGGGCCAC CCTGATGGTG GGGGCGCTTC TTGGTCTTGC GGCTGTCATT
GCCCTTGCCG ATCTATCCTG GGCAGGCGTG CTGGTGGCGT GGTTTCTGGT TGGGATCGGC
TATTCAAGCA CACAAACGCC CTCGGGCCGT TTGCTGCGCC GCTCCGCCCA TGCCGAAGAC
CGCCCTGCAA TCTTTGCCGC GCAGTTCGCG CTATCCCACG CCTGCTGGCT GCTGACCTAT
CCGCTGTCAG GCTGGCTGAT GACGGGGCTT GGCCCATTGC CAGCACTGGT CGTGCTGGCA
GCCCTCTCCG GGCTCGGCAT CCTTGCCTGC CTGCGGCTCT GGCCCAAGAA CGCGCCCGAC
GTGATCCTCC ATGACCATGA GGACCTGCCG GCGGACCATC CCCACATGCG CCAATTTGGC
ACCACACCGC ATCAACACGC GATCATCATC GACGATCTGC ACCGTCACTG GCCAACGCAG
GGATAG
 
Protein sequence
MLSILADRTY RHLFLAQVVA LLGTGLSTVA LGLLAYDLAG NSAAMVLGTV FTLKMVAYVG 
IAPVAGAFAD SVNRRQLLVI LDLVRAGVAI ALPFVTEIWQ VYVLIFLLQS ASAAFTPTFQ
ATLPDVLPEE ERYTRALSLS RLAYDLENII SPTLAALLLT VMAYNTLFLG TAIGFLGSAL
LVVSVLLPSP KPSTRRGIYA RTTRGIRIYL ATPRLRGLLC LNLAVASAGA MVLVNSVVLI
RGELGLSESA LAWTLFAFGA GSMLASLALP RVLDTLKDRP VMIAGATLMV GALLGLAAVI
ALADLSWAGV LVAWFLVGIG YSSTQTPSGR LLRRSAHAED RPAIFAAQFA LSHACWLLTY
PLSGWLMTGL GPLPALVVLA ALSGLGILAC LRLWPKNAPD VILHDHEDLP ADHPHMRQFG
TTPHQHAIII DDLHRHWPTQ G