Gene TM1040_3266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3266 
Symbol 
ID4075408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp267890 
End bp268942 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content62% 
IMG OID638004775 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_611502 
Protein GI99078244 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.700035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.539349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACT ATCTCAGCAC CGTGGCGCTT GGCGCCGCCC TGGCCCTGGC CACGTCATCG 
GCCTTTGCCC AGACGGTTGG CCCCAAGGGC GAGGCCGCGA CACCAACGGC ATCGATCATG
GTCGAAGACG GAGACCTCGA GACGATCCGC AGCGGCAATC ACACCGCCGC GCTCCTTTGG
CATGACCAAA GCGATTTTGT AAATGCTGTG ACGGCAGGCG CGACCGATGA GCTGGCGCGC
GCCGGGATCG AAGTCGTCGC CACCGCAAGC GCCGGGTTCG ACGCCGCCAA ACAGCGAAGC
GATATCGAAA CCGCCCTGAG CAAAGACCCG AGCATCATCC TGTCACTCCC CCTCGATCCG
GTAACCTCTG CCGCCGCGTT TGAGGAAGCC AAGGAAAACG GCGTGAAGCT GGTGTTCCTG
TCCAACGTGC CCTCTGACTA TGAACACGCA AAGGATTACG CGGCGATTGT CACCGACGAT
CTGTTCCAGA TGGGCAAGCA GGCTGCAGAC GCGCTGGCCG CATCCATGGG CGGGGCGGGC
ACCGTGGGCT GGATCTACCA TGACGCCGAC TATTATGTGA CCAACCAGCG CGATAACGCC
TTCAAGACCA CCATTGAAAA CGACTACCCG GAAATTTCAA TCGTAGCAGA GCAAGGCATC
AGCGACCCTG CCCGCGCTGA GGACATTGCC AACGCGATGT TGCTGCGCAA CCCTGACATC
GGGGGCATCT ACGTGACCTG GGCTGGTCCC GCCGAGGGCG TTCTGGCCGC ACTTCGGGCA
AATGGCAATG ACACCACCAA AGTGGTGACG CTTGATCTCT CGGAGCCGGT GGCGCTCGAT
ATGGTTCGGG GCGGCAATGT TGCAGCCATC GTTGCCGACG AAGCCTATGA GCTAGGCCGC
GCCATGGCCG CAGCTGCGAT CCTTGATCTT TTGGGCAAGG ACGTCCCTCC ATTCGTGGTG
GCCCCCGCAG TGACCGTCAC CGCTGAGAAT GTGGCCGAAG GCTGGATGCG ATCCCTGCAC
ATCGACGCCC CAAAGAGCGT CACTGGCAAC TGA
 
Protein sequence
MKHYLSTVAL GAALALATSS AFAQTVGPKG EAATPTASIM VEDGDLETIR SGNHTAALLW 
HDQSDFVNAV TAGATDELAR AGIEVVATAS AGFDAAKQRS DIETALSKDP SIILSLPLDP
VTSAAAFEEA KENGVKLVFL SNVPSDYEHA KDYAAIVTDD LFQMGKQAAD ALAASMGGAG
TVGWIYHDAD YYVTNQRDNA FKTTIENDYP EISIVAEQGI SDPARAEDIA NAMLLRNPDI
GGIYVTWAGP AEGVLAALRA NGNDTTKVVT LDLSEPVALD MVRGGNVAAI VADEAYELGR
AMAAAAILDL LGKDVPPFVV APAVTVTAEN VAEGWMRSLH IDAPKSVTGN