Gene TM1040_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3343 
Symbol 
ID4075242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp354019 
End bp354969 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content59% 
IMG OID638004851 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_611577 
Protein GI99078319 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.31055e-10 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.275562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCAT TGACAAAGAA ATTTGCGCTG GCGGCGGTTG TTGCCGCAGC GCCCCTGATG 
GTGGCCACGA CGGTCTCAGC CGAAGGGGAG AAATACATCC TGGTCAGCCA CGCGCCTGAC
AGCGACAGCT GGTGGAATAC GATCAAGAAC GGCATCGCAC TGGCCGGCGA GCAGATGGAC
GTCGAAGTGG AATACCGCAA CCCGCCTACG GGCGATTTAG CCGATATGGC GCGGATCATC
GAGCAGGCGG CCGCATCAGG CCCGAATGGG ATTATCACCA CGCTGGCAGA TTATGACGTG
TTGTCCGGTC CCATCAAAAC GGCGGTCGAC AGTGGCGTGG ACGTGATCAT CATGAACACG
GGCACCCCGG ATCAGGCACG CTCTGTCGGG GCTTTGATGT ATGTGGGGCA GCCGGAGTAC
GACGCGGGCT ACGCTGCGGG TCTGCGCGCC AAAGGCGACG GTGTCGAGAG CTTCCTATGT
GTGAACCACT ATATTGTACA ACCTTCCTCG CAGGATCGTT GCCAGGGGTT CGCGGATGGT
CTCGGGGTGG AGCTCGGTAC GCAAATGATT GACGCGGGTC AGGACCCGGC TGAAATCAAG
AACCGTGTGC TGGCCTATCT TTCGGCCAAT CCCGACACCG ACGCTGTTTT GACCCTGGGG
CCGACCAGTG CCGACCCAAC GCTCCTGGCG CTTGAAGAAA ACGGCATGGC GGGTGACATC
TACTTCGGCA CTTTCGATCT GGGCGGCGAA ATCGTCAAAG GCATCCAATC CGGTGTCATC
CAGTGGGGTA TTGACCAGCA GCCTTTCCTC CAGGCTTACC TGCCGGTTGT GGTCATGACG
AATTACCACC GCTATGGCGT CCTGCCCGGC AACAACATCA ACTCTGGTCC CGGCTTTGTA
ACCGCCGATG GTCTCGAGAA GATCGAGCAG TTTGCAGGCG AGTATCGCTG A
 
Protein sequence
MTSLTKKFAL AAVVAAAPLM VATTVSAEGE KYILVSHAPD SDSWWNTIKN GIALAGEQMD 
VEVEYRNPPT GDLADMARII EQAAASGPNG IITTLADYDV LSGPIKTAVD SGVDVIIMNT
GTPDQARSVG ALMYVGQPEY DAGYAAGLRA KGDGVESFLC VNHYIVQPSS QDRCQGFADG
LGVELGTQMI DAGQDPAEIK NRVLAYLSAN PDTDAVLTLG PTSADPTLLA LEENGMAGDI
YFGTFDLGGE IVKGIQSGVI QWGIDQQPFL QAYLPVVVMT NYHRYGVLPG NNINSGPGFV
TADGLEKIEQ FAGEYR