Gene TM1040_3532 details


Gene Information

Locus tag: TM1040_3532
Symbol:
ID: 4075211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 567761
End bp: 568780
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 58%
IMG OID: 638005047
Product: LacI family transcription regulator
Protein accession: YP_611766
Protein GI: 99078508
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
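
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 567761
end_bp = 568780

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1020 0 339
```

Under these assumptions the result matches the listed Gene Length (1020 bp) and Protein Length (339 aa).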


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.895947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACACC GTTTTCCAAT CAAGGAGATC GCGCGGCAGG CGGGTCTTGG CACCGCCACT 
GTCGATCGGG TCCTGAATGA CCGGGCGCAT GTGAGCCCGC AGACAAAGCT GCGTGTTACC
GCTGCAATAA AAGAGTTGAA GGCCCAAGAG GCCCAGCTTG CTGCTCATGG AAGGCGATTG
TTCTTTGACT TCGTCGTTGA AGCGCCATCA CGCTTCAGCC TTGAAGTGAA GGCGGCCGCA
GAAGCAGTAC TCCCTCAGAT CGGAACCGCT GTTTGCCGCC CTCGATTTCT GCTGCAGGAG
ATCATGGAAG AGGATGAGGT CGTCGGGGCA CTGAAACGGA TCATGAAGCG AGGTAGTCAG
GGCGTGTGTC TAAAGGCGCG GGACACGGCG CGGATTAGGG AAGCAGCGAA GACGCTGACC
GCCGCAAAAA TCCCCGTGGT CACGCTGGTC ACCGACATCG GGGGTACTGA TCGTCTTGCC
TACGTCGGGT TAGACAACGC CGGTGCAGGA CGCACTGCAG CCTACCTTAT CTCCCGAGCG
CTTGGGGATG TGCAGGGAAT GGTCTTGGCC ACGCGCAGCC ATGAACGCTT TCTAGGAGAA
GAAGAGCGCG AGTTCGCATT TGTCGAAACC TTGGCACGCG AGCGTCCAGG TCTACAGGTA
TTTGCTGTCC AGGGCGGTAG TGGAGTGGAC TTTGAAACGT CAAAGCTCTT AACGAAGTCC
ATGGTTGGCA TTCACCATCT GCGCGCGGTC TATTCGATGG GGGGTGGCAA CCTATCGATC
CTACGCACGC TGGAGCACAA AGGTCTGAGC CCCGATGTGT ACGTGGCCCA TGATCTTGAT
CGGGAAAACA GGGAGCTGAT CCAGGACCGG CGCATCGACT TCATCCTGCA TCACGATTTG
CAGCTGGACG TACGGAACAC GTTCAACGCC TTTCTATCCT ATCATGGGCT GTCCAGTGGT
CTTGTGGGGG CGCCGATCTC CACGGTCCAG GTGCTGACAC CGGAGAATAT ACCGCGTTGA
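
The GC content field in this record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming the sequence is pasted in the 10-base block layout used here (whitespace is ignored) and that GC content means the percentage of G and C bases. The function name `gc_percent` is hypothetical, and only the first two blocks of the sequence are used as a demonstration.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two 10-base blocks of the gene sequence above (illustration only).
print(gc_percent("ATGACACACC GTTTTCCAAT"))  # prints: 40.0
```

To compare against the record's reported value, pass the complete gene sequence to the function instead of the short excerpt.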
 
Protein sequence
MTHRFPIKEI ARQAGLGTAT VDRVLNDRAH VSPQTKLRVT AAIKELKAQE AQLAAHGRRL 
FFDFVVEAPS RFSLEVKAAA EAVLPQIGTA VCRPRFLLQE IMEEDEVVGA LKRIMKRGSQ
GVCLKARDTA RIREAAKTLT AAKIPVVTLV TDIGGTDRLA YVGLDNAGAG RTAAYLISRA
LGDVQGMVLA TRSHERFLGE EEREFAFVET LARERPGLQV FAVQGGSGVD FETSKLLTKS
MVGIHHLRAV YSMGGGNLSI LRTLEHKGLS PDVYVAHDLD RENRELIQDR RIDFILHHDL
QLDVRNTFNA FLSYHGLSSG LVGAPISTVQ VLTPENIPR
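
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of NCBI table 11 (the same assignments as the standard code; as far as I know the two tables differ only in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI ordering: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACACACC GTTTTCCAAT CAAGGAGATC"))  # prints: MTHRFPIKEI
```

The output matches the first ten residues of the protein sequence listed above; the full gene sequence can be passed in the same way.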