Gene TM1040_2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2366 
Symbol 
ID4076485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2488085 
End bp2489236 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content65% 
IMG OID638007688 
ProductTonB domain-containing protein 
Protein accessionYP_614360 
Protein GI99082206 
COG category 
COG ID 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.858133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAACCG GCACAAAGAT CTCTCTCGCG GGGCATGGCC TCCTGATCAC CTGGGCGCTG 
GCTGGCGGGT GGCTCAATCC TGAGCCGCTA CCCTTCCAGA GCCAGGATGT GTCGATCATT
TCGGCAGAAG AGTTTGCACG ACTGAGCGAG CAATATGCTG CTCCCGAGGT CTCTGACGCC
CCTGAGGCGC CACAGGAACC GGCGATCACC GAGACGACCC CGTCGATCCC CGAGCGCCCG
AGCCAGACGC CCGAGACCCC CGCAACACCT GCGCCGATTG AAACGCCGGT CGCAGACACG
GCGGCAGAGC CTCCCGCGCC AGAGCCTGAT CCCGAGGTGC CAAGCCGTGT GCCTGATGCA
CCAGAGCTGC CCGAAATCGT GACGGACCTG CCGCAAACCG CAGAGCCTCA GACTGCTCAG
CAGACGGACC GTGTCGCGCC CGAGCCGGTG CGCGCGCCCG AGCCGGACAC ACAGATTGGC
GATGTGGTGC AGGAAGAGGT TGCACCCGAT GCCGAGGCAG AGGCCAATCA GCCCGTGCAG
GAAGCCACGG CACCCGAGGC CGCAAGTGAC CGTATCGTCA CCGAGGCCGA AGCAGATGAT
GCCGAGCCCA CAGCGCCTCT GGCGAGTGTT CGTCCGCGCG CGCGCCCGGC GCGACCCGCT
CCGGAACCAG AGCCTGACCC TGAACCACAG ACCCAAACCG CCGCCCGCGA CGAAGAGCCC
GCCCCGGATG CGGAGCAAAA CCCCGAGGTC GAGCGTTCCG CTGTCAACGA TGCGCTTGCA
GAAGCGCTGA GTGGCGCGGG GGAGGTGGAG GCGCCCGAAC CCTCCGGCCC GCCGATGACA
CGAGGAGAAA AAGATGCCCT TCGGATTGCG GTGTCGCAAT GCTGGAACCT CGGCTCTTTG
TCGAGCGAGG CGCTGCTTAC GGAGGTTGTC GTGGGGGTCA CCCTCTCGCG CGAGGGCGTG
CCGGATATTG GTTCCATTCG CCTGCTGTCG AGCACTGGTG GGTCCGACAG CTCTGCCAAA
CAGGCCTATG AGGCCGCGCG CCGTGCAATC ATCCGCTGTG GATCAAGAGG ATTTGATCTT
CCGGCTGACA AATACGCTCA GTGGCGTGAT ATTGAGATGA CATTTAACCC CGAAGGAATG
CGGATCAGAT GA
 
Protein sequence
MQTGTKISLA GHGLLITWAL AGGWLNPEPL PFQSQDVSII SAEEFARLSE QYAAPEVSDA 
PEAPQEPAIT ETTPSIPERP SQTPETPATP APIETPVADT AAEPPAPEPD PEVPSRVPDA
PELPEIVTDL PQTAEPQTAQ QTDRVAPEPV RAPEPDTQIG DVVQEEVAPD AEAEANQPVQ
EATAPEAASD RIVTEAEADD AEPTAPLASV RPRARPARPA PEPEPDPEPQ TQTAARDEEP
APDAEQNPEV ERSAVNDALA EALSGAGEVE APEPSGPPMT RGEKDALRIA VSQCWNLGSL
SSEALLTEVV VGVTLSREGV PDIGSIRLLS STGGSDSSAK QAYEAARRAI IRCGSRGFDL
PADKYAQWRD IEMTFNPEGM RIR