Gene Mchl_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_1898 
Symbol 
ID7116713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp1957771 
End bp1958871 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content76% 
IMG OID643524662 
ProductTonB family protein 
Protein accessionYP_002420689 
Protein GI218529873 
COG category 
COG ID 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain
[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGA TGCAGGCCCC GACATCTGCC CCGATGCCGG CTCCCGTGCC CGCTTACGCT 
GGGCAGGGGT TGCCCTCGGG CCCGTCCGAG GGAGGCGGCC AGGGGCGCCT CGCGGCCGCC
TTCGCCCTGG CCCTGGCCCT GCACGCCGCG GGGCTGATCG GCATCACCTA TCTGCATCTG
ACACCGCCCG CGCCGCCGGG CGAGCAGGAG ATCACCATCG ATCTCGCGCC GCAGATGGCG
GAGGCCGAGA CGCAGGCCCC CGCCCAGACA GCGCAGTCCG AGGCGATCCC CGAGGAGGCC
AAGCCCGAGG GCGAGCCGGA GACGGCCGAG CCGGTCGAGA TCCCGGACGA GGTGAAGCCC
CCGCCTCCCC CCGAGATGAC GGAGGTGATG CCGGAGGAGG TGCAGCCGCC GCCTCCGCCG
CCGGAAGCCG TCACGGAAGT TCCGCCCGAC ACGCTGCCCC CGCCGCCCGA GGAGCAGATC
ATCGCCTCCG AGGCGCAGGA GGCGGAGCCG CTGGCGCCGC CCCCGCCCGT GGTGGCGAAG
GTGCCGGAGC GGCCCAAGCC CGATCCCAAG ATCGAGGAGC GCCGCAAGGC CGCCCTGGAG
AAGAAGCGCG AGGCCGAGCG CGAGGCACGC CGCCAGGAGA TCCTCGAGAA GAAGCGCGAG
GAGGCGCAGA AGGAAGCGCG GATCAAGGCC GCCAAGGCGA AGGCGGAGCG CGATGCCGCC
CGGCGTGCCC AGGCCGCGCA GGCGGGCAAT GCGCAGCGCA ACTCCGCCGC CACCTCGCGT
CAGAGCGCGA CGGGCACGGC CGCCGCGGCC AGCGATCCCA ACGCCATGGC CGCCTGGAAG
GGCTCCATCG CCGCGACGAT CCGCGGCCGG ATGAACCGCG AGGCCGCGGC CGGCACCAGC
GGCGGCGTCG CGACCGTGCG CTTCACCGTG AGCCGCTCCG GCGCGGTGAG CGGTGCGGCC
GTGACCGGCA GCAGCGGGGT CGGCGCCATC GACAGCGCCG CGCTCGCGGC GGTGCGCGGC
GGCCTGCCGC CCGCCCCCGC CGGGGTGACG CAGCCGAGCC TCGCCGTCAC CGTGCCGCTG
CGCTTCAGCC CTGGGCGTTA G
 
Protein sequence
MPPMQAPTSA PMPAPVPAYA GQGLPSGPSE GGGQGRLAAA FALALALHAA GLIGITYLHL 
TPPAPPGEQE ITIDLAPQMA EAETQAPAQT AQSEAIPEEA KPEGEPETAE PVEIPDEVKP
PPPPEMTEVM PEEVQPPPPP PEAVTEVPPD TLPPPPEEQI IASEAQEAEP LAPPPPVVAK
VPERPKPDPK IEERRKAALE KKREAEREAR RQEILEKKRE EAQKEARIKA AKAKAERDAA
RRAQAAQAGN AQRNSAATSR QSATGTAAAA SDPNAMAAWK GSIAATIRGR MNREAAAGTS
GGVATVRFTV SRSGAVSGAA VTGSSGVGAI DSAALAAVRG GLPPAPAGVT QPSLAVTVPL
RFSPGR