Gene Information |
Locus tag | Mext_1616 |
Symbol | |
ID | 5834414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1803147 |
End bp | 1804283 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641367414 |
Product | TonB family protein |
Protein accession | YP_001639086 |
Protein GI | 163851043 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain; [TIGR02794] TolA protein |
| |
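The coordinate and length fields in the Gene Information table are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1803147, 1804283)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1137 bp) and Protein Length (378 aa) rows. The strand field does not affect the calculation, since the length depends only on the span.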
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.37756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00492157 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGCTC CCAAGCCAGC TCCGAAGGCC GCCCCCATGC CGCCGATGCA GGCCCCGACA TCTGCCCCGA TGCCGGCTCC CGTGCCCGCT CATGCCGGGC AGGGGTTGCC CTCGGGCCCG TCCGAGGGAG GCGGCCAGGG GCGCCTCGCG GCCGCCTTCG CCCTGGCCCT GGCCCTGCAC GCCGCGGGGC TGATCGGCAT CACCTATCTG CATCTGACAC CGCCCGCGCC GCCGGGCGAG CAGGAGATCA CCATCGATCT CGCGCCGCAG ATGGCGGAGG CCGAGACGCA GGCCCCCGCC CAGACAGCGC AGTCCGAGGC GATCCCCGAG GAGGCCAAGC CCGAGGGCGA GCCGGAGACG GCCGAGCCGG TCGAGACCCC GGACGAGGTG AAGCCCCCGC CTCCCCCCGA GATGACGGAG GTGATGCCGG AGGAGGTGCA GCCGCCGCCT CCGCCGCCAG AAGCCGTCAC GGAGGTTCCG CCCGACACGC TGCCCCCGCC GCCGGAGGAG CAGATCATCG CCTCCGAGGC GCAGGAGGCG GAGCCGCTGG CGCCGCCCCC GCCCGTGGTG GCGAAGGTGC CGGAGCGGCC CAAGCCCGAT CCCAAGATCG AGGAGCGCCG CAAGGCCGCC CTGGAGAAGA AGCGCGAGGC CGAGCGCGAG GCACGCCGCC AGGAGATCCT CGAGAAGAAG CGCGAGGAGG CGCAGAAGGA AGCGCGGATC AAGGCCGCCA AGGCGAAGGC GGAGCGCGAT GCCGCCCGGC GCGCCCAGGC CGCGCAGGCG GGCAATGCGC AGCGCAACTC CGCCGCGACC TCGCGTCAGA GCGCGACGGG CACGGCCGCC GCGGCCAGCG ATCCCAACGC CATGGCCGCC TGGAAGGGCT CCATCGCCGC GACGATCCGT GGCCGGATGA ACCGTGAGGC CGCGGCCGGC ACCAGCGGCG GCGTCGCGAC CGTGCGCTTC ACCGTGAGCC GCTCCGGCGC GGTGAGCGGC GCGGCCGTGA CCGGCAGCAG CGGGGTGGGC GCCATCGACA GCGCCGCGCT CGCGGCGGTG CGCGGCGGCT TGCCGCCCGC CCCCGCCGGG GTGACGCAGC CGAGCCTCGC CGTCACCGTG CCGCTGCGCT TCAGCCCTGG GCGTTAG
|
Protein sequence | MAAPKPAPKA APMPPMQAPT SAPMPAPVPA HAGQGLPSGP SEGGGQGRLA AAFALALALH AAGLIGITYL HLTPPAPPGE QEITIDLAPQ MAEAETQAPA QTAQSEAIPE EAKPEGEPET AEPVETPDEV KPPPPPEMTE VMPEEVQPPP PPPEAVTEVP PDTLPPPPEE QIIASEAQEA EPLAPPPPVV AKVPERPKPD PKIEERRKAA LEKKREAERE ARRQEILEKK REEAQKEARI KAAKAKAERD AARRAQAAQA GNAQRNSAAT SRQSATGTAA AASDPNAMAA WKGSIAATIR GRMNREAAAG TSGGVATVRF TVSRSGAVSG AAVTGSSGVG AIDSAALAAV RGGLPPAPAG VTQPSLAVTV PLRFSPGR
|
| |
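The sequences above are displayed in blocks of ten characters separated by spaces, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_fraction`) strip the block spacing and compute the GC fraction of a nucleotide string. The example runs on only the first two blocks of the gene sequence, so its result describes that prefix and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and any line breaks, and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGGCCGCTC CCAAGCCAGC")
print(len(prefix), gc_fraction(prefix))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content row of the record.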