Gene Information |
Locus tag | Mchl_2934 |
Symbol | |
ID | 7115741 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 3086413 |
End bp | 3087633 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643525684 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_002421701 |
Protein GI | 218530885 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
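The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3086413
END_BP = 3087633


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1221 406
```

Under these assumptions the computed values agree with the listed gene length (1221 bp) and protein length (406 aa).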
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00608688 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTGAGA CCCCCTCTCG CCTCCCCGTC ACCGTGCTGT CGGGCTTCCT CGGTGCCGGG AAGACGACTT TGCTCAATCA CGTGCTCAAC AATCGCGAGG GTCGGCGCGT CGCGGTGATC GTCAACGACA TGAGCGAGGT GAATATCGAT GCCGACCTCG TCCGTGGCGG TGGGGCGGAT CTGTCGCGCA CGGACGAGCG GCTCGTGGAG ATGACCAACG GCTGCATCTG CTGCACCCTG CGCGACGACC TGCTGGCCGA GGTTCGCCGA CTCGCCGCCG AGGGCCGCTT CGACTACTTG CTGATCGAAG GCACGGGCAT CGCCGAGCCG CTGCCGGTGG CGAGCACCTT CTCCTTCCGC GACGAGGCCG GCGAAGCGCT CAGCGACGTG GCGCGGCTCG ACACGATGGT GACCGTGGTC GATGCCGTGA ACCTCCTGAA GGATTACGGC TCGAACGACT TTCTGCGCCA GCGCGGCGAG ACCGCGGGCG CCGACGACAC CCGCACCCTG GTCGATCTGC TGGTGGAGCA GATCGAGTTT GCCGACGTCG TCGTGATCAA CAAGGCGCTC GACGTCTCGC CGAATCATCT CGACCTCGTG CGCTCGGTAG TGCGGGGGCT CAACGCGGAT GCGCGCATCG TCGAGGCGTC TCACGGCCAA GTGCCGCCGG ACGCCATCCT CGATACCGGG CTGTTCGACG AAGAAAAGGC GCAGGAGCAC CCGCTCTGGT TCAAGGAACT CTACGGCGCC CACGAGCATG TGCCGGAGAC CGAGGAATAC GGCATCGGCT CCTTCGTCTA CCGGGCGCGC CGGCCGCTGG ATCCGCAGAA GTTCCAGGCG TTCGCCAACA CGACCTGGCC GGGGCTGATC CGGGCGAAGG GCCATTTCTG GCTCGCGACC CGGCCGGAAT GGGTCGGTGA ATTCTCGCTG GCCGGTGCCG TCGCGCATAT CGGCGCGATG GGGTTCTGGT GGGCAGCGGT GCCGCGCCAG CGCTGGCCGG AGGAAGAGGT TTTCCGCGAG CGCCTTCACA CGGTCTGGAG CGAGGTCTGG GGGGACCGCC GCCAGGAACT CGTCTTCATC GGCCGGGGCA TGGACCGGGA CGCGATCACC GATGCGCTCG ATGCCTGCCT CGTCGGTCCG GCGGAGGTCC GGCGCTTCGA TGCGGAGGCC TATCGCGACC TGCCCGATCC TTTCCCGGCC TGGCGCAGGG CGGCGGCCTA G
|
Protein sequence | MSETPSRLPV TVLSGFLGAG KTTLLNHVLN NREGRRVAVI VNDMSEVNID ADLVRGGGAD LSRTDERLVE MTNGCICCTL RDDLLAEVRR LAAEGRFDYL LIEGTGIAEP LPVASTFSFR DEAGEALSDV ARLDTMVTVV DAVNLLKDYG SNDFLRQRGE TAGADDTRTL VDLLVEQIEF ADVVVINKAL DVSPNHLDLV RSVVRGLNAD ARIVEASHGQ VPPDAILDTG LFDEEKAQEH PLWFKELYGA HEHVPETEEY GIGSFVYRAR RPLDPQKFQA FANTTWPGLI RAKGHFWLAT RPEWVGEFSL AGAVAHIGAM GFWWAAVPRQ RWPEEEVFRE RLHTVWSEVW GDRRQELVFI GRGMDRDAIT DALDACLVGP AEVRRFDAEA YRDLPDPFPA WRRAAA
|
| |
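As a rough illustration of how the sequence fields relate to the table entries, the following sketch strips the 10-base block spacing, computes a GC fraction, and translates a coding sequence. It assumes the gene sequence is listed 5' to 3' in the coding orientation (it begins with ATG even though the strand is "-"). Translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons; this sketch does not handle alternative start codons, which does not matter for a sequence starting with ATG. All function names are hypothetical.

```python
# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "ATGTCTGAGA CCCCCTCTCG CCTCCCCGTC"
    print(translate(prefix))  # prints: MSETPSRLPV
```

Applying `translate` to the full gene sequence and comparing the result with the listed protein sequence, or comparing `gc_fraction` with the listed GC content, is a simple consistency check on a record of this kind.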