Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA2449 |
Symbol | |
ID | 3102406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 2630263 |
End bp | 2631273 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637171592 |
Product | capsular polysaccharide biosynthesis protein I |
Protein accession | YP_114863 |
Protein GI | 53803405 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0862725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAC TGATCACCGG CACCGCCGGC TTCATCGGAT CGCACCTGGC CCACAAACTG CTGGACCGGG GTGATGAAAT CATCGGCATA GACAACGTCA ACGATTACTA CGACGTCAGC CTCAAAGAAG CACGCCTCGC CCGGCTCCAT GCCCGCCCCG GCTTCAGCGA GGCACGTATC GCCCTGGAGG AACGCGACAA GCTGTTCGCG ACGTTCGCCC GCCACCGTCC CGAACGTGTG GTGAACCTGG CCGCCCAGGC CGGCGTGCGC TATTCACTGG AAAACCCGCA TGCCTACGTC GACGCCAATC TGGTCGGCTT CTGCAACATC CTGGAAGCCT GCCGCCACTA TGAGGTGGAA CACCTGGTCT ATGCTTCCTC CAGTTCGGTC TACGGCGCCA ACACCGCCAT GCCGTTTTCG GTCCATCACA ACCTCGACCA TCCGGTCAGC CTTTATGCCG CGACCAAGAA GGCCAATGAA TTGATGGCCC ATACCTACAG CCATCTGTTC GGGCTTCCCA CCACGGGCCT TCGCTTCTTC ACAGTCTACG GCCCGTGGGG GCGGCCGGAC ATGGCCCTGT TCAAGTTCAC CCGCAACATC CTGGCGGGAC AGCCGATCGA CGTCTACAAC TACGGGCACC ACCGGCGGGA TTTCACCTAC ATCGACGACA TCGTGGAAGG GGTAGTACAG ACCCTGGACA AAGTGGCTGC GCCCGATCCG GCATGGCGTG GCGACCGGCC CGACCCCGGC ACCAGCCGGG CCCCCTACCG GCTGTACAAC ATCGGCAACA ACGAACCGGT CGAGCTTTTG CGCTTCATCG AGGTGCTCGA GCACTGTCTG GGATGCAAAG CCGAGATGAA CCTGCTGCCC ATGCAGGACG GCGACGTGCC CGACACCTAT GCCGACGTGG ACGATCTGAT GCGCGATACC GGTTACCGGC CGGCAACCCC GATCGAAACC GGCATCGCGC GCTTCGTCGA GTGGTACCGG GATTATTACG GCGTCCGCTG A
|
Protein sequence | MRILITGTAG FIGSHLAHKL LDRGDEIIGI DNVNDYYDVS LKEARLARLH ARPGFSEARI ALEERDKLFA TFARHRPERV VNLAAQAGVR YSLENPHAYV DANLVGFCNI LEACRHYEVE HLVYASSSSV YGANTAMPFS VHHNLDHPVS LYAATKKANE LMAHTYSHLF GLPTTGLRFF TVYGPWGRPD MALFKFTRNI LAGQPIDVYN YGHHRRDFTY IDDIVEGVVQ TLDKVAAPDP AWRGDRPDPG TSRAPYRLYN IGNNEPVELL RFIEVLEHCL GCKAEMNLLP MQDGDVPDTY ADVDDLMRDT GYRPATPIET GIARFVEWYR DYYGVR
|
| |