Gene MCA2449 details


Gene Information

Locus tag: MCA2449
Symbol:
ID: 3102406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2630263
End bp: 2631273
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 62%
IMG OID: 637171592
Product: capsular polysaccharide biosynthesis protein I
Protein accession: YP_114863
Protein GI: 53803405
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
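
The length fields above agree with each other if the coordinates are read as 1-based and inclusive and the final stop codon is not counted in the protein length. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
# Values copied from the Gene Information fields above.
START_BP = 2630263
END_BP = 2631273


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa, assuming the last codon is a stop codon that is not counted."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```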


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0862725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATAC TGATCACCGG CACCGCCGGC TTCATCGGAT CGCACCTGGC CCACAAACTG 
CTGGACCGGG GTGATGAAAT CATCGGCATA GACAACGTCA ACGATTACTA CGACGTCAGC
CTCAAAGAAG CACGCCTCGC CCGGCTCCAT GCCCGCCCCG GCTTCAGCGA GGCACGTATC
GCCCTGGAGG AACGCGACAA GCTGTTCGCG ACGTTCGCCC GCCACCGTCC CGAACGTGTG
GTGAACCTGG CCGCCCAGGC CGGCGTGCGC TATTCACTGG AAAACCCGCA TGCCTACGTC
GACGCCAATC TGGTCGGCTT CTGCAACATC CTGGAAGCCT GCCGCCACTA TGAGGTGGAA
CACCTGGTCT ATGCTTCCTC CAGTTCGGTC TACGGCGCCA ACACCGCCAT GCCGTTTTCG
GTCCATCACA ACCTCGACCA TCCGGTCAGC CTTTATGCCG CGACCAAGAA GGCCAATGAA
TTGATGGCCC ATACCTACAG CCATCTGTTC GGGCTTCCCA CCACGGGCCT TCGCTTCTTC
ACAGTCTACG GCCCGTGGGG GCGGCCGGAC ATGGCCCTGT TCAAGTTCAC CCGCAACATC
CTGGCGGGAC AGCCGATCGA CGTCTACAAC TACGGGCACC ACCGGCGGGA TTTCACCTAC
ATCGACGACA TCGTGGAAGG GGTAGTACAG ACCCTGGACA AAGTGGCTGC GCCCGATCCG
GCATGGCGTG GCGACCGGCC CGACCCCGGC ACCAGCCGGG CCCCCTACCG GCTGTACAAC
ATCGGCAACA ACGAACCGGT CGAGCTTTTG CGCTTCATCG AGGTGCTCGA GCACTGTCTG
GGATGCAAAG CCGAGATGAA CCTGCTGCCC ATGCAGGACG GCGACGTGCC CGACACCTAT
GCCGACGTGG ACGATCTGAT GCGCGATACC GGTTACCGGC CGGCAACCCC GATCGAAACC
GGCATCGCGC GCTTCGTCGA GTGGTACCGG GATTATTACG GCGTCCGCTG A
 
Protein sequence
MRILITGTAG FIGSHLAHKL LDRGDEIIGI DNVNDYYDVS LKEARLARLH ARPGFSEARI 
ALEERDKLFA TFARHRPERV VNLAAQAGVR YSLENPHAYV DANLVGFCNI LEACRHYEVE
HLVYASSSSV YGANTAMPFS VHHNLDHPVS LYAATKKANE LMAHTYSHLF GLPTTGLRFF
TVYGPWGRPD MALFKFTRNI LAGQPIDVYN YGHHRRDFTY IDDIVEGVVQ TLDKVAAPDP
AWRGDRPDPG TSRAPYRLYN IGNNEPVELL RFIEVLEHCL GCKAEMNLLP MQDGDVPDTY
ADVDDLMRDT GYRPATPIET GIARFVEWYR DYYGVR
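
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, not the database's own method. The function names are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code; it does not handle alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence above.
print(translate("ATGAGAATAC TGATCACCGG"))  # MRILIT
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, and passing it to `gc_fraction` and comparing with the GC content field, checks the record for internal consistency.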