Gene M446_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1749 
Symbol 
ID6133514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1964356 
End bp1965426 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content77% 
IMG OID641642004 
Productcarbonic anhydrase 
Protein accessionYP_001768673 
Protein GI170740018 
COG category[R] General function prediction only 
COG ID[COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCG GGGCCCCGTG CGGGGCCGGC CCCGACGCGA GAGACCCCGT GCGCCCCGAC 
CTGCTGCTTC CCTACGACGG CACCGAGCCG GCCTTCGCCT CCCCGCCGGC CCGCTGCGGG
CGGCGCAGCA CGGTGATCGG CCGCACCAGC CTCGGCGCCG GGGCGTGGCT CGGCGACGCC
GCGGTGATCC GCGGCGACGG CCACGACGTC ATCGCGGGCG ACGGCCTCTG GCTCGGGCCG
CGGGCGACGC TCCACATCGC CCAGGACAAG TACCCCTGCA TCCTGGGCGA CCGGGTCACG
GTCGGGCGCA ACGCCGTCGT GCATGCCTGC ACGGTCGGGG ACGATTGCGT GGTCGAGGAC
GATTGCGTCG TGCTCGACGG CTCCGTGGTC GAGGACGGGG TGGTGATCGA GGCCGGCAGC
ACCGTCTATC CCCGCTCGAC CCTCGCGGCG GGCCTGCTCT ACGCGGGCTC GCCCGCCGCG
CCCCTGCGCC GGCTCGCCGC GGGCGAGGCC GCCGCCCGGG CCGCGCGCCT GCGCGCCGGC
CCCGCGGCGG CCGTGCCGGC GGCGGCCCCG GGCCATCCCG CGCCCGCCGT GTTCGTCGCC
CTGAGCGCGC GCCTCGCCGG GCCGGTCGAC CTCGCCCCGG GGGCCAGCAT CTTCTTCGGC
TGCGACCTCG ACGCGGCGGC CGGGCCGATC GCGGTCGGCC CGAACGCGAA CGTGCAGGAC
AACAGCGTCC TGCGCCCCCT CGGGGCCGGG CTGGTGATCG AGCGGGACAC CACCCTCGGC
CACAACGTGG TGGCGGCGGA CGGGCGGATC GGCCCGCGCA GCCTCGTCGG CATCGGGGCC
GTCCTCGCCC CCGGCACCGT CGTGGACGAG GACGTGCTGG TCGCGGCCGG AACCGTCACC
GCGGCGGGCC AAGTGCTGGA ATCCGGCTGG CTCTGGGGCG GGCGCCCGGC GCGCCGCCTC
GCGCCGCTCG ATGCGGGCAA GCGCGAGATG ATGCGCCGCA TCGTCGAGCA ATATGCCGGC
TATGGCCGCC GCTACCGCAA GGCCCAGATC GCACTTTCAC CGGAGGCCTG A
 
Protein sequence
MKRGAPCGAG PDARDPVRPD LLLPYDGTEP AFASPPARCG RRSTVIGRTS LGAGAWLGDA 
AVIRGDGHDV IAGDGLWLGP RATLHIAQDK YPCILGDRVT VGRNAVVHAC TVGDDCVVED
DCVVLDGSVV EDGVVIEAGS TVYPRSTLAA GLLYAGSPAA PLRRLAAGEA AARAARLRAG
PAAAVPAAAP GHPAPAVFVA LSARLAGPVD LAPGASIFFG CDLDAAAGPI AVGPNANVQD
NSVLRPLGAG LVIERDTTLG HNVVAADGRI GPRSLVGIGA VLAPGTVVDE DVLVAAGTVT
AAGQVLESGW LWGGRPARRL APLDAGKREM MRRIVEQYAG YGRRYRKAQI ALSPEA