Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1749 |
Symbol | |
ID | 6133514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1964356 |
End bp | 1965426 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641642004 |
Product | carbonic anhydrase |
Protein accession | YP_001768673 |
Protein GI | 170740018 |
COG category | [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCG GGGCCCCGTG CGGGGCCGGC CCCGACGCGA GAGACCCCGT GCGCCCCGAC CTGCTGCTTC CCTACGACGG CACCGAGCCG GCCTTCGCCT CCCCGCCGGC CCGCTGCGGG CGGCGCAGCA CGGTGATCGG CCGCACCAGC CTCGGCGCCG GGGCGTGGCT CGGCGACGCC GCGGTGATCC GCGGCGACGG CCACGACGTC ATCGCGGGCG ACGGCCTCTG GCTCGGGCCG CGGGCGACGC TCCACATCGC CCAGGACAAG TACCCCTGCA TCCTGGGCGA CCGGGTCACG GTCGGGCGCA ACGCCGTCGT GCATGCCTGC ACGGTCGGGG ACGATTGCGT GGTCGAGGAC GATTGCGTCG TGCTCGACGG CTCCGTGGTC GAGGACGGGG TGGTGATCGA GGCCGGCAGC ACCGTCTATC CCCGCTCGAC CCTCGCGGCG GGCCTGCTCT ACGCGGGCTC GCCCGCCGCG CCCCTGCGCC GGCTCGCCGC GGGCGAGGCC GCCGCCCGGG CCGCGCGCCT GCGCGCCGGC CCCGCGGCGG CCGTGCCGGC GGCGGCCCCG GGCCATCCCG CGCCCGCCGT GTTCGTCGCC CTGAGCGCGC GCCTCGCCGG GCCGGTCGAC CTCGCCCCGG GGGCCAGCAT CTTCTTCGGC TGCGACCTCG ACGCGGCGGC CGGGCCGATC GCGGTCGGCC CGAACGCGAA CGTGCAGGAC AACAGCGTCC TGCGCCCCCT CGGGGCCGGG CTGGTGATCG AGCGGGACAC CACCCTCGGC CACAACGTGG TGGCGGCGGA CGGGCGGATC GGCCCGCGCA GCCTCGTCGG CATCGGGGCC GTCCTCGCCC CCGGCACCGT CGTGGACGAG GACGTGCTGG TCGCGGCCGG AACCGTCACC GCGGCGGGCC AAGTGCTGGA ATCCGGCTGG CTCTGGGGCG GGCGCCCGGC GCGCCGCCTC GCGCCGCTCG ATGCGGGCAA GCGCGAGATG ATGCGCCGCA TCGTCGAGCA ATATGCCGGC TATGGCCGCC GCTACCGCAA GGCCCAGATC GCACTTTCAC CGGAGGCCTG A
|
Protein sequence | MKRGAPCGAG PDARDPVRPD LLLPYDGTEP AFASPPARCG RRSTVIGRTS LGAGAWLGDA AVIRGDGHDV IAGDGLWLGP RATLHIAQDK YPCILGDRVT VGRNAVVHAC TVGDDCVVED DCVVLDGSVV EDGVVIEAGS TVYPRSTLAA GLLYAGSPAA PLRRLAAGEA AARAARLRAG PAAAVPAAAP GHPAPAVFVA LSARLAGPVD LAPGASIFFG CDLDAAAGPI AVGPNANVQD NSVLRPLGAG LVIERDTTLG HNVVAADGRI GPRSLVGIGA VLAPGTVVDE DVLVAAGTVT AAGQVLESGW LWGGRPARRL APLDAGKREM MRRIVEQYAG YGRRYRKAQI ALSPEA
|
| |