Gene Information |
Locus tag | Nmul_A0461 |
Symbol | |
ID | 3786008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 512247 |
End bp | 513188 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637810537 |
Product | carbonate dehydratase |
Protein accession | YP_411161 |
Protein GI | 82701595 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3338] Carbonic anhydrase |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.454779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAA CATCTCGATC CCTGAGGGTG GCCGTCCTTG CCGCGGTCCT TTTCTTGCTG GGCTTCAATA CCGCATATGC GAGTACGCTG GCGAGCAATC TGGAATTGAT GGAAGCACAA AGTCCGATCG ATATTCGCTC GAATAGCACC TATTACGGAA ATTTGCCCAA GTTGAACTTC AACCTCAATT CCGATACCGC GCTTACCGTG ATCAATAACG GCTCCCCTGA TCACGAAAGT ACTATCAGGG CTAATGTCAG TCCCGGCGGA GGAACCTTGA TGTTGTCAGG ACATCAATGG AACCTTGCTC AATTCCACTT TCACACCCCC TCGGAACATT TGATAAACGG TCGAGCCAGT CCCATGGAAA TGCACCTCGT CTTCAGCGAT GCTGCGAACA ATCTACTCGT GGTCGGCCGG GATATCGAGC AAGGTCTCTT CAAGAACCAG GCACTCGCTC CCATTTTCTC CGATTTGCCG AAAACTACTG AGGAAACACT GAATATCGAG CACTTCAACC TGAACAATCT TCTGCCGGAT TATCTCGGTT CTTTCCGCTA CTCCGGTTCT CTGACGACGC CGCCTTTTAC AGAAGGAGTA AGCTGGGTTG AACTGGCTTC TCCGCTATAT CTATCCGGGA GCCAGATCAA TGCCTTCAAG TCCCTGTTTC CGGAAGGCAA TTCGCGCGAG ATTCAGGATT TGAACGGTCG CATCGTGCTT ACCGACGTGC CGGGCTTCGT CAGCATCCAT GATGACTCCG ATCCCAATCT CCTGGGCACA CTGATCCCTG GCCTGGAAGC AAGCGTTTCT GTCACGGCCG ACTTATCCAA ACTCGCGACG AGCGTTCCCG AACCGTCATC CTATGGCATG CTCCTCGCCG GGCTCGCGGT AATCAGTTTT ATTGGCCTCA AGCGTGGGTC AAGACTCGCT GGAGCAACCT GA
Protein sequence | MNETSRSLRV AVLAAVLFLL GFNTAYASTL ASNLELMEAQ SPIDIRSNST YYGNLPKLNF NLNSDTALTV INNGSPDHES TIRANVSPGG GTLMLSGHQW NLAQFHFHTP SEHLINGRAS PMEMHLVFSD AANNLLVVGR DIEQGLFKNQ ALAPIFSDLP KTTEETLNIE HFNLNNLLPD YLGSFRYSGS LTTPPFTEGV SWVELASPLY LSGSQINAFK SLFPEGNSRE IQDLNGRIVL TDVPGFVSIH DDSDPNLLGT LIPGLEASVS VTADLSKLAT SVPEPSSYGM LLAGLAVISF IGLKRGSRLA GAT
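
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp / End bp fields of this record.
length_bp = gene_length(512247, 513188)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the Gene Length (942 bp) and Protein Length (313 aa) fields.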