Gene Nmul_A0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0461 
Symbol 
ID3786008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp512247 
End bp513188 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content52% 
IMG OID637810537 
Productcarbonate dehydratase 
Protein accessionYP_411161 
Protein GI82701595 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3338] Carbonic anhydrase 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.454779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA CATCTCGATC CCTGAGGGTG GCCGTCCTTG CCGCGGTCCT TTTCTTGCTG 
GGCTTCAATA CCGCATATGC GAGTACGCTG GCGAGCAATC TGGAATTGAT GGAAGCACAA
AGTCCGATCG ATATTCGCTC GAATAGCACC TATTACGGAA ATTTGCCCAA GTTGAACTTC
AACCTCAATT CCGATACCGC GCTTACCGTG ATCAATAACG GCTCCCCTGA TCACGAAAGT
ACTATCAGGG CTAATGTCAG TCCCGGCGGA GGAACCTTGA TGTTGTCAGG ACATCAATGG
AACCTTGCTC AATTCCACTT TCACACCCCC TCGGAACATT TGATAAACGG TCGAGCCAGT
CCCATGGAAA TGCACCTCGT CTTCAGCGAT GCTGCGAACA ATCTACTCGT GGTCGGCCGG
GATATCGAGC AAGGTCTCTT CAAGAACCAG GCACTCGCTC CCATTTTCTC CGATTTGCCG
AAAACTACTG AGGAAACACT GAATATCGAG CACTTCAACC TGAACAATCT TCTGCCGGAT
TATCTCGGTT CTTTCCGCTA CTCCGGTTCT CTGACGACGC CGCCTTTTAC AGAAGGAGTA
AGCTGGGTTG AACTGGCTTC TCCGCTATAT CTATCCGGGA GCCAGATCAA TGCCTTCAAG
TCCCTGTTTC CGGAAGGCAA TTCGCGCGAG ATTCAGGATT TGAACGGTCG CATCGTGCTT
ACCGACGTGC CGGGCTTCGT CAGCATCCAT GATGACTCCG ATCCCAATCT CCTGGGCACA
CTGATCCCTG GCCTGGAAGC AAGCGTTTCT GTCACGGCCG ACTTATCCAA ACTCGCGACG
AGCGTTCCCG AACCGTCATC CTATGGCATG CTCCTCGCCG GGCTCGCGGT AATCAGTTTT
ATTGGCCTCA AGCGTGGGTC AAGACTCGCT GGAGCAACCT GA
 
Protein sequence
MNETSRSLRV AVLAAVLFLL GFNTAYASTL ASNLELMEAQ SPIDIRSNST YYGNLPKLNF 
NLNSDTALTV INNGSPDHES TIRANVSPGG GTLMLSGHQW NLAQFHFHTP SEHLINGRAS
PMEMHLVFSD AANNLLVVGR DIEQGLFKNQ ALAPIFSDLP KTTEETLNIE HFNLNNLLPD
YLGSFRYSGS LTTPPFTEGV SWVELASPLY LSGSQINAFK SLFPEGNSRE IQDLNGRIVL
TDVPGFVSIH DDSDPNLLGT LIPGLEASVS VTADLSKLAT SVPEPSSYGM LLAGLAVISF
IGLKRGSRLA GAT