Gene Nmul_A2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2203 
Symbol 
ID3786228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2502644 
End bp2503648 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content60% 
IMG OID637812290 
Productoxidoreductase-like 
Protein accessionYP_412887 
Protein GI82703321 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAATC TGATGAATGC ACCCGTGCCG CTATCCCCTG CCCTCGAATT AGCGTTGCCT 
GTTGATACTG CTGTCGTCGG CGTCGGATAT TTCGGCAGCT TGCATGCATT ATGCTACTCC
CGCCTGCCCG GAAGCCGCCT CCAGGCATTG ATCGACCCCG ATCCGATTAC GCAATTCCTG
GCTGAGCATC TAGGCGTCCC CTGGTTTCCA GGTGTTGCAG AACTTCCGTC CACAATCCGC
GCGGTATCGG TTGCCACACC GGTTGCCATG CATTTCGGGC TGACCAGATC GCTGCTCCAG
CAAGGGCTGG ACGTGCTGCT GGAAAAGCCG ATTGCGGAAA CCGCGGCACA GGCGACGGAA
CTGCGAATGG TAGCGGAGGC AAACCAATGC ATCCTGCAGA TCGGACATAT CGAACGCTTC
AATCCGGCCT ATACCGCTGG CGGAACGCTC CTCCCGTTCG CCCGGACCGT CCGCTCGGTG
CGGACCACAC GACATCCTCC GCGATCCAGC GCACTGGACG TGGTCATCGA CCTGATGATC
CATGATCTTG ACCTGATTCT GCATAGCCTG GATTCCTCCG TGGTGGAACT GCGCGCCTCC
GGTAGAAGCT GCGGTTTAAC AGCCATAGAT GAAGCAGAAG TTGAACTGAC TTTCTCCAAT
GGGTGCCGGG TATACCTGGA TGCACACTGG GGGCGAGATA CGGAGCAGGA CGCGCGCTGC
ATGGTTGCGG AACTGGAAAA CGATGAAACC TGGGTCATCG ATTTCAGGCG CCGGATGACC
TATCGCAAGG AACCCGGCAG CTCGACGGGT CCTTTGCCTG CGGACGGCCA CACGCTTCCG
TTTCCCATGC AACGGATACA GGAAGATACG TTGAGCCTGC AACTCGCGGC CTTCCTCGAT
GCCTGCCGCA ACCGTTCACT ACCCCGGGTT ACGCCGGCGG AAGGCAATGC TGCCCTGGAG
CTGGCGCACT GCATCCGGCA GCAGATACTC AGGCCCTGCC CGTGA
 
Protein sequence
MMNLMNAPVP LSPALELALP VDTAVVGVGY FGSLHALCYS RLPGSRLQAL IDPDPITQFL 
AEHLGVPWFP GVAELPSTIR AVSVATPVAM HFGLTRSLLQ QGLDVLLEKP IAETAAQATE
LRMVAEANQC ILQIGHIERF NPAYTAGGTL LPFARTVRSV RTTRHPPRSS ALDVVIDLMI
HDLDLILHSL DSSVVELRAS GRSCGLTAID EAEVELTFSN GCRVYLDAHW GRDTEQDARC
MVAELENDET WVIDFRRRMT YRKEPGSSTG PLPADGHTLP FPMQRIQEDT LSLQLAAFLD
ACRNRSLPRV TPAEGNAALE LAHCIRQQIL RPCP