Gene Nmul_A0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0456 
Symbol 
ID3786003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp506361 
End bp507752 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content57% 
IMG OID637810532 
Productcytochrome c, class I 
Protein accessionYP_411156 
Protein GI82701590 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGCC TGGATCTGAA GCACATGAAG TCGCGCAGAA CGGTTTCCGA CTGGATCATC 
CGGATGGGCA AAGAAACTCG AATTCTTCTG AGTGCTGTCC TGCTTGCCAT GGGTGCAATC
GTGACGGACA CTCAGGTGTC CATTGCGACT GCCGCTGCGG GTCATGCCGC AGAGGCTGCC
GGGAAGGATG AGAGCGCATT CGATGCACTG GTTACACGCG GCGCCTATCT CGCCACTATC
GCAGACTGCA CCGGATGTCA CACGGCAGGA GCCGCGCATC CGGCTTTTGG GGGCGGGCTG
GCGATCAATT CACCCTTCGG TACCATTTAT TCGACCAATA TTACGCCTGA CCCGGAAACC
GGCATCGGCC GCTACAGCTA TGAGGATTTC AGCCGGGCTG TGCGCGGCGG CATCAGGAAG
GACGGTAAAT GGTTGTACCC AGCCATGCCT TACCCGTCAT TCACGGCAAT GTCGGATGAA
GATGCGCGCG CGTTATATGC CTACTTCATG CACGGGGTGA AACCGGTGCG GCACACGCCC
TCGCAGACTC GCCTGCCATT CCCCTTCAAT CAGCGCTGGA CCCTCAAGTT CTGGGATCTT
GCCTTTGTTG AACACAAGCG GTTCGAACCC CGGGCTGATC GGGGTGAAAA ATGGAATCGC
GGTGCGTACC TGGTGCAGGC GCTCGGTCAC TGCGGTGCCT GTCATACCCC CCGCGGTTTG
GCCTATCAGG AAAAGGCCTA TTCGGAAACC TCATCCGACT ATTTGACCGG TGCCCTGATA
GATAACTGGT TTGCTCCGGA GTTGACCGGA AACACCGCAG CGGGTCTGGG CAGATGGACG
GAAGAGGAGA TTGCGGCGTT TCTCAAGACT GGCCATAGTG GCCATACCGC TGCATTTGGC
AGCATGGTCG CTGTCATTGA AAACAGCACC CAGTACTTGC GCCAGGAGGA TCTCGACGCG
GTCGCCCATT ACCTCAAGTC GCTTCCTGCC ACCGGAGAAA AGTCATCGTA TAACCCTCGC
AGAACGCCAG CTGTCATTCC CGCCGGAATG AGCCGATCCG AGAAACCCGG CGCGGGAATA
TATAGTGGCT TCTGCGCCAA ATGTCATCGG AACGAGGGAA GTGGCAAGCC GCCAAAGTTT
CCTCCCCTAT CCGGAAGTTC GATCGTGCTT TCGGAGAATG CCTCTTCCCT GATCCGGCTC
ATGCTGGAAG GAGGAAAAGG CCCGCGGACA AAGACGGGAC CGAAACCGGA GAAAATGCCA
GGCTATGCCG AAAGATTTAC TGACCGGGAA ATCGCTGAAG TGCTTACGTT CATCCGTAAC
AGCTGGGGCA ACAAGGCTCC GCCTGTGACC ACTCGCGATG TGTATTTGAT GCGTGAAGCG
CTGAACGAAT AA
 
Protein sequence
MQRLDLKHMK SRRTVSDWII RMGKETRILL SAVLLAMGAI VTDTQVSIAT AAAGHAAEAA 
GKDESAFDAL VTRGAYLATI ADCTGCHTAG AAHPAFGGGL AINSPFGTIY STNITPDPET
GIGRYSYEDF SRAVRGGIRK DGKWLYPAMP YPSFTAMSDE DARALYAYFM HGVKPVRHTP
SQTRLPFPFN QRWTLKFWDL AFVEHKRFEP RADRGEKWNR GAYLVQALGH CGACHTPRGL
AYQEKAYSET SSDYLTGALI DNWFAPELTG NTAAGLGRWT EEEIAAFLKT GHSGHTAAFG
SMVAVIENST QYLRQEDLDA VAHYLKSLPA TGEKSSYNPR RTPAVIPAGM SRSEKPGAGI
YSGFCAKCHR NEGSGKPPKF PPLSGSSIVL SENASSLIRL MLEGGKGPRT KTGPKPEKMP
GYAERFTDRE IAEVLTFIRN SWGNKAPPVT TRDVYLMREA LNE