Gene Nmul_A2280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2280 
Symbol 
ID3785442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2592670 
End bp2593959 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content57% 
IMG OID637812368 
Productcytochrome c, class I 
Protein accessionYP_412964 
Protein GI82703398 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATACG GCTTCGTCAC AATCCTGATC GCCGGGTTTA CCGGCCTGAG CATTTTGCTG 
GTCCCCCCCG TTACGCTGTC ACCCGCAGCT GAAAAGCCGG AGGTAGCGGG GATACGCGAT
ACTGCCGAAA TCGAGGCGCA ACGAGCACGC GGTGCTTATC TGGCCCGCAT TGGGAACTGT
CTAGGTTGTC ACACGGCCTA TAGCGGACTC CCATACGCTG GAGGACACCT CCTCGATACT
TCGATCGGCG TATTCATCAC GCCTAATATC ACATCTGATA AAGAAACGGG TATCGGCCTC
TGGAGCGAGG AAGATTTCTG GCGGGCTCTC CATAATGGGA GGGGGCGCGA TGGAAATCTC
CTGTACCCGG CATTTCCGTA TTCGGAATAT ACCAAGGTAT CGCGCGAAGA TTCCGATGCC
ATCTTTGCCT ACCTTCAATC ACTTCCACCC GTGAGGCAGC GCAATGCGCC CAACCGCATC
AATTTCCCCT TCAACTGGCG TCCACTGCTG CAGGTCTGGC AGCTTATTTA TTTTTCTCCC
GGCATATATC TTCCCGATAC GCTGCAGGAT GACGAATGGA ACCGGGGGGC CTACCTCGTG
CAGGGGCTCG GGCATTGCAA CGCATGTCAT ACCCGGCGCA ACTTGTTGGG AATAAGCAAA
GGAGATATCC TGGGAGGAGG TCAGCTGATG GGTTCAAACT GGTATGCGCC ATCACTGACT
TCCCTGCAGG AAGCCAGTAC CGCGGATTGG CCGATCGAAG ACATTACGCG ATTGCTGAAA
ACCGGATCCG CTTCGCGGGC TGTAACTACC GGACCGATGG CGAATGTCGT CAGCCAGAGT
CTCCAGTTCC TGACAGAAGA CGACGCACGG GCGGTGGCGA AATATTTGAA GTCTTTGCCC
GAAACCGAGC CTCGCTCCCG TGGAACCGCC CCTCCTCTTA CCGAAGAAGT CGACAAGCAG
CTTAAAAAAG GGGGACAGAT TTACGAAACC TATTGCCAGG ACTGTCATGG AAATCTGGGG
GAAGGTGCCC CGGGAAGCTA TCCGGCGCTT GCCGGCAACC GTGGGGTGAC AATGGCATCC
CCGACCAACG CGATCCGCAG CGTTCTCAAT GGCGGATATG CTCCTGTCAC CGAGGTCCAG
CGGCGTCCCT ACGGAATGCC GCCATTTGCG CAAGTGCTAC CCGACAAGGA GATTGCACTG
GTGCTATCGT ATATCCGTAA CTCATGGGGC AACCGGGGAA GCCTCGTTAC CCCGGAACAG
GTGGACCGAA GCCGAAAAGG CGCACAGTAG
 
Protein sequence
MRYGFVTILI AGFTGLSILL VPPVTLSPAA EKPEVAGIRD TAEIEAQRAR GAYLARIGNC 
LGCHTAYSGL PYAGGHLLDT SIGVFITPNI TSDKETGIGL WSEEDFWRAL HNGRGRDGNL
LYPAFPYSEY TKVSREDSDA IFAYLQSLPP VRQRNAPNRI NFPFNWRPLL QVWQLIYFSP
GIYLPDTLQD DEWNRGAYLV QGLGHCNACH TRRNLLGISK GDILGGGQLM GSNWYAPSLT
SLQEASTADW PIEDITRLLK TGSASRAVTT GPMANVVSQS LQFLTEDDAR AVAKYLKSLP
ETEPRSRGTA PPLTEEVDKQ LKKGGQIYET YCQDCHGNLG EGAPGSYPAL AGNRGVTMAS
PTNAIRSVLN GGYAPVTEVQ RRPYGMPPFA QVLPDKEIAL VLSYIRNSWG NRGSLVTPEQ
VDRSRKGAQ