Gene Nmul_A0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0459 
Symbol 
ID3786006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp508790 
End bp510805 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content57% 
IMG OID637810535 
Productcytochrome-c oxidase 
Protein accessionYP_411159 
Protein GI82701593 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCTA TCGACGCCAC CCGGCATCGC ATCGACCCCC CTCTCGAACG GATCCCGGAA 
GTGGGACGTA TTCCCGCGGG GTCCCCAATC GAAGCGCGTC TGGAGAAGAT CTGGGAGACG
GCGCCAGGTT GGCGCGGATG GCTCTCCACA GTAGACCATA AAACGATCGG AGTACGATAT
CTCGTCACGG CCTTTCTTTT CCTCGTGATC GGCGGAATCG AAGCGCTGTT CCTGCGCATC
CAGCTTGCCG GGCCGGATAT GGCGTTTCTG ACACCGGAGC AATACAACCA GCTTTTCACG
ATGCATGGCG TGACCATGAT TTTCCTCTAT GCATTGCCGG TTCTGTCCGG TTTCTCCAAC
TACCTCTGGC CGCTTTTGCT CGGTTCCAGG GATATGGCCT TTCCGCGCCT CAATGCACTC
TCCTACTGGA TTTTTCTGTT CGCCGGCCTC TTCATGTACA TCAGCTTCCC TCTGGGGCAG
GCGCCGAATG CGGGCTGGTT CAATTATGTG CCTTTCTCGG GCCCGGTATA CAACACCGGA
CCCAATATCG ACGTCTTTGC GCTCGGCATG GTGTTGCTCG GAATTTCCAC AACGGTAGGG
TCGGTTAATT TCGTGGTTAC GCTGTTCAGG ATGCGGGCGC CGGGCATGAC CATAAACCGC
GTTCCCATCC TGGTGTGGGG AACGCTCACC GCGTCCGTGG CCAACCTGTT CGCAATCCCC
GCCGTCAGCC TCGCGTTCTT TCTGTTGTGG CTTGATCGCC AGATAGGCGC GCATTTTTTT
GATGTCGCAA ATAACGGTCA GCCATTGCTT TGGCAGCACT TGTTCTGGAT TTTCGGCCAT
CCCTGGGTCT ATGTCGTCGT GCTTCCGGCA ATGGGCATCG TTTCCGATGC CCTGCCTACA
TTCTGTCGCC GGCCGCTCGT GGGTTACACG GCAGTGGCGC TAGCTACCAT GGCAACCATG
CTGCTCGGTT TCGGGGTGTG GGTCCATCAT ATGTTCGCCA CCGGTCTGCC GACAGTGGCT
CTGTCGATTT TCGGCGCCGC CAGCATGGTC ATCTCCATTC CCAGCGCGGT CGCCGTCTTC
GCCTGGATTG CGACCATCTG GCTGGGAAAA CCCGTATTCA AGACCCCTTT TCTGTTTTTT
GCCGGTTTCG TCGCGCTGTT CATCATCGGC GGCATGTCGG GAGTAATGAC AGCCGCGGTG
CCGCTCGACT GGCAGCTTAA CGAAACCTAC TTCATCGTGG CGCACCTGCA TTATGTCCTG
CTTGGGATCA ACGTGTTTCC GGTGGTAGGA GCGATCTATT ACTGGTTTCC CAAATTCACC
GGGCGCATGA TGAGTGAAAA ACTGGGCAAG TGGAGTTTCT GGACCATGTT TATCGGATTC
AACGCCGGAT TCTTTCCAAT GCATATCGCT GGACTGCTCG GGATGCCGCG CCGCATTTAT
ACCTATCCTT CCCGCATGGG ATGGGACGAA GTCAACCTGA TTACGACTGT CGGATCCTTC
ATATTCGCCG TTGGCGTGCT GATCTTCCTC GTCAATGTCG TCCTCAGCCT GAAACGTGGA
ACCCGGGCGG GCGCCAATCC GTGGGATGCG CCGACCCTGG AATGGGCAGT ATCTTCCCCG
CCGCCCGCTT ATAACTTTGC GACTATTCCT ACGATCGCCA GCCGGCACCC GTTATGGGAG
GGGCGGATCG AAGGAGAAAA AGAACATACC CGGACTAGAT TGGAAGAAGG TTACCTGCTG
ATGCGGGGCC GGGAAACACT CGGCACCTCG CCCATCGATG CAAAGCCGGT TGTCATCCTC
AAAATGCCGG AGGATTCCTA TACCCCTTTC CTTGTCGGTT TGTTCGTCTC GTTGCTATTC
GTCGGCCTGT TGCTGCATTC ATGGACATTC ACTGCATTGA TGGCGGCCTT AAGCTGTGCC
GCGTTGAGTG TCTGGATGTG GCCGCGCAGG AACCTGGGTC AGCGTACTGC CAGGCACGAC
CCAGCCTCTG CCCGAAGAGG AGGAAAAGGC CCGTGA
 
Protein sequence
MAAIDATRHR IDPPLERIPE VGRIPAGSPI EARLEKIWET APGWRGWLST VDHKTIGVRY 
LVTAFLFLVI GGIEALFLRI QLAGPDMAFL TPEQYNQLFT MHGVTMIFLY ALPVLSGFSN
YLWPLLLGSR DMAFPRLNAL SYWIFLFAGL FMYISFPLGQ APNAGWFNYV PFSGPVYNTG
PNIDVFALGM VLLGISTTVG SVNFVVTLFR MRAPGMTINR VPILVWGTLT ASVANLFAIP
AVSLAFFLLW LDRQIGAHFF DVANNGQPLL WQHLFWIFGH PWVYVVVLPA MGIVSDALPT
FCRRPLVGYT AVALATMATM LLGFGVWVHH MFATGLPTVA LSIFGAASMV ISIPSAVAVF
AWIATIWLGK PVFKTPFLFF AGFVALFIIG GMSGVMTAAV PLDWQLNETY FIVAHLHYVL
LGINVFPVVG AIYYWFPKFT GRMMSEKLGK WSFWTMFIGF NAGFFPMHIA GLLGMPRRIY
TYPSRMGWDE VNLITTVGSF IFAVGVLIFL VNVVLSLKRG TRAGANPWDA PTLEWAVSSP
PPAYNFATIP TIASRHPLWE GRIEGEKEHT RTRLEEGYLL MRGRETLGTS PIDAKPVVIL
KMPEDSYTPF LVGLFVSLLF VGLLLHSWTF TALMAALSCA ALSVWMWPRR NLGQRTARHD
PASARRGGKG P