Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0456 |
Symbol | |
ID | 3786003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 506361 |
End bp | 507752 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810532 |
Product | cytochrome c, class I |
Protein accession | YP_411156 |
Protein GI | 82701590 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGCC TGGATCTGAA GCACATGAAG TCGCGCAGAA CGGTTTCCGA CTGGATCATC CGGATGGGCA AAGAAACTCG AATTCTTCTG AGTGCTGTCC TGCTTGCCAT GGGTGCAATC GTGACGGACA CTCAGGTGTC CATTGCGACT GCCGCTGCGG GTCATGCCGC AGAGGCTGCC GGGAAGGATG AGAGCGCATT CGATGCACTG GTTACACGCG GCGCCTATCT CGCCACTATC GCAGACTGCA CCGGATGTCA CACGGCAGGA GCCGCGCATC CGGCTTTTGG GGGCGGGCTG GCGATCAATT CACCCTTCGG TACCATTTAT TCGACCAATA TTACGCCTGA CCCGGAAACC GGCATCGGCC GCTACAGCTA TGAGGATTTC AGCCGGGCTG TGCGCGGCGG CATCAGGAAG GACGGTAAAT GGTTGTACCC AGCCATGCCT TACCCGTCAT TCACGGCAAT GTCGGATGAA GATGCGCGCG CGTTATATGC CTACTTCATG CACGGGGTGA AACCGGTGCG GCACACGCCC TCGCAGACTC GCCTGCCATT CCCCTTCAAT CAGCGCTGGA CCCTCAAGTT CTGGGATCTT GCCTTTGTTG AACACAAGCG GTTCGAACCC CGGGCTGATC GGGGTGAAAA ATGGAATCGC GGTGCGTACC TGGTGCAGGC GCTCGGTCAC TGCGGTGCCT GTCATACCCC CCGCGGTTTG GCCTATCAGG AAAAGGCCTA TTCGGAAACC TCATCCGACT ATTTGACCGG TGCCCTGATA GATAACTGGT TTGCTCCGGA GTTGACCGGA AACACCGCAG CGGGTCTGGG CAGATGGACG GAAGAGGAGA TTGCGGCGTT TCTCAAGACT GGCCATAGTG GCCATACCGC TGCATTTGGC AGCATGGTCG CTGTCATTGA AAACAGCACC CAGTACTTGC GCCAGGAGGA TCTCGACGCG GTCGCCCATT ACCTCAAGTC GCTTCCTGCC ACCGGAGAAA AGTCATCGTA TAACCCTCGC AGAACGCCAG CTGTCATTCC CGCCGGAATG AGCCGATCCG AGAAACCCGG CGCGGGAATA TATAGTGGCT TCTGCGCCAA ATGTCATCGG AACGAGGGAA GTGGCAAGCC GCCAAAGTTT CCTCCCCTAT CCGGAAGTTC GATCGTGCTT TCGGAGAATG CCTCTTCCCT GATCCGGCTC ATGCTGGAAG GAGGAAAAGG CCCGCGGACA AAGACGGGAC CGAAACCGGA GAAAATGCCA GGCTATGCCG AAAGATTTAC TGACCGGGAA ATCGCTGAAG TGCTTACGTT CATCCGTAAC AGCTGGGGCA ACAAGGCTCC GCCTGTGACC ACTCGCGATG TGTATTTGAT GCGTGAAGCG CTGAACGAAT AA
|
Protein sequence | MQRLDLKHMK SRRTVSDWII RMGKETRILL SAVLLAMGAI VTDTQVSIAT AAAGHAAEAA GKDESAFDAL VTRGAYLATI ADCTGCHTAG AAHPAFGGGL AINSPFGTIY STNITPDPET GIGRYSYEDF SRAVRGGIRK DGKWLYPAMP YPSFTAMSDE DARALYAYFM HGVKPVRHTP SQTRLPFPFN QRWTLKFWDL AFVEHKRFEP RADRGEKWNR GAYLVQALGH CGACHTPRGL AYQEKAYSET SSDYLTGALI DNWFAPELTG NTAAGLGRWT EEEIAAFLKT GHSGHTAAFG SMVAVIENST QYLRQEDLDA VAHYLKSLPA TGEKSSYNPR RTPAVIPAGM SRSEKPGAGI YSGFCAKCHR NEGSGKPPKF PPLSGSSIVL SENASSLIRL MLEGGKGPRT KTGPKPEKMP GYAERFTDRE IAEVLTFIRN SWGNKAPPVT TRDVYLMREA LNE
|
| |