Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1204 |
Symbol | |
ID | 3786135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1390187 |
End bp | 1391257 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637811289 |
Product | cytochrome oxidase assembly |
Protein accession | YP_411899 |
Protein GI | 82702333 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1612] Uncharacterized protein required for cytochrome oxidase assembly |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTTG TCCAAACTTC AACACTTCCG CAAAAAACCG ATCAAAAGTT TGCTGTGCAG AAACCGATTG CTATCTGGTT GTTCGTTTGC TGCGCTCTGG TATTCGCCAT GGTGGTTGTA GGCGGTGTAA CCCGTCTTAC CGATTCAGGG CTTTCCATCG TCGAATGGCA ACCCCTCGTT GGCACGGTTC CTCCACTCAG CCAGAATGAC TGGGATGAGC TCTTCGAGAA ATATCACCAG ACGCCTCAGT ATAAAAAGGT AAATCTTGGC ATGAGCCTGG AAGAGTTCAA GACAATCTTC TGGTGGGAAT ATTTCCACCG CTTATTGGGG CGCGTCATCG GGTTGGCATT TTTCATACCA TTCCTGTATT TTCTGATGAA AAAGGCAGTC GACCGGCCAC TGGGACTGAA GTTGTCAGGA ATTTTCCTGC TGGGGGCTTT GCAGGGTGGG ATGGGATGGT ACATGGTAAA GAGCGGGTTG GTGGATAACC CCCACGTCAG CCAGTATCGT CTGACTGCGC ACTTGGGTCT CGCTTTCGCG ATTTATGCCG CAATGTTCTG GGTAGCCCTC GATCTGCTCA ATCCCGGCCG CGGTTTGTCC GCAAACAGCG GACTGCGTGG TTTGCTCAAT TTCTCCACCA TGCTGTCTGC CCTGGTATTC ATAATGGTTT TATCGGGCGG GTTCGTGGCA GGCATTCGGG CAGGTCTGGC TTACAATACT TTTCCACTCA TGGATGGCCA CTTCATCCCC CCGGAACTAT TCATGCTGGA ACCCTGGTAC CGGAATTTCT TCGACAATAT GACCACTGTG CAATTCGACC ATCGCCTGAT TGCATGGACA CTGGCAATTC TCGTTCCGAT TTTCTGGCTC AAATCGAGAG CAGTGCCACT TTCAGGCTCG GCTCGTCTTG CATGCACTCT ACTGTTGATC ATGCTGGCAG TGCAGATCAC TCTGGGGATT TCCACGCTGC TGCTGGTTGT TCCTCTAACC CTCGCGGCAG CACATCAGGC AGGCGCACTA CTGTTGTTTA CCGCTGCCCT TTGGGTGAAT CATGAGCTAC GGCGCCAATA G
|
Protein sequence | MQFVQTSTLP QKTDQKFAVQ KPIAIWLFVC CALVFAMVVV GGVTRLTDSG LSIVEWQPLV GTVPPLSQND WDELFEKYHQ TPQYKKVNLG MSLEEFKTIF WWEYFHRLLG RVIGLAFFIP FLYFLMKKAV DRPLGLKLSG IFLLGALQGG MGWYMVKSGL VDNPHVSQYR LTAHLGLAFA IYAAMFWVAL DLLNPGRGLS ANSGLRGLLN FSTMLSALVF IMVLSGGFVA GIRAGLAYNT FPLMDGHFIP PELFMLEPWY RNFFDNMTTV QFDHRLIAWT LAILVPIFWL KSRAVPLSGS ARLACTLLLI MLAVQITLGI STLLLVVPLT LAAAHQAGAL LLFTAALWVN HELRRQ
|
| |