Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1764 |
Symbol | |
ID | 3704781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1985057 |
End bp | 1986205 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637738247 |
Product | cytochrome c, class I |
Protein accession | YP_343766 |
Protein GI | 77165241 |
COG category | [C] Energy production and conversion |
COG ID | [COG2863] Cytochrome c553 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000218169 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCGAT TGCGTACCGT TATTGCTCGC CTGAAAAGCC TCGATTGGCG TCGGCAATGG CCCATAATCG CTGTTTTTCT GGTAGTGCTT GCCCTGGGAG GATTGCTGAT AGCGGTTGCC GGCATTATTC CTATCAAGGC CAGCTCCGGC CACTGGGCCA TTACCGATTT TCTGCTTCAC TTTGCCATGG AGCGCTCCGT GGCGACCCAT AGCTTGAATA TCCAGGCGCC GGCGCTGGCC AAGACTCACC TGATATTGCG GGGGGCGGGT CATTATGAAA CCGATTGCCG CTTCTGCCAT GGCAGTCCAC GCTTGCACCA TCCTCCCATT GCCCAGCAGA TGACCCCCCC ACCGCCTTAT TTGCCTGAAA CCGTTTCTAA ATGGGAGGAC CGGGAACTAT TTTATATCAT CAAGCACGGC GTTAAATTCA CCGGGATGCC CGCTTGGCCA GCCCTCCAGC GGGACGATGA GGTGTGGGCC ATGGTGGCGT TCCTGCGGGT ATTGCCAGAA ATGGATGGAG AGGAATACGA AAAACTGGTT TGGGGGGAGA TTCCGGCTTC CCCGGAGGCA GAAGAGGCGC CGCCGGCGGT GGCGAAAAGT TGCGGCCGTT GTCACGGCGT GGAGGGCCTG GGGCGCGGAG TGGGGGCGTT TCCCCGTCTG GCGGGGCAAA ATGCGGCCTA TCTGTATGCC TCCCTGGAGG CTTACGGGCA AGGAGAACGC CATAGCGGGA TGATGGAACC TATCGCCGCG GAATTGAATC CAGAAGAAAT AAAAGCCCTT GCCCGTTACT ACGGCGGGTT AACCCCCTAT CAGCCGCCCT TGCCCCAACC TGGGGAAAGC GCGGCCATTG AGCGGGGTGA GGCCATTGCC AAGGAGGGTA ACCAGCGGAT CGCCTCCTGC GCCGATTGCC ACGGTCCTAG CGGTATTCCC CGCAATCCCG ACTACCCCCT CCTGGCCGGT CAGTACGCCG AGTATATTAT TTTGCAGCTT GAACTTTTGG CGGAGGAAAA GCGGGGCGGA ACCCCCTACA TTCACCTCAT GGACCCCATC GCCCACCGGT TGAGCCAGCA GCAAAGGGAG GATGTGGCCC GGTATTATGC TTCCCTGCCC CCCTCCGGTT CGGTTGATCC GGCGGAGAGG GCACCTTAG
|
Protein sequence | MGRLRTVIAR LKSLDWRRQW PIIAVFLVVL ALGGLLIAVA GIIPIKASSG HWAITDFLLH FAMERSVATH SLNIQAPALA KTHLILRGAG HYETDCRFCH GSPRLHHPPI AQQMTPPPPY LPETVSKWED RELFYIIKHG VKFTGMPAWP ALQRDDEVWA MVAFLRVLPE MDGEEYEKLV WGEIPASPEA EEAPPAVAKS CGRCHGVEGL GRGVGAFPRL AGQNAAYLYA SLEAYGQGER HSGMMEPIAA ELNPEEIKAL ARYYGGLTPY QPPLPQPGES AAIERGEAIA KEGNQRIASC ADCHGPSGIP RNPDYPLLAG QYAEYIILQL ELLAEEKRGG TPYIHLMDPI AHRLSQQQRE DVARYYASLP PSGSVDPAER AP
|
| |