Gene Information |
Locus tag | Noc_2973 |
Symbol | |
ID | 3707355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3358923 |
End bp | 3360395 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637739447 |
Product | hypothetical protein |
Protein accession | YP_344945 |
Protein GI | 77166420 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
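The coordinate and length fields in the table above are arithmetically linked, and a short check makes that relationship explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3358923
END_BP = 3360395
GENE_LENGTH_BP = 1473
PROTEIN_LENGTH_AA = 490


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 3360395 - 3358923 + 1 gives 1473 bp, and 1473 / 3 - 1 gives 490 aa, matching the table.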
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000455039 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGAAGA AGGAAAGCTT AGTGGGAGTT GGGTTAGCCG CGCTCACACT GGCTGTAAGC TTACCCTTGG GAGCGACGGA AATGCCCCAA ACCATGGAAG AAATGTGGCG GATCATTCAG CAGCAACAAC AAGAAATTGA AGCGCTTAAA GCGAAATCCC AACCCCTAGA AACTGAGAAA GCTCATCCGG AAATTTCTAA AGAAGTACCC GAGAAAACTA AAGAAGCCAC AAAAACAACT ACCTCTACTT CTGAGGAGAA TGCAGAGACA AAAGCTCAGG TTAAGGAACT GGAACACAAA ACGGGTGTGC TCGCTGAAGC GGTGGAAAGT CTGCGGACCG CCATGCATAT TCCAGAAGAA TTCGAATATA AAAGTATGTA TGGTCTAGGG CCGGCGGCTT CCAAGGTTTA TCAAGTCGGT AAAGGACTAT CTATTGGTGG TTATGGTGAA GGTCGCTATC AAACTTTTGT GAATGGAGAT GGGGACGATA ATGCCGATTT TGCCCGGCTA GTACTTTATA CTGGGTATAA GTTCACCGAC CGGATCATCT TTAACAGTGA GATTGAGTTT GAGCATGGGA CTACCGGCGA AGGGGCTGAG GAGAAGGGCG AAGTTTCCGT CGAGTTTGCA GCGCTTGATT TCTTTCTTGA TCCGAGAGTT AATATTCGTG CCGGTTTGGT GTTGATGCCC ATGGGGTTTA TCAACCTCAT CCATGAACCG CCTTTCTTTT TCGGAAATAA CCGTCCTGAG GTTGAGCGGC GAATTATTCC CAGCACCTGG CGCGAGATTG GCGTGGGCCT TTTTGGCGAG CTGCTGCCAG GGTTAACCTA TACCATGTAC GGAGTGAATG GACTGAACGC TGAAGAATTC AGCTCCAGGG GTATTCGCGA TGGTCGCCAA AGTGGCAGTA AAGCTTTAGC GGAAGATTTA GCTTTTGTGG GCCGCATGGA TTATGCGCCT CCCGGAATGC CTGGACTTTC CTTTGGGGGC TCCGCCTATG CGGGTAACTC TGGCCAAGAT CAAAGCTATG GGGGGCAAGA TCTGGATGTC TTTACTCAGC TCTATGAGGG CCACCTCCAG TGGCAATACC GAGGCTGGTG GTTACGGGCT CTGGGGGCCT GGGGGCATAT CGGTGATGCC GAAGCGCTTA GTGCCGCCAA GGGGGAAACC ATCGGCGAGA GCAATTTTGG TTGGTACACG GAGCTGGCTT ATAACTTGTT ACCGTTAGTG TGGCCGGAAA CCATCCAGTA TCTGGCCCCT TTCTTCCGTT TTGAGCAACT GAATACTATT GCCAGCGCTC CGGCGGGATT TTCGGATAAA GGCGGTATCA ATCAGGATAT CTACCAGGTA GGTATCAACT ATAAACCTAT TCCCAATGTG GTTATTAAGG CGGATTATCG TAACTTCGTA GGTAGAGATG GCAACCCTTC TGCTGCCGAT GAGTTTAATC TGGGGCTTGG GTTTATCTTT TAA
|
Protein sequence | MRKKESLVGV GLAALTLAVS LPLGATEMPQ TMEEMWRIIQ QQQQEIEALK AKSQPLETEK AHPEISKEVP EKTKEATKTT TSTSEENAET KAQVKELEHK TGVLAEAVES LRTAMHIPEE FEYKSMYGLG PAASKVYQVG KGLSIGGYGE GRYQTFVNGD GDDNADFARL VLYTGYKFTD RIIFNSEIEF EHGTTGEGAE EKGEVSVEFA ALDFFLDPRV NIRAGLVLMP MGFINLIHEP PFFFGNNRPE VERRIIPSTW REIGVGLFGE LLPGLTYTMY GVNGLNAEEF SSRGIRDGRQ SGSKALAEDL AFVGRMDYAP PGMPGLSFGG SAYAGNSGQD QSYGGQDLDV FTQLYEGHLQ WQYRGWWLRA LGAWGHIGDA EALSAAKGET IGESNFGWYT ELAYNLLPLV WPETIQYLAP FFRFEQLNTI ASAPAGFSDK GGINQDIYQV GINYKPIPNV VIKADYRNFV GRDGNPSAAD EFNLGLGFIF
|
| |
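The gene sequence begins with GTG while the protein sequence begins with M, which is what one would expect when GTG serves as an initiation codon under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes that the amino-acid assignments of table 11 match the standard genetic code, and it simplifies start handling by writing the first residue as M without checking that the first codon is a permitted start. The names `BASES`, `AMINO_ACIDS`, and `translate` are hypothetical.

```python
# Codon table in the compact TCAG ordering: the index of a codon is
# 16 * (first base) + 4 * (second base) + (third base), with T=0, C=1, A=2, G=3.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna, start_as_met=True):
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Spaces (as used in the record's 10-base blocks) are ignored.
    If start_as_met is True, the first residue is reported as M
    (a simplification: the first codon is not validated as a start).
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    if start_as_met and protein:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "GTGAGGAAGA AGGAAAGCTT AGTGGGAGTT"
print(translate(prefix))  # MRKKESLVGV
```

The output matches the first ten residues of the protein sequence in the record. Without the start-codon substitution, the leading GTG would read as V.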