Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0611 |
Symbol | |
ID | 3706843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 656756 |
End bp | 658042 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637737119 |
Product | TPR repeat-containing protein |
Protein accession | YP_342660 |
Protein GI | 77164135 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000267471 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAGGTT TTTTTCAGTG CTTAAGAGGG CACGGCCGTA CGCTTGTGGG AATATGGGCT AGGCGGTTAC CGCTTTTTAT GCTAGCAAAG GTATTGCTCT TAGGGGGAGG ATTATTGGTG AATGCCCAAG CGGCTGAGCA GTACCTTCTG ACCCCTTCTA CCTATGAATC CCTGAGCGCT GTCCATAAGC TCATGGACAA GCAGCAGTAT ACATCTGCCC TTAAACAACT CACCGCACTA CAAGACGAGG TGAATGGTAA GGCTTACGAG CAGGCGGTTG TGCTCCAAAC CCTCGGCTAT GTGTATTCTT CTTTAGAAAA ATATCCCAAG GCGATCCAGG CATTTAAAGC TAGCCTAGCC CTAGATGCGC TGCCTGCCCG GGTCACTCAT GATTTGCGTT ATGGTCTGGC GCAGCTTTAC ATGGCTACGG AGCAGTATGG AAAAGCTCTC CAGTTGCTAG AGGCATGGTT TAAGGCTGCG GAATCTCCCC CGGCGGAAGC CCATGTATTG GCTGCTAGTG CCTACTACCA TCTGAAGCGG TATGCCGAAG TCATTCCTCA TATTGAGGTA GCCATTGAGC TTGCCCAAGC GCCGCAAGAG GAGTGGTATC AACTGCACCT TGCCGCCCGT TTAGAGTTGA AGCAATATTC CCAGGCGGCC CAAATATTAG AAACCTTGAT AGGCCACTTC CCTAACAAGG AGCAGTATTG GAAGCAGCTG GGAGCGGTGT ACATGGAGAT GAATAAAGAG CATCGGGCCT TAGCCGTGGA AGCGCTAGTA GCACATATGG AGCCTCTTGA TAGTAAAAGC CTCATTCACC TTGCTAATCT TTATCGTTAC CTCCATATTC CCTATAAAGC CGCGCAAGTT TTGCAGCAGG GTTTAAGGGA TGAAACTATT CAAACAAGCA GCAAGCATTG GGAATTTCTT GCCGATGCCT GGCTCGCCGC CCGGGAATGG GAACGCGCCG CTGCTGCTTT TAAGGAGGCA GGGCGGTTGA GGCAGGATGG CAAAATGGCC CTTCGCCGCG GTCAGGTTCT CATCGAGCTG CAAGACTGGA AGCAAGCAGA GAAAGCCTTT GCGCAAAGTT TGCGCAAAGG GGGACTGGAT GATCCTGGGC AAGCCCGTTT TTTCTTGAGT CAGGCGAGAT ATGAGCAGGG GCACTTTGCA GAAGCTATTC AGGCGTTAAA GTTAATTCAG GCTTCCTCAG CTTATAGCAA ACAGGCCGCC CAATGGTTAA AGCATTTACA GGTAGTCCGG AAGCAGGGAG CTGACGGCAA AGGTTAA
|
Protein sequence | MEGFFQCLRG HGRTLVGIWA RRLPLFMLAK VLLLGGGLLV NAQAAEQYLL TPSTYESLSA VHKLMDKQQY TSALKQLTAL QDEVNGKAYE QAVVLQTLGY VYSSLEKYPK AIQAFKASLA LDALPARVTH DLRYGLAQLY MATEQYGKAL QLLEAWFKAA ESPPAEAHVL AASAYYHLKR YAEVIPHIEV AIELAQAPQE EWYQLHLAAR LELKQYSQAA QILETLIGHF PNKEQYWKQL GAVYMEMNKE HRALAVEALV AHMEPLDSKS LIHLANLYRY LHIPYKAAQV LQQGLRDETI QTSSKHWEFL ADAWLAAREW ERAAAAFKEA GRLRQDGKMA LRRGQVLIEL QDWKQAEKAF AQSLRKGGLD DPGQARFFLS QARYEQGHFA EAIQALKLIQ ASSAYSKQAA QWLKHLQVVR KQGADGKG
|
| |