Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0185 |
Symbol | |
ID | 3706218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 204089 |
End bp | 205252 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637736702 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_342248 |
Protein GI | 77163723 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.340896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGAAT GGTTATTGTT GCTGTTACCT GTGGCAGCGG CTTCAGGCTG GCTAGCAGGT AAGCGCAGTG CAGAAACCGT AAATGCGGAT AGTCACTCCC AACTAAATTC CGCCTACTTT GCCGGCCTAA ATCATTTGTT AAATGAGCAG CCAGATAAGG CTATTGATAC CCTGCTTAAT GCTTTGAAAG TGGATAGCGA CACGGTAGAA CCCTATCTAG CATTAGGTAA TCTATTTCGC CGGCGCGGGG AGGTAGACCG GGCGATTCGG GTTCATCAAA ATCTTATTGA GAGGCCTTAT TTAAGCAGCT CGCAAAGAGG ACAAGCCCTT TTGGAATTGG GTTTGGATTA TATGCGCGCG GGAATGTTGG ATCGGGCTGA AAGCTCTTTT CTTGAGGTCC TTAAGCGAAG AAGCCACATA GGTATTACCC TGCGCCAGTT ACTTGATCTC TATCAGCAGG AAAAGAATTG GCATCAAGCT ATTGCTATGG CTCAAAAGCT GCACGAGGAG AGTGGCGAAG CGACGGAATC CATGATCGCT CATTTTTATT GTGAACTTGC GGAACAGCAT TGGGCCCAGA AAAAAGCTGT GGAAACGACC CGGTTTATCA AGCAGGCGCT GGCCTCGGAT TGGCGCTGTG TTCGAGCAAC TCTGCTCCAG TCCAGTTTAG CAATGGAGAA AGGGGATTAT AAGAGGGCTA TTCGCTGTTT GCGGCAGGTT GAGAGGCAAG ATCCAGACTA TTTGCCGGAG ATATTAAAGC CGCTCTCGGA ATGCTACCAG TACCTGGAGG GCCAAGATAA ATTCTTTTTC TGGCTAACTG AAGCATCAAA GCGCCATCCA GGATGTACTT CATTAGTTTT AGCCAGAGCG GCATACTTAC AGCAGCGGGG AGAACAGAAA GAGGCTCGCT ATTTCCTAAT CGAGCAACTT AGAGTGTATC CTTCCGTTGA GGCACTTCAG CAGTTGCTTG CTTTGGGAGT GCCAGAGGAT ATTGAGGCTG CTTCAGAGCC TTGGTCTTTA ATAGAAGAAG TGGCTAGCCG CCTGTTAAAA GCTAAATTAA ATTACGTTTG CGGTTTTTGC GGATTTGGCG GCAAGTATTG CTATTGGCAA TGCCCAGGTT GTAAACGCTG GGGGACCGTT AAGCCCTTGG CTGTAGGTAC TTAA
|
Protein sequence | MIEWLLLLLP VAAASGWLAG KRSAETVNAD SHSQLNSAYF AGLNHLLNEQ PDKAIDTLLN ALKVDSDTVE PYLALGNLFR RRGEVDRAIR VHQNLIERPY LSSSQRGQAL LELGLDYMRA GMLDRAESSF LEVLKRRSHI GITLRQLLDL YQQEKNWHQA IAMAQKLHEE SGEATESMIA HFYCELAEQH WAQKKAVETT RFIKQALASD WRCVRATLLQ SSLAMEKGDY KRAIRCLRQV ERQDPDYLPE ILKPLSECYQ YLEGQDKFFF WLTEASKRHP GCTSLVLARA AYLQQRGEQK EARYFLIEQL RVYPSVEALQ QLLALGVPED IEAASEPWSL IEEVASRLLK AKLNYVCGFC GFGGKYCYWQ CPGCKRWGTV KPLAVGT
|
| |