Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1984 |
Symbol | |
ID | 3704868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2279740 |
End bp | 2281341 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637738460 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_343976 |
Protein GI | 77165451 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGAGC TGTGGTTACA GCTATATAGT TATTGGTGCG GGATTTGGCG TTACCGTTGG TATGGCGTTC TAATGGCATG GGTGGTGGTG ATTGTGGGTT GGACATATGT GATCCAGATG CCTGATAGAT ATGAATCCTC AGCACGAGTT TATGTCGATA CTGATTCGCT ACTGCGCCCC CTTTTGAAAG GATTGGCTAT TCAGCCTAAT GTAGACCAAA GGCTTAGAAT AATGACTCAG ACTTTATTAA GCCGGCCTAA TTTAGGAAAG GTACTCCGTC AAACCGATAT GGATTTGTCC GTGACCACGC CGGAGCAGGA AATCAAACTC CTTAACCAGC TTGAAAAAAA TATCCATATA AAAGGGGCGA GAAGAGATAA TCTATATACA ATTGCATATG AAAATACTGA TCCTCAACTT GCACAGCGGG TGGTGCAGGC CATATTGAAT ATTTTTGTGG AAAGCACTAT GGGTGCCTCC CGTAAAGATA GTAATACTGC TCAACAGTTC ATTGGCCAAC AAATTAAAGA ATATGAAAAA CTGCTTCGTA CTGCTGAACA AAGGTTAATG GACTTTAAGC GGGAGCATGT AGGCATGATG CCCAATGAAA AGGGCGATTA TTATCAGCGT TTGCAGACTG CGGTAGAGAA TTTACGTACG GCGCGCACTG AGCTTAATAT GGCGATAGGG CGCCGGGATG TACTTAAACG TCAATTGAGA GGAGAGGAAC CGGTTTTTGG ATTTGGAACA GGTGGAACAG GAACCTCGTC GAAAGATAGT AGTCCTGCGG GTATGCGTAT TCAATCCTTG CAAGCGGAAC TAGATGAGGT GTTGCTCAAG TACACGGATA AACATCCAAA GGTTTCGGCG ATTAAAGAAA CTATTGCGAT GTTACAAAAA AGAGAAGAGC AGCAATTTTC TTTGCCGCAG AAGCAGCAAG GTGAAGGAGA AGAAGCTAAT GAAGCGGGAG AATATGCAGT GGGGGGTAAC TTTTATTATC AGCAAATGCA GATTTCTCTG GCAGAAGCGG AAGCCAATAT TGCTTCTCAA GAAGCGGAAG TTAGCGCCTT AGAGAAAGAT GTAGAACGGT TGCATGAGCT CGTCGATACT ATTCCTAAGG TAGAAGCGGA ATTAGCTCAG CTAAATCGTG ACTATGGGGT TTATAAAAGC AACTATCAGC AACTATTAAC CCGCCTAGAA TCAGCGAAAA TGGGTGAAAG GGTGGAAGAG TCGCCAGATA ATGTTAAGTT TAAAATTGTG GAGCCGCCTA TACAGCCGCT CCTCCCTTCT GGTCCTGACC GCCCCTTGTT ATTAACGTTA GTTCTGGTGG TTGCGGGAGG TGCCGGAGGG GCTTTGGCAT TTTTTCTTTC CCAATTAAGG CCCGTTTTTT ATACCCGCCG GGATTTAGAA GAAGCGACGG GGCTTCCCGT CCTAGGCCCG GTATCAATGA TATTATCAGG GCGTATTTTA TGGAAGCATC GGCTTAATCT GGCGTTTCTT CTTACTTTTC TAGGTCTTCT CATCGCTGGA TATGGATTAT TGGTATCCAA TTATCTCTTT GGTATCAAAA TGTTCGACAC AATTAAGCAT TCGCTTTTTT AG
|
Protein sequence | MHELWLQLYS YWCGIWRYRW YGVLMAWVVV IVGWTYVIQM PDRYESSARV YVDTDSLLRP LLKGLAIQPN VDQRLRIMTQ TLLSRPNLGK VLRQTDMDLS VTTPEQEIKL LNQLEKNIHI KGARRDNLYT IAYENTDPQL AQRVVQAILN IFVESTMGAS RKDSNTAQQF IGQQIKEYEK LLRTAEQRLM DFKREHVGMM PNEKGDYYQR LQTAVENLRT ARTELNMAIG RRDVLKRQLR GEEPVFGFGT GGTGTSSKDS SPAGMRIQSL QAELDEVLLK YTDKHPKVSA IKETIAMLQK REEQQFSLPQ KQQGEGEEAN EAGEYAVGGN FYYQQMQISL AEAEANIASQ EAEVSALEKD VERLHELVDT IPKVEAELAQ LNRDYGVYKS NYQQLLTRLE SAKMGERVEE SPDNVKFKIV EPPIQPLLPS GPDRPLLLTL VLVVAGGAGG ALAFFLSQLR PVFYTRRDLE EATGLPVLGP VSMILSGRIL WKHRLNLAFL LTFLGLLIAG YGLLVSNYLF GIKMFDTIKH SLF
|
| |