Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2913 |
Symbol | |
ID | 3707430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3294953 |
End bp | 3295891 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637739390 |
Product | hypothetical protein |
Protein accession | YP_344889 |
Protein GI | 77166364 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.410648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTT GGAAGTTACG CCAGGTACCT TTCAAATATC AGCTCAGTGA CCGGACATTG TGGAGCGCTT CATTGCCGCT CCAATGCCGA TCGGTCGATC TGTTTGATGA ATCGTTGCCG CAGTTGCCAC TGGAACCGCC CGCTGACGAA CTTGTGGAGG GAAGCCAGGG ATTCAGCGTC CGTGCGCTAC CGGTAGGCAC TGAACTTCCC AGAATCAGCA GACATGGCAG CTACCTCTGT TACGTGCCGT TGAATTATCA GCATTACTAT ATTGATTTGT CCCTGACCTA TGACGACTAT CAGAAGACGT TTTCTTCGAA AACGCGTTCC ACGATTAATC GCAAAGTGAA GAAATACGCC AAACATTGCG GCGGCAACAT CCGCTGGAAA ACCTATAAAG CACCTGGGGA AATCGGTGAT TTTTTCCGCC AAGCACGGCG AGTGTCAAAG TTGACGTATC AGGAACGGCT TCTCGATGCG GGGATCCCTG GGTCCGATGC CTTCATCCAG CAGGCGGAAG CACTGGCGGG CGAAGACCGC TTGCGGGCCT ATATCCTATT CGACGGCGAG CGGCCCGTAT CGTACCTGTA TTGCCCCGCT CGTAACGATG TGCTCATCTA TGCCTATCTC GGCTATGATC CTGACTACAT GAAGCTGTCG GTCGGCACAG TACTTCAATG GCTAGCTCTG CAAGAGCTTT TCGGCGAGGG TCGATTTAAA ATTTTTGATT TCACTGAAGG CCAGTCGGAT CATAAGCGCC TCTTTGCCAC GCACCAAAAA AATTGCGCCA ATGTATTCTT TGTCAAGAAT TCGATTCGCA ACGGAGCTAT CATCTACAGC CACAATTTTT CGTCTTGCGT ATCAGGTTGG CTGGGTTCCA GGCTCGACCA GCTAGGCCTG AAAGCAAGAA TCAAACGTCT ACTGCGATTC GCGGGATAA
|
Protein sequence | MKGWKLRQVP FKYQLSDRTL WSASLPLQCR SVDLFDESLP QLPLEPPADE LVEGSQGFSV RALPVGTELP RISRHGSYLC YVPLNYQHYY IDLSLTYDDY QKTFSSKTRS TINRKVKKYA KHCGGNIRWK TYKAPGEIGD FFRQARRVSK LTYQERLLDA GIPGSDAFIQ QAEALAGEDR LRAYILFDGE RPVSYLYCPA RNDVLIYAYL GYDPDYMKLS VGTVLQWLAL QELFGEGRFK IFDFTEGQSD HKRLFATHQK NCANVFFVKN SIRNGAIIYS HNFSSCVSGW LGSRLDQLGL KARIKRLLRF AG
|
| |