Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1961 |
Symbol | |
ID | 3704975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2247729 |
End bp | 2248820 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637738437 |
Product | hypothetical protein |
Protein accession | YP_343953 |
Protein GI | 77165428 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGGT TTTCTTGGAT TAGCCCTCCC TCTGGTAGTG GGGCGGAGGA GTTTGGGTTT TGGCGTGTAG TGCCTGCTAG GGAATTGATA ACAACCTATA GGACTCATTG GCAGCAGTTA AATTCCCATC ACTATAAAAA CCATCCTTTA TTGGATATCC GCTTTGTTGG TTTATTGCTT GAGTATTTTG GTGCCGACGA TGTCTTTCTC GTCGTATATC AGGAGGGTGA ATCTGTCACT GGCTTGCTGC TGCTTCGATC GGAGAGAAAT GGAATGTGGA ATACGTTTCT GCCAAGTCAG GCGCCCATTG GACCTGCCCT ATTAGGAGGC AGCGCAGGCT ATTCGGCACT TCGTAAATTA TTTCCTGCTC TGCCCGGCTA CGCTTGGGGT TTGGGACTTG CCTGCCAGGA TTCGGACTAC GCTATTCAGA TGCGACCGAT GGATTATAAA CACATGCTGA CTATACCCCA CTGGAATACT ATACAAATCA AACTGGAAGG TTCCTTTGAT GCCTTTTGGA ATCAACGGTC GAAGAATCTG AAGAAAAATA TTTATCGCTA TCTGCGGCGC GTAAAAAAGG TGGGCCTCAT TCCTCGTTTG GAAGTTATAC GAAATTTAGA CTCTATGGGA GAAGCGGTAG ATCAGTACGG AGAGCTTGAG TCCAAAGGTT GGAAGGGGCG ATTGGGAACC GCATTACATA GAAATAATGT ACAAGGTCAA TTCTATCGAC AAGTAATGGA GGCGTTTGCC CAAGAGGGAA ACGCTGCAGC CTGGAATCTC TATTTAGGAG ATAAGCTTGC TGCTTCCCGT CTAACCATTA CCGGGGGCGA CATGACTGTA ATACTAAAAA CGGCCTTTGA TGAGGCGCTT TCTCGATATG CGCCCGGGCG TCTGTTGTTA TATTTGTTTT TAGAGCGTAT CTTTCAGGAA GAAAAAGGAA GTACTCTCGA GTTCTATACT AATGCTACCC AAGACCAACT TGCCTGGTGT ACGGGTTCAA GATCCATTTA TCATATAAAT TATTATCGAT ATCCTCTCTA TCGATGGGGT GTAGCGACTA CGAAGAGAGT TGGAGCATTA TGGCAAGGGT AA
|
Protein sequence | MQRFSWISPP SGSGAEEFGF WRVVPARELI TTYRTHWQQL NSHHYKNHPL LDIRFVGLLL EYFGADDVFL VVYQEGESVT GLLLLRSERN GMWNTFLPSQ APIGPALLGG SAGYSALRKL FPALPGYAWG LGLACQDSDY AIQMRPMDYK HMLTIPHWNT IQIKLEGSFD AFWNQRSKNL KKNIYRYLRR VKKVGLIPRL EVIRNLDSMG EAVDQYGELE SKGWKGRLGT ALHRNNVQGQ FYRQVMEAFA QEGNAAAWNL YLGDKLAASR LTITGGDMTV ILKTAFDEAL SRYAPGRLLL YLFLERIFQE EKGSTLEFYT NATQDQLAWC TGSRSIYHIN YYRYPLYRWG VATTKRVGAL WQG
|
| |