Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1988 |
Symbol | |
ID | 3704872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2285796 |
End bp | 2287067 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637738464 |
Product | putative glycosyl transferase |
Protein accession | YP_343980 |
Protein GI | 77165455 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0156552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGACT CCAGCACCGG GCCAATTAGC TCGATAAGCA AGCGGCGGCG AATACTTTTT TTTGCTGAAG CCGTGACATT GGCGCATGTG GCGCGGCCAG TGGCCCTGGC AAAGAATCTG AACCCGGCTC TTTATGAGGT TCATTTTGCG TGCGACGCAA GGTATCACAA ACTACTGGGC AAGCTCCCAT TTATCTGGCA CCCCATTCAT TCTCTTGCTA GCGAAAAATT TCTTGAGGCG CTCTCCAAGG GCAGTCCCGT TTATAGGGCT GATACACTAC GCGCTTATGT CAACGAAGAC GCGAAGGTAA TCAAGGAAGT AAGCCCGGAT GTGATCGTAG GGGATTTTCG TCTGTCCCTA GCCGTTAGCG CGCCGCTCGC TCAAATTCCT TATATGACAA TTGCTAATGC TTATTGGAGT CCGTATGCTA AACGGCGTTT TCCCGTACCG GATATTCCTT TAGCAAAGAT AATCGGAATC AAGGCGGCAC AATACCTGTT TAACGCCATC CAGCCCCTGG CCTTTGCCTA TCATGCTCTG CCCTTAAATA AGATCAGGCA CGAATATGGC TTACCCAAGA TAAGCTTGGA TTTACGCCAT ATCTACACCT ATGCCGATCA CACACTTTAT GCCGATATCC CAGCGCTGGC GCCGACCATC GATCTCCCCT CCGGGCATCA TTATCTCGGT CCGGTTCTTT GGTCGCCCGC AGTCCCTCTT CCTGCTTGGT GGGAGAAAAT ACCTGCGGAC AAACCCGTTC TCTATGTGAG CTTGGGCAGT TCTGGGCAAA GTCAATTATT ACCGGAGATG TTAAAGGCGC TAGCCGATCT GCCTATCACT CTCCTGGTGG CAACGGCGGG GCGAATCAAA CTCCCTAGCC CGCCAAAAAA TGCCTTTATA GCGGATTATC TTCCCGGCGA TAAAGCAACA GCCCGCGCCA GCCTCGTGAT CTGTAATGGC GGCAGCCTTA TGACCCAACA AGCGCTCATA AAGCGTGTGC CAGTGTTGGG AATTGTCAAT AACCTTGACC AGCACCTCAA TATGGAGGCG GTGCAAAGCG CAGGCGCGGG CGAACTCCGG CGGGCGGCAA ACGTGACCAC GGCGCATATT CTCGCCACCA CACGCCAAAT GCTAGACCAG CCTCGTTATG CTCAGGCCGC TACCCGTCTA GCGGACTTAT TGTCCAACTA CAACGCCTCC GATAAGTTTA ACTCCATTTT AGGCCGGATG TTCTCGAGGA AATTATGCGC AAAATCGGTT TCAGCCAGAT AA
|
Protein sequence | MIDSSTGPIS SISKRRRILF FAEAVTLAHV ARPVALAKNL NPALYEVHFA CDARYHKLLG KLPFIWHPIH SLASEKFLEA LSKGSPVYRA DTLRAYVNED AKVIKEVSPD VIVGDFRLSL AVSAPLAQIP YMTIANAYWS PYAKRRFPVP DIPLAKIIGI KAAQYLFNAI QPLAFAYHAL PLNKIRHEYG LPKISLDLRH IYTYADHTLY ADIPALAPTI DLPSGHHYLG PVLWSPAVPL PAWWEKIPAD KPVLYVSLGS SGQSQLLPEM LKALADLPIT LLVATAGRIK LPSPPKNAFI ADYLPGDKAT ARASLVICNG GSLMTQQALI KRVPVLGIVN NLDQHLNMEA VQSAGAGELR RAANVTTAHI LATTRQMLDQ PRYAQAATRL ADLLSNYNAS DKFNSILGRM FSRKLCAKSV SAR
|
| |