Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2606 |
Symbol | |
ID | 3704361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2960141 |
End bp | 2961181 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637739087 |
Product | threonine aldolase |
Protein accession | YP_344589 |
Protein GI | 77166064 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2008] Threonine aldolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000143153 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGAAC GGCATTTTAT CAGCGACAAC GCAGCCGGTA TACACCCAGA AGTCATCGCC ATGCTGGAAA GGGCTAGCCG CGGCCACGCC ATTGCTTATG GCAACGATTC CTTAACGCAG CAGGCTCTCC AGCTATTCAA ACAGCACTTC GGGGCACAGA CTGAGACATT TTTTGTACTG ACTGGCACCG CCGCCAACGT CATTGCCCTG CAAAGCGTCC TATCTTCCTT CGAGGCTGTT ATTTGTGCCG ACTGTGCCCA CCTCCACCGG GACGAATGCG GAGCGCCGGA AAAATTCCTG GGATCAAAGC TGCTAATCGC CCAAACTCAG CAAGGCAAGC TAAGCGTAGC AACTGTAGCG CCACTATTGC GCGACACGGC TATGGTCCAT CGCGTCCAGC CGAAAGTCCT TTCCATTACC CAATGCACGG AATGGGGGAC TATCTATACT CCTGCAGAGA TCAAAACCCT GGCGGATTTC TGCCATGAGC AAGGGTTGCT GCTACACATG GATGGGGCTC GGTTAAGTAA CGCCGCTGCC CGACTCAATT TAAGCCTAAA AGAGATGACC GCAGATGTGG GCGTGGATGT ACTTTCCTTT GGTGGCACCA AAAATGGGCT GCTAGCAGCT GAAGCGATTG TTTTCTTCGA TCCCCAGTTG GCGAAAAAAA CCGGCTTCTA CCGTAAACAA AGCATGCAAC TAGCCTCCAA AATGCGTTTT ATCTCAGCTC AATTTTTAGC TCTATTAATC AACGATCTCT GGTGGAAAAA TGCGCAGCAC GCCAATGAAA TGGCGGCTTT ACTGGAACGA GAACTCAAGA ATATCCCGCA GGTGGAACTT GTCGTCCCCG TAGAAACCAA CGGGATATTT GCCCGAATAC CTCCCTCTTG GGTACCCTGT TTACAACAAC ATTATGCCTT TGCGGTCTGG GACTCGGCTA GCACGGTAGT GCGCTGGATG ACCTCATTTG ATACCACGGC GGAGGAAGTG CAAGATTTTG CGCAAAAGAT CCGAAACATG AACGAGGATA ACGCCCCCTA A
|
Protein sequence | MGERHFISDN AAGIHPEVIA MLERASRGHA IAYGNDSLTQ QALQLFKQHF GAQTETFFVL TGTAANVIAL QSVLSSFEAV ICADCAHLHR DECGAPEKFL GSKLLIAQTQ QGKLSVATVA PLLRDTAMVH RVQPKVLSIT QCTEWGTIYT PAEIKTLADF CHEQGLLLHM DGARLSNAAA RLNLSLKEMT ADVGVDVLSF GGTKNGLLAA EAIVFFDPQL AKKTGFYRKQ SMQLASKMRF ISAQFLALLI NDLWWKNAQH ANEMAALLER ELKNIPQVEL VVPVETNGIF ARIPPSWVPC LQQHYAFAVW DSASTVVRWM TSFDTTAEEV QDFAQKIRNM NEDNAP
|
| |