Gene Information |
Locus tag | Noc_1491 |
Symbol | |
ID | 3705982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1650427 |
End bp | 1651527 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737978 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_343507 |
Protein GI | 77164982 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
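The length fields in this record can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1650427, 1651527)
    print(length, protein_length(length))
```

With the values listed for Noc_1491, the coordinates give 1101 bp and 366 residues, which agrees with the Gene Length and Protein Length fields.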
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.139847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTC CTACTGAAGA TCTTCGCATT AAGAACATTC AAGAAGTTAT TCCCCCGGCC CAACTCCATG AAGGCTTGCC CATTACCAGC GAGGCATCAA AAACAGTCTA TCGAACCCGG CAAGCCATTC AGGAAGTGCT CGGCGGGAAA GATGACCGCC TGCTGGTGGT CGCTGGCCCC TGCTCCATCC ATGATCCCCA GGCTGCACGG GATTATGGGA AAAGACTCAA GCTACTAATT GATGAGCTTG CCGATGAATT GCTCATCGTC ATGCGCGTTT ATTTTGAGAA ACCTCGCTCT ACAGTAGGCT GGAAAGGGCT TATTAACGAT CCCCATTTGG ACGGCAGCTT TCAGATTAAT GAAGGCTTGC GCTTGGCCCG CAGGCTGCTG TTGGATCTGG CCGAAACAGG TGTGCCGGCA GGCACCGAAT ACCTGGATCT CGTTAGCCCA CAATATATCG CTGATCTAAT TGCCTGGGGC GCAATCGGGG CTCGCACAAC TGAAAGTCAA GTGCATCGGG AACTCGCCTC GGGACTCTCA TGTCCGGTTG GCTTTAAGAA TGCGACCAAC GGCAGCTTAG GTGCCGCCAT GTCCGCTATC GTCTCGGCCT CAAAGCCCCA TCATTTTCTC TCCCTCACCT TGGCTGGCCG TTCGGCTATT TTTTCAACTG CCGGAAATCC AGATTGCCAT CTCATTTTAC GGGGTGGGCA GAAACCTAAT TACGATGCAG CAAGCGTCAA CGAGACGGCC CATAATCTTA TTCAAACGGG CCTTCGGCCT CAAGTCATGA TTGATTGCAG CCATGGCAAC AGCAGCAAGA ATCCCAAGAA GCAAGTTCTG GTTGCCAGGG ATATTGCGGG ACAAATTGCT GCGGGCGATA GGCGAATCAT GGGGGTTATG CTGGAAAGCC ATCTCGTAGC AGGACGCCAG GACGTCATCC CAAACACCCC TCTTACCTAT GGCCAAAGCA TCACCGATGC CTGTATAGGC TGGGAGGAAA GCGAGCAGTT ACTTCGTGAA TTTGCCCGCG CCATACAGAA GCGGCGGCAA ATGCCAGAAA AACACATTGA AGCTAAACAT GGATGTTCAG CAACCCCCTA G
|
Protein sequence | MSFPTEDLRI KNIQEVIPPA QLHEGLPITS EASKTVYRTR QAIQEVLGGK DDRLLVVAGP CSIHDPQAAR DYGKRLKLLI DELADELLIV MRVYFEKPRS TVGWKGLIND PHLDGSFQIN EGLRLARRLL LDLAETGVPA GTEYLDLVSP QYIADLIAWG AIGARTTESQ VHRELASGLS CPVGFKNATN GSLGAAMSAI VSASKPHHFL SLTLAGRSAI FSTAGNPDCH LILRGGQKPN YDAASVNETA HNLIQTGLRP QVMIDCSHGN SSKNPKKQVL VARDIAGQIA AGDRRIMGVM LESHLVAGRQ DVIPNTPLTY GQSITDACIG WEESEQLLRE FARAIQKRRQ MPEKHIEAKH GCSATP
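The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below, under stated assumptions, shows one way to strip the spacing, compute GC content, and translate the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGAGCTTTC CTACTGAAGA TCTTCGCATT"))
```

Translating the first three blocks of the gene sequence yields MSFPTEDLRI, the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.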