Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1126 |
Symbol | |
ID | 6374801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1213935 |
End bp | 1215002 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683628 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001959545 |
Protein GI | 189500075 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.963046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.386119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGGT TACAGGATAT ACGTGTATCA AATATTGAGC GGTTGACAAC TCCCCGCAGT TTAAAGGAGA AACTACCGGT AACGCAAGGG GTTGCCGATG TAGTGTGTCA GGGGCGTGAG GAAGTCGAGG GTATTTTATC TGGTCGTGAT TCAAGGCTTC TTGTTGTTGT CGGGCCCTGT TCCATCCATG ACATCAACGC GGCGATGGAG TATGCTCGCC GGTTGAAAGC GCTTCGTGAT GAATTGAAGG ATGATCTCTG TATCATCATG CGGGTTTATT TTGAAAAGCC GAGGACGACC ATCGGCTGGA AAGGATTTAT CAATGATCCA CACCTTGACG GGACGTTTGA CATTGAGCAC GGTCTCTATT ATGCCCGTAA GCTGCTTCTG GACATCAACG CTCTTGGACT GCCTGCCGCA ACCGAGTTTC TCGATCCGTT TACGCCGCAA TATGTGGCCG ATCTTGTCAG CTGGGCTGCG ATCGGTGCAA GGACAATAGA ATCTCAGACC CATCGTCAGA TGGCCAGCGG CCTGTCGATG CCGGTCGGGT TTAAAAATTC TACCGACGGG AGGGTACAGG CTGCCATTGA CGCGATACGT TCGGCAATGC ACTCGCACAG TTTCCTGGGG ATCGATGCTG ACGGGCACAG CAGTGTTATT ACAACAACCG GCAATCCGTA TGGTCATATG GTGCTTCGCG GTGGATCAGG ACGCCCTAAT TATGACGCGG AAAATATCGC GGATGCTGAA AGACGTCTTG AAAAAGAGGG GCTTGATAAA AACCTTCTGG TCGACTGCAG CCATGCCAAT TCAGGGAAAA ACTATGAACG TCAGTCAACA GTATGGAACA GCATCATCGA GCAGCGGGTG ACCGGGACCG AGAGTATTCT CGGCGTTATG CTTGAAAGTA ATCTTCTTTG TGGGAAACAG TCTGTTTCGA CTGATCCGTC ATCATTGCAG TATGGCGTAT CGATTACAGA TGCCTGTATT TCATGGGAAG AGACCGCCAC GCTGCTGCGA GACGGAGCGA TGAAACTTCA TCATTTTCTG TCCAGGGCGG AAGTGTAA
|
Protein sequence | MQRLQDIRVS NIERLTTPRS LKEKLPVTQG VADVVCQGRE EVEGILSGRD SRLLVVVGPC SIHDINAAME YARRLKALRD ELKDDLCIIM RVYFEKPRTT IGWKGFINDP HLDGTFDIEH GLYYARKLLL DINALGLPAA TEFLDPFTPQ YVADLVSWAA IGARTIESQT HRQMASGLSM PVGFKNSTDG RVQAAIDAIR SAMHSHSFLG IDADGHSSVI TTTGNPYGHM VLRGGSGRPN YDAENIADAE RRLEKEGLDK NLLVDCSHAN SGKNYERQST VWNSIIEQRV TGTESILGVM LESNLLCGKQ SVSTDPSSLQ YGVSITDACI SWEETATLLR DGAMKLHHFL SRAEV
|
| |