Gene Information |
Locus tag | Cagg_3803 |
Symbol | |
ID | 7267877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4638718 |
End bp | 4639686 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643568611 |
Product | Prephenate dehydrogenase |
Protein accession | YP_002465075 |
Protein GI | 219850642 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0287] Prephenate dehydrogenase |
TIGRFAM ID | |
|
|
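The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp. Assumption: coordinates are 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumption: the final codon is a stop codon and is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(4638718, 4639686)
print(length, protein_length_aa(length))  # prints: 969 322
```

Under those assumptions the computed values match the Gene Length (969 bp) and Protein Length (322 aa) rows.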
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000296114 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.352645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGAA TCGCGATTAT TGGGCTTGGT CTGATCGGCA CCTCACTTGG GATGGCGTTG CGTAACGTCG ACCCCAAAGA GTCGCCGCTC GGTGCGGTCG AGGTGATCGG GTTCGACCGC GAGCCGCGTG TGATCCGCGA GGCGCGGGGA AGGTTGGCGA TTGATCGCGA AGCACGCACG CTAGCCGAAG CGGTGCATGA GGCACAGATG GTGGTGGTGG CAACACCGGT GCGTGCGATG CAAGAGGTAT TCCAGGAGCT TGCCACCCTG TTGCCTGCTG GGGCAGTGGT GACGGATGTT GCCAGTACCA AGGCACTAGT TTGTCGTTGG GCTAACGAAC TATTGCCACG TACCGTGAGT TTTGTTGGCG GGCACCCGAT GGCCGGGCGG GAAAAATCTG GTCCGGCAGC CGCCGATCCT GACCTGTTCC GTGAGGCGAT TTACTGCCTT ACTGCAACGC ACGATACCGT ATCGCAAGCG GTCGAAGCGG TCGAAGCACT CGTGCGCACG GTTGGGGCTA AGCCCTACTA TATCGACCCC GAAGAGCACG ATGTGTACGT TGCCGGCATT TCCCATCTCC CCCTCCTCCT CTCGGTGGGG TTGGTCACCG CCACCGGTAG TAGCCCGGCG TGGAAGGAGA TGGCGCCGTT GGCGGCCACC GGGTTTCGCG ATGTATCGCG CCTTGCCTCC GGCGACCCGC AGATGCAGCG TGACATCTTG CTCACTAACC CGCATGGCCT CACCCGCTGG ATCGACGAGC TAATCCGCTT CCTCGTCACC GCGCGCGAAC AGATTAATAC CGGCGATGCC GCGGCCATCG AGCAGATGCT GCAACAGGCC AAAGCCACCC GTGACGCATG GCTGGAAAGC AAGCCGCACC TGCGCCCCGG CGAAGCCGAT TTTACCGCCA TGCCCACAGT CGAACGACCT AGCCTGCTTG GTTTCCGCTT GCCGAAGCGG GAGAAGTGA
|
Protein sequence | MIRIAIIGLG LIGTSLGMAL RNVDPKESPL GAVEVIGFDR EPRVIREARG RLAIDREART LAEAVHEAQM VVVATPVRAM QEVFQELATL LPAGAVVTDV ASTKALVCRW ANELLPRTVS FVGGHPMAGR EKSGPAAADP DLFREAIYCL TATHDTVSQA VEAVEALVRT VGAKPYYIDP EEHDVYVAGI SHLPLLLSVG LVTATGSSPA WKEMAPLAAT GFRDVSRLAS GDPQMQRDIL LTNPHGLTRW IDELIRFLVT AREQINTGDA AAIEQMLQQA KATRDAWLES KPHLRPGEAD FTAMPTVERP SLLGFRLPKR EK
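The sequences above are printed in space-separated groups of ten characters, so they need to be normalized before any computation. The following is a minimal sketch with hypothetical helper names, not the method the database itself uses. It strips the grouping whitespace, computes a GC percentage, and checks that a coding sequence ends with one of the stop codons of translation table 11 (TAA, TAG, TGA). The example input is a made-up toy fragment, not this gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the grouping whitespace and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy fragment for illustration only (hypothetical, not taken from the record).
toy = clean_sequence("ATGGCC TGA")
print(len(toy), ends_with_stop(toy), round(gc_percent(toy), 1))
```

To apply this to the record, paste the full Gene sequence value into `clean_sequence` and compare the results with the Gene Length and GC content rows.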