Gene Information |
Locus tag | Moth_1333 |
Symbol | |
ID | 3831043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1379381 |
End bp | 1380508 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829269 |
Product | prephenate dehydrogenase |
Protein accession | YP_430189 |
Protein GI | 83590180 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0287] Prephenate dehydrogenase |
TIGRFAM ID | |
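The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1379381
end_bp = 1380508

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and is not part of the protein.
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1128 0 375
```

Both results agree with the Gene Length (1128 bp) and Protein Length (375 aa) fields above.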
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000339582 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.134491 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAGTA GCAAATCTGG TCCCGGGATA GAAAGGGTGG CTATCCTCGG TCTGGGGCTC ATCGGCGGCT CCCTGGGCCT GGCCCTCCGC AAAAGGGGGG TAAAAGAGGT GGCCGGTTAT GACCGGCACC CGGAGACTAT CGAGACGGCC TTGACCCTGG GGGCCATCAA CCGCCCGGCA GCAGACCCGG CAACTGCCGT CCAGGGGGCG CAAGTAGTCA TACTGGCCGT TCCTGTCGGT GCCCTGGGCT CCCTGGCAGG AAGTATCGTG CCCTTCCTCG ACCCGGAGGC CATCGTCACC GATACGGGGA GCGTTAAGGG GGCGGTAGTC CGGGATCTCG AAGCAATCTT CCGGGATCGG GCCCGGTATG TCGGCGGGCA TCCCATGGCT GGCTCCGAAC GGGCCGGCAT TGCCGCCGCC GATGGTTACC TCCTGGAGAA TGCCGTTTAT GTCCTCACAC CGACCCCGGC CACTGACACA AGGGCTTTAA AAAGCCTCGA GGGGTTATTT CAATCCCTGG GTTCCCGGGT TATCACCCTG GACCCCGATG AGCATGACCT GATCGTAGCC GGTGTCAGTC ACCTGCCCCA CTTCCTGGCT GTGAGCCTGG TACAGGCTGC CGGGGAACTT GCCCGGGAGC ACCCCCTGGC CTTAATGCTG GCTGCGGGTG GTTTCCGGGA TACCACCCGC ATCGCCGGCG GTGACCCGGT GATGTGGCGG GATATCTTTC TCTACAACCG GGAGGCTATC CTGGCGCTTT TAAAATCCTG GCGCTGCCAG ATTGACGCCC TGGAAGAGAT GATCCGCGCG GGCGACGCCA CCGGCCTGGA AACCGTCCTC AATGAGGCCC GGGCCTTACG GGCCAGGGTA CCGGCCCGGC AAAAAGGCCT CCTCCCGGCC CTCCATGAAC TGGTGGTTAC CGTCCCCGAC CGGCCCGGGG TTATCGGGGC CATGGCCACC TCCCTGGGGG ATGCCGGCAT CAATATCATT GATATTGAAA TTCTCCGCGT CCGGGAAGGG GAGGGCGGCA GCATCCGCCT GGGATTTACC ACGGCGGCTG CTGCCACCAG GGCCTTGGAG ATATTACAAA ATTCCGGGAT TAATGTACGA CTACTGGAAA ATGCTTGA
|
Protein sequence | MASSKSGPGI ERVAILGLGL IGGSLGLALR KRGVKEVAGY DRHPETIETA LTLGAINRPA ADPATAVQGA QVVILAVPVG ALGSLAGSIV PFLDPEAIVT DTGSVKGAVV RDLEAIFRDR ARYVGGHPMA GSERAGIAAA DGYLLENAVY VLTPTPATDT RALKSLEGLF QSLGSRVITL DPDEHDLIVA GVSHLPHFLA VSLVQAAGEL AREHPLALML AAGGFRDTTR IAGGDPVMWR DIFLYNREAI LALLKSWRCQ IDALEEMIRA GDATGLETVL NEARALRARV PARQKGLLPA LHELVVTVPD RPGVIGAMAT SLGDAGINII DIEILRVREG EGGSIRLGFT TAAAATRALE ILQNSGINVR LLENA
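The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own pipeline. It assumes the sequence is given 5' to 3' in the coding orientation, that translation table 11 assigns the same amino acids to codons as the standard code, and that the first codon (here GTG) is a valid start and is read as methionine. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 assigns the same amino acids as the standard code
# and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS given 5'->3' in coding orientation (hypothetical helper)."""
    # Remove the spaces that split the sequence into blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        # Assumption: the first codon is a valid start and is read as Met,
        # even when it is GTG rather than ATG.
        aa = "M" if i == 0 else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "GTGGCCAGTA GCAAATCTGG TCCCGGGATA"
print(translate_cds(first_blocks))  # MASSKSGPGI
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to the same function is one way to compare it against the full protein sequence.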
|
| |