Gene Moth_1333 details

Gene Information

Locus tag: Moth_1333
Symbol:
ID: 3831043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1379381
End bp: 1380508
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 63%
IMG OID: 637829269
Product: prephenate dehydrogenase
Protein accession: YP_430189
Protein GI: 83590180
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0287] Prephenate dehydrogenase
TIGRFAM ID:
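
The listed lengths can be cross-checked against the coordinates. The short Python sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1379381
end_bp = 1380508

# Inclusive coordinates: length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is the stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1128 0 375
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields.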


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000339582
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.134491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCAGTA GCAAATCTGG TCCCGGGATA GAAAGGGTGG CTATCCTCGG TCTGGGGCTC 
ATCGGCGGCT CCCTGGGCCT GGCCCTCCGC AAAAGGGGGG TAAAAGAGGT GGCCGGTTAT
GACCGGCACC CGGAGACTAT CGAGACGGCC TTGACCCTGG GGGCCATCAA CCGCCCGGCA
GCAGACCCGG CAACTGCCGT CCAGGGGGCG CAAGTAGTCA TACTGGCCGT TCCTGTCGGT
GCCCTGGGCT CCCTGGCAGG AAGTATCGTG CCCTTCCTCG ACCCGGAGGC CATCGTCACC
GATACGGGGA GCGTTAAGGG GGCGGTAGTC CGGGATCTCG AAGCAATCTT CCGGGATCGG
GCCCGGTATG TCGGCGGGCA TCCCATGGCT GGCTCCGAAC GGGCCGGCAT TGCCGCCGCC
GATGGTTACC TCCTGGAGAA TGCCGTTTAT GTCCTCACAC CGACCCCGGC CACTGACACA
AGGGCTTTAA AAAGCCTCGA GGGGTTATTT CAATCCCTGG GTTCCCGGGT TATCACCCTG
GACCCCGATG AGCATGACCT GATCGTAGCC GGTGTCAGTC ACCTGCCCCA CTTCCTGGCT
GTGAGCCTGG TACAGGCTGC CGGGGAACTT GCCCGGGAGC ACCCCCTGGC CTTAATGCTG
GCTGCGGGTG GTTTCCGGGA TACCACCCGC ATCGCCGGCG GTGACCCGGT GATGTGGCGG
GATATCTTTC TCTACAACCG GGAGGCTATC CTGGCGCTTT TAAAATCCTG GCGCTGCCAG
ATTGACGCCC TGGAAGAGAT GATCCGCGCG GGCGACGCCA CCGGCCTGGA AACCGTCCTC
AATGAGGCCC GGGCCTTACG GGCCAGGGTA CCGGCCCGGC AAAAAGGCCT CCTCCCGGCC
CTCCATGAAC TGGTGGTTAC CGTCCCCGAC CGGCCCGGGG TTATCGGGGC CATGGCCACC
TCCCTGGGGG ATGCCGGCAT CAATATCATT GATATTGAAA TTCTCCGCGT CCGGGAAGGG
GAGGGCGGCA GCATCCGCCT GGGATTTACC ACGGCGGCTG CTGCCACCAG GGCCTTGGAG
ATATTACAAA ATTCCGGGAT TAATGTACGA CTACTGGAAA ATGCTTGA
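
A GC content figure like the one in the Gene Information table can be recomputed from a sequence printed in this blocked layout. The following Python sketch assumes GC content means the percentage of G and C bases among all bases. The function name gc_content is hypothetical, and only the first 60-base line of the gene sequence is used as sample input.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base blocks and line breaks used in the record)
    is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_block = "GTGGCCAGTA GCAAATCTGG TCCCGGGATA GAAAGGGTGG CTATCCTCGG TCTGGGGCTC"
print(round(gc_content(first_block), 1))
```

Passing the full gene sequence instead of the first line gives the gene-wide value, which can be compared with the percentage listed above.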
 
Protein sequence
MASSKSGPGI ERVAILGLGL IGGSLGLALR KRGVKEVAGY DRHPETIETA LTLGAINRPA 
ADPATAVQGA QVVILAVPVG ALGSLAGSIV PFLDPEAIVT DTGSVKGAVV RDLEAIFRDR
ARYVGGHPMA GSERAGIAAA DGYLLENAVY VLTPTPATDT RALKSLEGLF QSLGSRVITL
DPDEHDLIVA GVSHLPHFLA VSLVQAAGEL AREHPLALML AAGGFRDTTR IAGGDPVMWR
DIFLYNREAI LALLKSWRCQ IDALEEMIRA GDATGLETVL NEARALRARV PARQKGLLPA
LHELVVTVPD RPGVIGAMAT SLGDAGINII DIEILRVREG EGGSIRLGFT TAAAATRALE
ILQNSGINVR LLENA
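
The gene sequence begins with GTG while the protein begins with M, which fits translation table 11, where GTG can act as an initiation codon. The Python sketch below illustrates this on the first and last lines of the gene sequence. It is a simplified illustration, not the database's own pipeline: the function name translate and the first_is_start flag are hypothetical, and the start codon set is deliberately limited to ATG, GTG and TTG rather than every initiator the table allows.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid
# assignments of table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiation codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon at position 0 is read as methionine.
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


first_line = "GTGGCCAGTA GCAAATCTGG TCCCGGGATA GAAAGGGTGG CTATCCTCGG TCTGGGGCTC"
last_line = "ATATTACAAA ATTCCGGGAT TAATGTACGA CTACTGGAAA ATGCTTGA"

print(translate(first_line))                        # MASSKSGPGIERVAILGLGL
print(translate(last_line, first_is_start=False))   # ILQNSGINVRLLENA
```

Both outputs match the corresponding stretches of the protein sequence above. The last line is translated with first_is_start=False because it is an internal fragment, so its first codon is not an initiation site.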