Gene Moth_1555 details

Gene Information

Locus tag: Moth_1555
Symbol:
ID: 3832188
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1598427
End bp: 1599521
Gene length: 1095 bp
Protein length: 364 aa
Translation table: 11
GC content: 65%
IMG OID: 637829487
Product: 3-dehydroquinate synthase
Protein accession: YP_430407
Protein GI: 83590398
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
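
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1598427
end_bp = 1599521
reported_gene_length = 1095     # bp
reported_protein_length = 364   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under these assumptions, 1599521 - 1598427 + 1 = 1095 bp, and 1095 / 3 - 1 = 364 aa, which agrees with the reported values.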


Plasmid Coverage information

Number of covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 14
Fosmid unclonability p-value: 0.267288
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGGAG AGCTTAACGT AGACCTGGGC GAGCGAACTT ATAAAATCCA CTGCGGCTCC 
GGATTGCTGC CGGTGGCAGG TTCCATCTTG CGTAATCTTA ATCTGGCAGC CCCCTGCCTG
GTGGTAAGCA ATGCCACTGT GGCCGGGCTC TACTGGCCTG TGCTGGAATC CAGCCTGAAG
GCCGGAGGCT TTAACCCCCA CCTGGCCCTG GTACCCGATG GAGAAGAGGC CAAAACCCTT
CAGGTGGCAG CTAGCCTCTA CGACTCCGCC CTGGCCGCCG GGATCGAGCG CCAGGCCGCC
GTCATCGCCC TGGGCGGCGG AGTGGTGGGC GATGTCTCCG GGTTTATAGC CGCCACCTGG
CTGCGGGGGG TTCCTTTCAT CCAGGTGCCC ACCACCCTCC TGGCCCAGGT GGATTCCAGC
GTCGGCGGCA AGGTGGCCGT CAACCACCCC GGCGGCAAAA ACCTCATCGG CGCCTTTTAC
CAGCCGCTGG CCGTTATCGC CGATCTGGAT ACCCTGACCA CCCTGCCGCC GCGGGAGATC
CGGGCCGGCC TGGCTGAGGT CATCAAGTAC GGGGTAATCG GCGACGCCAG CTTTTTTGCT
TATTTAGAAG AGCACCTGGA GGGAGCCCTG GCCGGGGATA AGGAAGTCCT GGAAACCATC
GTCCTGCGTA GCTGCGCCAT GAAGGCGGCG GTGGTGGCCA GGGACGAGCG GGAAAGCGGC
CTGCGGGCCG TCCTGAACTT CGGGCATACA GTCGGTCATG CCGTTGAGGC CGTTACCGGT
TTTACCGCCT ACCGCCACGG TGAGGCGGTG GCCATGGGTA TGGTGGCCGC CGCCCGCCTG
GCCGTCCGGC GGGGTATGTT TTCTATGGAA GAGACCGGGC GGCTGGTGCG CCTTTTAGCA
AGGGCCGGCC TGCCGGTGAC CCTGCCGGAC CTCGATCCGG CAACCTTCCG GGCGGCCCTG
GGCCACGACA AAAAGATCCG CCAGGGCCAG CTGCGGATGG TCCTGCCGGA AAGCCTGGGC
CGGGTGCGTC TTGTCCCCGT CAGTATCGAG GAAATCGCGG CGGTGGTTGA AAACGGTGCG
GCAGCGGAGG GGTGA
 
Protein sequence
MAGELNVDLG ERTYKIHCGS GLLPVAGSIL RNLNLAAPCL VVSNATVAGL YWPVLESSLK 
AGGFNPHLAL VPDGEEAKTL QVAASLYDSA LAAGIERQAA VIALGGGVVG DVSGFIAATW
LRGVPFIQVP TTLLAQVDSS VGGKVAVNHP GGKNLIGAFY QPLAVIADLD TLTTLPPREI
RAGLAEVIKY GVIGDASFFA YLEEHLEGAL AGDKEVLETI VLRSCAMKAA VVARDERESG
LRAVLNFGHT VGHAVEAVTG FTAYRHGEAV AMGMVAAARL AVRRGMFSME ETGRLVRLLA
RAGLPVTLPD LDPATFRAAL GHDKKIRQGQ LRMVLPESLG RVRLVPVSIE EIAAVVENGA
AAEG
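
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is an illustrative sketch only, not the tool used to produce the record. It assumes the codon assignments and alternative start codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code) as encoded below, treats an alternative start codon such as GTG in the first position as methionine, and uses only the first 60 bases of the gene sequence. The helper names are hypothetical.

```python
# Codon table built in TCAG order; amino acids as in NCBI table 11
# (an assumption of this sketch, identical to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for table 11; each is read as M when it opens the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for index in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        else:
            amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases).
first_line = "GTGGCTGGAG AGCTTAACGT AGACCTGGGC GAGCGAACTT ATAAAATCCA CTGCGGCTCC"
dna = clean_sequence(first_line)
print(translate_cds(dna))  # MAGELNVDLGERTYKIHCGS
```

The output matches the first 20 residues of the protein sequence listed above (MAGELNVDLG ERTYKIHCGS). Passing the full cleaned gene sequence to gc_fraction would give a value to compare against the reported GC content.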