Gene Moth_1597 details

Gene Information

Locus tag: Moth_1597
Symbol:
ID: 3832743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1633017
End bp: 1634192
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 58%
IMG OID: 637829526
Product: HEAT repeat-containing PBS lyase
Protein accession: YP_430446
Protein GI: 83590437
COG category:
COG ID:
TIGRFAM ID:
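
The gene and protein lengths listed above follow from the start and end coordinates. The short Python check below is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1633017
end_bp = 1634192

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1176 391
```

Both values agree with the Gene Length and Protein Length fields in the record.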


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000656599
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.759481
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAA AAAGGAGCCG AAAGCAATTC CCCATGGAAA ATGAAAAAAA ATTAAGGGAA 
CACCACCCCA TAAAACAAAC TCCAGTCAGG GAACCGGTCA GTAAACCCCT TTGCCCTTTC
TGCGGCTTGC CCATTGCTAA ACCGCGGGAA CTGGCGACCC ACCGCCCGGG GGAGATGCCC
GTCGGTTCCT GTTCCTGCGG TGCTGTTTAT GCCTGCGACG TCACCGGCCA TAACCTGGGG
GCCGCCTTTA TTGAAGCCCT GGTTTTCGGT TGTAACCAGG ATTGGGACCT GGCCTGGGGA
TTGTTACCGG AAGAAGATTA CCTGGAAAGA CTGGTGGAAC ATTACGATTA TGATAGCCAC
CTCATTGTTC CGGAGGGAGT TTATGAGGGC CGCAGGGTGA CCGGGGCCCT CTATTTTGTA
CGGTTGCAAG CAGATATCCG GGAAGTAACC GGGCCTGGAG TGCAGAGGAA ATTAAGGGAG
GCCGCTACCC CTGGGGCAGA TAGGCCAGGC TACTGCCACC CTGCCGGTGC CAGGAAATAT
AGTAAAAAAG AGATTACCCA ACTGGTAAAG GGATATCACC TTGAAACTCT GGTGGATATA
GCCCGGCAGG ACAGGCGGGT CCTCGGCGAT CTCCAGCGCC TCCTCTGCTC GGGTGACGCC
CTCCTACGCC TGCAGGCGGC AGACATTCTC GGCCGGGCTG CTGCCGTCTA TGCTACCGGC
GCGCCGGAGA TCATTGCCAA TCTACTCCAG CGCCTCCTGG CCTCCGTCTC CGATCCCGGT
GCCGCCAGCT GGGGCGCCGT TGATGCCATG GGCGAAATTA TCGCCAACGC TCCAGCTACC
TTTGCCGGGT ACATCCCTCA GCTCTATCCC TTCCTGGAAG ATAAAATCCT CCGGCCCAGG
GTACTGCGGG CCTTAGGCAG GATCGCCGGG GTTAAACCGG GGCTTTTACA GCGGGCAGCC
CTGCATTTTT TAACCTTCCT CAGGGACCCC GACCCGGAAA CCAGGGGTTA TGCCGCCTGG
CTCCTGGGCA TCCTGAGAAC GGAGGCCGCC CGGGAGGATC TAAAAGGGCT TCTCGATGAT
CGGCAGGTCG TCGCCGTCTA TCATAACGGC GCCATTGATG AAAAGACGGT GGGGGAACTG
GCCGCCGCAG CCCTGGACCG GCTGGCGGGA GCTTAA
 
Protein sequence
MNKKRSRKQF PMENEKKLRE HHPIKQTPVR EPVSKPLCPF CGLPIAKPRE LATHRPGEMP 
VGSCSCGAVY ACDVTGHNLG AAFIEALVFG CNQDWDLAWG LLPEEDYLER LVEHYDYDSH
LIVPEGVYEG RRVTGALYFV RLQADIREVT GPGVQRKLRE AATPGADRPG YCHPAGARKY
SKKEITQLVK GYHLETLVDI ARQDRRVLGD LQRLLCSGDA LLRLQAADIL GRAAAVYATG
APEIIANLLQ RLLASVSDPG AASWGAVDAM GEIIANAPAT FAGYIPQLYP FLEDKILRPR
VLRALGRIAG VKPGLLQRAA LHFLTFLRDP DPETRGYAAW LLGILRTEAA REDLKGLLDD
RQVVAVYHNG AIDEKTVGEL AAAALDRLAG A
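
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The Python sketch below is one way to do that, not the method used to build this record. It assumes the sequence contains only A, C, G and T. It uses the standard codon assignments, which translation table 11 shares; table 11 differs from the standard code in the start codons it permits, not in the amino acid assigned to each codon. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGAATAAAA AAAGGAGCCG AAAGCAATTC CCCATGGAAA ATGAAAAAAA ATTAAGGGAA"
print(translate(first_line))  # prints: MNKKRSRKQFPMENEKKLRE
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.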