Gene Moth_0498 details


Gene Information

Locus tag: Moth_0498
Symbol:
ID: 3832821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 513340
End bp: 515514
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 56%
IMG OID: 637828432
Product: CRISPR-associated helicase Cas3 family protein
Protein accession: YP_429371
Protein GI: 83589362
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01587] CRISPR-associated helicase Cas3
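
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the record states neither explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 513340, End bp 515514.
length_bp = gene_length(513340, 515514)
print(length_bp, protein_length(length_bp))  # prints: 2175 724
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.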


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAAT TCCTGGCCCA CACTGCAAAT TCCCAGGGTA AAGAACAATC ACTATTAGAT 
CATCTTGAGG CGGTTGCTAA CCTAACCCAA AAATTTAACG AACCACTGGG AGTACCACAT
ATTGGTTACC GTACCGGCCT GTGGCATGAC CTGGGGAAAT TCCATCCGGA GTTCCAGGCT
TACCTGCAAG GCGAAGGCGG CCGCCGCGGC CCCAACCATT CCAGCGCCGG TGCCATGTTG
GCCTCAAAAT ATTTTGAACC CCTGGCCTTC GTTATCGCCG GCCATCACGG CGGCCTGCCC
GCCCGGGCAG AATTAAAGAC TCGCTTAAAA GATAAGGTAC AGCTGAAACG TTATAACACC
GCCCTGCAAA ACGCCCGCCG GGTTATCCCC TCTTTAGAAC CGAAGCAACC CCTGGATATG
GAATTGCCGC CCTTTTTGCA GGAAGGAGCA GGTGTAGATG TAGCTAGCAG GGTTGAATTA
TTCCTGCGCC TTCTATTCAG TTCCCTGGTT GATGCCGATT TCCTTGACAC GGAAAGGCAT
TTTGATCCTT CCCTGACAAA ATTGCGACAA AAGGAAGCCT CCATTGCTAA TTTATGGTCT
TTATTAGCAG CGAACCAGCA AAGGCTAATG GATAAAAGCG GCGGCCATGT CAACCAGATC
CGCCGGGAAA TCTACGAGCA CTGTTGCCGG GCGGCCGAAC TAACGCCAGG TTTCTTTTCT
TTGACCGTAC CCACCGGTGG CGGCAAGACC CGTTCCGGGA TGGCCTTCGC CCTGCTCCAC
GCCCTGCGTT ATCAGAAGGA GAGAATCATT GTCGCCATAC CCTACACAAG CATTATCGAG
CAGACGGCCG ACGTTTACCG CGATATCTTT GGTAACGCCA ATGTCCTTGA GCATCACAGC
GCCGTTGCCC CGCCGCTAGA TCCCGAAAAC CCTACCCCTG AGGAACTGTG GGCCCGCCTG
GCCAGCGAGA ACTGGGATGC CACCCTCATC GTTACAACCA CGGTACAGCT ATTTGAGAGT
TTATTCTCCG ACCGCAGCAG CAGTTGCCGC AAACTCCACA ACATCGCCAA CAGCGTCATC
ATCCTGGATG AGGTCCAGAC CCTGCCGGTA CATCTTTTAG AACCCATCCT TGATGTCCTG
CAGCAGCTGG TCCACTTTTA TGGAGTAACG GTCATCCTGT GCTCGGCCAC CCAGCCGGCC
CTGGAGACCA GCCCCTTTTT CCGGGGACTG CAGGGCGTAA GGGAAATAAT TCCTGATCCG
CAAAAATACT TTGCCCTGCT CAAAAGGGTG GGTTACCAGG TACCCGCCGG CAACGAAAAA
TGGAGCTGGG AACAGGTAGC CGGAGTAATA CGCCAATCCC GCCAGGCCAT GGCTGTTGTT
AACACCAAAC AGGATGCCCT GGCCCTCCTG GAGGCTCTGG ATGACCCTGA AGCTTTACAT
CTCTCCACGC TCCTTTGCGG GGCTCACCGG CGTAAGGTCC TGCAGGAGGT ACGCCGGCGC
CTGGGAAACG GCCGGCCCTG CCGCCTGGTC GCCACCCAGG TAGTCGAGGC GGGGGTTGAT
CTGGATTTCC CCCTCGTCCT GCGGGCTGTG GGTCCCCTGG ACAGGATTGT CCAGGCCGCC
GGGCGGTGCA ACCGGGAAGG GAAGCTGCAA GAAGGCCGGG TGATAATTTT CAATCCGGAA
GAAGGCTGCC TTCCTCCTGG CAGCTATAAG ACCGGGACCG AAATTGCCGC CACGCTGCTC
GCTAGTGTGA CAGGGGTTGA CCTGCATGAC CCCGGCCTGT ACCGGATCTA TTTTCAGCGC
CTCTACCAGT CCTGCACCCT GGATGCCAAA GGAATTCAGG CCAGCCGCCG CAGCTTGAAT
TATCCCGAAG TGGCGCAAAA ATTCCAGATG ATCGAGCAAA GGGTGGTTCC GGTAATTGTC
AACTACCATG AAGGCCCCGA TGACCGGAGG GTGGATGAAC TAATCAGCGC CCTCCGCCAC
CGGGGTGTAA GTCGTCTAAT CATGCGGCAG TTGCAGCCCT ACCTGGTAAA TATCAACGCT
TACACTGTTA ATTCTCTCCA AAGGGAAGGC GTGGTCCAGG AAATATTTCC CGGCCTGTAT
GAATGGCTGG GGGGTTACGA CGAAATTCGT GGCCTGGTGA CGACGGCCCG CGACCCGGAG
GAGCTGGTAT TTTAG
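
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that in Python and to compute a GC fraction, which could be compared with the GC content field. The helper names are hypothetical, and only the first displayed line is used as input; paste the full sequence to check the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only (60 bases).
first_line = "ATGGAACAAT TCCTGGCCCA CACTGCAAAT TCCCAGGGTA AAGAACAATC ACTATTAGAT"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 3))
```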
 
Protein sequence
MEQFLAHTAN SQGKEQSLLD HLEAVANLTQ KFNEPLGVPH IGYRTGLWHD LGKFHPEFQA 
YLQGEGGRRG PNHSSAGAML ASKYFEPLAF VIAGHHGGLP ARAELKTRLK DKVQLKRYNT
ALQNARRVIP SLEPKQPLDM ELPPFLQEGA GVDVASRVEL FLRLLFSSLV DADFLDTERH
FDPSLTKLRQ KEASIANLWS LLAANQQRLM DKSGGHVNQI RREIYEHCCR AAELTPGFFS
LTVPTGGGKT RSGMAFALLH ALRYQKERII VAIPYTSIIE QTADVYRDIF GNANVLEHHS
AVAPPLDPEN PTPEELWARL ASENWDATLI VTTTVQLFES LFSDRSSSCR KLHNIANSVI
ILDEVQTLPV HLLEPILDVL QQLVHFYGVT VILCSATQPA LETSPFFRGL QGVREIIPDP
QKYFALLKRV GYQVPAGNEK WSWEQVAGVI RQSRQAMAVV NTKQDALALL EALDDPEALH
LSTLLCGAHR RKVLQEVRRR LGNGRPCRLV ATQVVEAGVD LDFPLVLRAV GPLDRIVQAA
GRCNREGKLQ EGRVIIFNPE EGCLPPGSYK TGTEIAATLL ASVTGVDLHD PGLYRIYFQR
LYQSCTLDAK GIQASRRSLN YPEVAQKFQM IEQRVVPVIV NYHEGPDDRR VDELISALRH
RGVSRLIMRQ LQPYLVNINA YTVNSLQREG VVQEIFPGLY EWLGGYDEIR GLVTTARDPE
ELVF
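
The protein sequence corresponds to a translation of the gene sequence under translation table 11. That table uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it permits, so a plain standard-code lookup is enough for a gene that begins with ATG, as this one does. The following is a minimal Python sketch, assuming translation stops at the first in-frame stop codon; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed lines of the gene sequence.
print(translate("ATGGAACAAT TCCTGGCCCA CACTGCAAAT TCCCAGGGTA AAGAACAATC ACTATTAGAT"))
print(translate("GAGCTGGTAT TTTAG"))
```

The two calls yield MEQFLAHTANSQGKEQSLLD and ELVF, which match the first 20 and the last 4 residues of the protein sequence shown above.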