Gene Moth_1025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1025 
Symbol 
ID3832645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1053899 
End bp1054984 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content59% 
IMG OID637828953 
ProductDNA processing protein DprA, putative 
Protein accessionYP_429882 
Protein GI83589873 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00128425 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000696536 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGAGGAAA AACCATTCTG GGTAGCCCTG CAGCAGACAC CCGGCCTGGG AGCCCGCCGG 
GTCCTGCAAC TGGTCAAATA CTTTGGCGGT GCCCGGGCTG CCTGGGAGGC TCCGGAAGGG
GAACTCTTAA CCCTGGAGGG CCTGGGTAAA GGAGCAGTTT CCCTGCTAAA TTGGCGCCGC
CAGGTACATC CGGAGAAAAT AATGGTTTCC TTGGCTGCTG CCGGCATCGG GGTCATAACC
ATCGAAGAAG AAGTCTATCC TCCGGAATTG AAACGTATTT ACGACCCGCC CCCGGTTCTT
TACTGGCGGG GCAGCCGGTT GCCGGGGGAA GGCTTTAAGA TAGCCATTGT TGGTACCCGC
CGGGCGACGG CCTACGGTCT GAAGGTGGCC GAAGAACTGG CGGCCGGCCT GGCGGAGGCT
GGCGTAGGGG TAGTCAGTGG CCTGGCCCGG GGTATTGATG CTGCTGCCCA TAGGGGCGCC
ATCAAGGGCG GGGGATTGAC CTGGGGCATC CTTGGCTGCG GCGTTGATAT AGTTTACCCG
CGGGAACACC GGGAACTCTA CCGCCAGGTT ATGGAACACG GGGCAATTAT CTCGGAGTTT
CCCCCAGGGA CGCCGCCGGA TGCCGGCCAT TTTCCGGCCA GGAACAGGAT TATCAGCGGC
CTGACGGCTG GGACAGTGGT AGTCGAGGCG GCGGCCAGGA GCGGCGCCCT CATAACTGCC
GACCTGGCCC TGGAGCAAAA CCGCGATGTT TTTGCGGTCC CAGGTCCCAT CACCAGCCGT
TATAGCCAGG GCCCACACGA TTTGATTAAG CAAGGAGCCA AGTTAGTAAG CGGCGTAGCA
GATATTTTGG AAGAATATGA GCCCCGGTCG CTATGGAGTT TACCCCGGGA AGAAACTCGG
GCCTCTGTAA CCCTGAACGC TATCGAGGAG AAGGTGTTGG CCGTTTTGGA GGCGACCCCA
TCTCACCTGG ATGTCATTAT GGCGGCCACT GGTTTACCGG CCGGTGAACT TAATACAGCT
TTAATCATGT TGGAGATGAA GCAATTGATC CGGCGGTTGC CGGGAGGTTT TTATGTGCGT
TGCTAA
 
Protein sequence
MEEKPFWVAL QQTPGLGARR VLQLVKYFGG ARAAWEAPEG ELLTLEGLGK GAVSLLNWRR 
QVHPEKIMVS LAAAGIGVIT IEEEVYPPEL KRIYDPPPVL YWRGSRLPGE GFKIAIVGTR
RATAYGLKVA EELAAGLAEA GVGVVSGLAR GIDAAAHRGA IKGGGLTWGI LGCGVDIVYP
REHRELYRQV MEHGAIISEF PPGTPPDAGH FPARNRIISG LTAGTVVVEA AARSGALITA
DLALEQNRDV FAVPGPITSR YSQGPHDLIK QGAKLVSGVA DILEEYEPRS LWSLPREETR
ASVTLNAIEE KVLAVLEATP SHLDVIMAAT GLPAGELNTA LIMLEMKQLI RRLPGGFYVR
C