Gene Moth_2014 details

Gene Information

Locus tag: Moth_2014
Symbol:
ID: 3831968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2099878
End bp: 2102067
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 61%
IMG OID: 637829943
Product: ATP-dependent DNA helicase PcrA
Protein accession: YP_430853
Protein GI: 83590844
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID: [TIGR01073] ATP-dependent DNA helicase PcrA
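
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive start and end coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
gene_len = cds_length(2099878, 2102067)
print(gene_len)                  # 2190
print(protein_length(gene_len))  # 729
```

Both results agree with the Gene Length (2190 bp) and Protein Length (729 aa) fields.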


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGTAT TGGCAGATTT TATGGCTTTA CTCAATGGTC CCCAGCAGGA GGCCGTCAAG 
CACCGGGGCA CACCCCTCCT GGTCCTGGCC GGAGCCGGCA GCGGCAAGAC CCGGGTCCTT
ACTTACCGGG TGGCCGCCCT GATTCAGGAA GGTGTTCGCC CGGAAAACAT CCTCGCGGTT
ACCTTTACCA ATAAAGCGGC CCAGGAAATG AAAGAACGCC TGGAGGGCCT GGTAGGGGAG
GCGGCCCGGG GCCTCTGGGT CAGCACCTTC CATTCAGCCT GTGTCCGCAT TCTGCGGCGG
GAGGCCCACC TCCTTGGTTA TCGGCCGAAT TTTGTCATCT ACGATACCGA CGATCAGCAG
GCGGCCCTTA AAGAGGTCTT AAAAGAGTTA GACCTGGATG ATAAAAAATA TCCGCCTCGT
TCCCTGGCTC AGGTCATTAG CATGGCCAAG AACGACCTTA AGACACCGGA GCGTTTCCTG
GACGGCGCGG CCACCTTCCG GGAGCAACAG CAGGGGCAGA TTTACCGCCG TTATCAGGAG
AAGCTGCGGG AATTAAACGC CATGGACTTT GACGATCTTA TCATGCAAAC GGTCTTTCTC
TGGCAGCAGA ATCCCCTGGT ATTACGTTAC TACCAGCAGC GCTGGCAGCA TATCCTGGTG
GACGAATACC AGGATACCAA CCACGCCCAG TACATCCTGG TGCGTCTCCT TGCCGGTAAA
GGGGACAACC TCTGTGTCGT CGGCGACCCG GATCAGGGTA TCTACGGCTG GCGGGGAGCT
GATATCGGTA ACATACTGGC CTTTGAGGAG GACTTCCCCC GGGCCCGGGT GATTCTCCTG
GAAGAAAACT ACCGGTCCAC CCGGCCCATC CTCCAGGCCG CCAATGCCGT TATCCAGCAT
AATGAGGGGC GCCGGGAGAA ACGCCTCTGG ACACGGCGGC GGGAAGGAGA GCTGTTGCAC
CTGTACCGGG CGACTGACGA GCGGGATGAG GGGCGGTTTA TCGCCGGCGA GGTTTACCGC
CGGCACCAGC AGGAGGGGCG GCCCTTCAGC GACTTTGCCG TCCTCTACCG CACCCATGCC
CAGTCGCGGG CCCTGGAAGA GGCCTTTATC CAGGCCGGTG TCCCCTACGA GATTGTCGGC
GGCCTGAAGT TCTACCAGCG CAAGGAGATT AAGGATATCC TGGCCTATCT GCGGGTAATT
GCCAACCCGG ACGATAGCCT CAGCCTGTTG CGGATTATCA ACGTCCCCCG GCGGGGTATC
GGCGAGGCCA CCCTGGCCAG GCTGGAAGCC GCCGCTACCA GTGAGGGGGA GAGCCTTTAC
CGGGTCCTGG AGCGGGTGGA CACCATCCCC GGCATCCCGG CCCGGGGCCG GCAGGCACTG
CGGGAACTGG TGGAAATGCT GGATAACCTT CGCCAGCAGC AGGAAAAAAT AACGGTGACG
GACCTGGTGG CTACCATCTT GCAAGAGACG GGTTATCAAG CAGAACTGGA GGCGGAGCGG
ACTCCTGAGG CCCAGGCCCG GCTGGAGAAC TTGAAGGAAT TCCAGACGGT GACCCGGAGC
TACGACCAGG GGGCTCCGGA ATCATCCCTG GGAGATTTCC TGACCCAGGT AGCCCTGGTG
GCGGAGAGTG ACACCTATAG CGGCAATGCC GCGGTGGCCT TAATGACCAT GCATACGGCC
AAGGGACTGG AATTCCCGGT GGTCTTCCTG GCCGGCCTAG AAGAGGGGGT TTTCCCCCAT
TTTCGCTCCC TGGACGACCC GGCGGAGATG GAAGAAGAAC GGCGACTCTG CTATGTGGGC
ATGACCAGGG CCAGGGAGGT CCTGTATCTC ACCCATGCCT GGACGCGTAA CCTTTACGGT
AGCACCATGA GCAATCCTCC CTCCCGGTTT CTGGATGAGA TACCGGCCGA CTTGATCCAG
GGCGAAGGAA CCGGCCTGCG GTCCGGGGTC CTTAGCCGGA CCAGTGAGGA ACGGGATGAC
CGGGGCCGAT CAACCCGGCG CCAGCCGGCA CCGGCAAGGA GCACCCGGGC CTCCTGGCAG
GTGGGGGACA AGGTCCAGCA TGACGCCTGG GGCCTGGGCG TCATAGTAAA GATCAGCGGC
GAAGGCGATG ATGCCATCAT CAGCGTCGCC TTTCCCGGGC GGGGCATCAA GCAACTGGCC
CTCCGCTACG CCCCGATTAG GAAGGCTTAA
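
A GC content figure such as the one in the Gene Information section is normally the percentage of G and C bases in the sequence. The helper below is a minimal sketch under that assumption; `gc_content` is a hypothetical name, and it strips the whitespace used in the 10-base grouping above before counting.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Small illustrative inputs (not the full gene sequence).
print(gc_content("GGCC AATT"))  # 50.0
```

Pasting the full gene sequence into `gc_content` gives a value that can be compared against the reported GC content field.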
 
Protein sequence
MKVLADFMAL LNGPQQEAVK HRGTPLLVLA GAGSGKTRVL TYRVAALIQE GVRPENILAV 
TFTNKAAQEM KERLEGLVGE AARGLWVSTF HSACVRILRR EAHLLGYRPN FVIYDTDDQQ
AALKEVLKEL DLDDKKYPPR SLAQVISMAK NDLKTPERFL DGAATFREQQ QGQIYRRYQE
KLRELNAMDF DDLIMQTVFL WQQNPLVLRY YQQRWQHILV DEYQDTNHAQ YILVRLLAGK
GDNLCVVGDP DQGIYGWRGA DIGNILAFEE DFPRARVILL EENYRSTRPI LQAANAVIQH
NEGRREKRLW TRRREGELLH LYRATDERDE GRFIAGEVYR RHQQEGRPFS DFAVLYRTHA
QSRALEEAFI QAGVPYEIVG GLKFYQRKEI KDILAYLRVI ANPDDSLSLL RIINVPRRGI
GEATLARLEA AATSEGESLY RVLERVDTIP GIPARGRQAL RELVEMLDNL RQQQEKITVT
DLVATILQET GYQAELEAER TPEAQARLEN LKEFQTVTRS YDQGAPESSL GDFLTQVALV
AESDTYSGNA AVALMTMHTA KGLEFPVVFL AGLEEGVFPH FRSLDDPAEM EEERRLCYVG
MTRAREVLYL THAWTRNLYG STMSNPPSRF LDEIPADLIQ GEGTGLRSGV LSRTSEERDD
RGRSTRRQPA PARSTRASWQ VGDKVQHDAW GLGVIVKISG EGDDAIISVA FPGRGIKQLA
LRYAPIRKA
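
The gene sequence starts with GTG rather than ATG, yet the protein starts with M. That is consistent with translation table 11, where GTG can act as an initiator and is then read as methionine. The sketch below illustrates this with a hand-built codon table. The start-codon set is deliberately limited to ATG, GTG and TTG (an assumption for brevity; the full table lists more initiators), and `translate_cds` is a hypothetical helper, not a library function.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids. The amino-acid
# assignments of table 11 match the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Subset of table 11 initiator codons (assumption: enough for this record).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAGGTAT TGGCAGATTT TATGGCTTTA"))  # MKVLADFMAL
```

The output matches the first ten residues of the protein sequence. Translating the last 30 bases the same way yields LRYAPIRKA followed by the TAA stop codon.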