Gene Moth_0823 details


Gene Information

Locus tag: Moth_0823
Symbol:
ID: 3831615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 851587
End bp: 853662
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 61%
IMG OID: 637828753
Product: hypothetical protein
Protein accession: YP_429683
Protein GI: 83589674
COG category: [L] Replication, recombination and repair
COG ID: [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain
TIGRFAM ID:
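
The start and end coordinates, gene length, and protein length listed above can be cross-checked against one another. The Python sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 851587
END_BP = 853662


def gene_length(start, end):
    """Length in bp of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2076 691
```

Both results agree with the Gene Length and Protein Length fields.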


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00604121
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0176105
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTCCG GGGAAATAAG AGTCGAAGGC AGGAACGTCG CCGAAGCAAT ACAGCAGGCA 
GCAGCCATCC TGGAGGTCTT GCCTGAGGAG CTGGAGGTAG AGATAGTTCA ACCTGCAAGC
CAGGGGTTTC TTGGCCTCGG CCGCCGGCCG GCGGTGATTA TCGCCCGCCT GGCCGGGACA
TTCCCCGGAG CTGCAGGGGA AGACGAAAAT GTGACCCCGG CAACCGGTAA GGGAGGAGCA
GGTTCCGTAA CCGGCGGTCC GGATGCAGGT GCGGTGGCAG GTGGGATGGC CCCCGGGCTT
GGGGGAGTGG ACGAGGAACG GCCAGGTTAT GCCTGGGTGG AAGGCGGCCG GGTGATGGTG
GGCGATGGGG AACCGCCGGC CACCATCATC GCCGGGGAAC ACTTTGACCT GTATGTAAAC
GGTCAACATG CACCCAGCGA AATAACGGTC CGAGCCGGGG ATACTATAGA GGTACAGTCC
CGGGTCGAGG TCGTAAAGGG CCAGTGGAAA CTGGCTGTAA GCAGTGACGG CTTGGCGGCC
CGCCTGACCC TCAAGCCGGG GTATCAGAGG ACCTGGCGGT TAAAGGACCA ATTCCCGGCG
GGGCGGCTGC AACTGGAGGG CGAGGAAGTG GTCCAGGCCC TTCCGGCCCT GACCCGGGAG
GACCTGCTGG CCGAACTAAA GCGCCAGCAG GTGGTTTTCG GCGTTGATGA AGAAGCCGTC
CGGCAGGCCT GTGCCTCTAC CGAAGCCGGA GAAATAGTAG TAGCCAGGGG CCAGCCGCCC
CAACCTGGTG AAGACGGCCG GGTGGAATGT CTTTTTTCTA CTGCCGTAGC TGACAGGGTG
GAGGTAGGAG AAGAAGAACG GGTCGATTAC CGGGAGATGA TGGTACGTAC TTCGGCCGCC
ATAGGCGATC TCCTGGCGGT CAAGGTGCCG CCCCGGCCGG GAAAGCCTGG CAAGACGGTT
ACCGGGAAGG TAATCTCCCC GCCGGAGCCC CGGGATGTGG AACTGGTGGC CGGTAAAGGC
ACCGAGGTTA TTGACGGCCT GCGCTGCATA GCCCGGGAGG AAGGGCGTCC CCAGCTGCGC
CAGCTGCGGG GCCGGGCCTT GATTGACATT ATTCCGGTAC TCCTCCACCA GGGGGATGTG
GACCTGTCCT CAGGGAACCT CAAGTTTAAG GGTGGCATCC ATATCACAGG CAACGTCACG
GAAACGATGC AGGTGGCTGC CAGCCAGGAT GTGGAGATTG GCGGTGATGT AACCCAGGCC
ACTGTAAACG CCGGCGGTTC AATTATCGTC CGCCGCAACT GCATCGGTTC CACCCTGGTG
GCCGGGGGCC TGAATAGCGT CTACCAGAGC GCGGAACCCA TCCTGAGCGA TCTGGCCAGC
CAGTTGCCGT TGCTCCAGGC GGCCCTCCAC CAACTGGAGA AGGGGGCCAG CCGACAGCGG
TACCAGGTTC GCCCCCTTGA TACCGGATAT ATGGTAAGCA TTCTGGTAGA AAACAAGTTT
CAGAAGCTGC CCTTCCTGGC CGGCAAGCTG GAGCAGCTGA CAAAGGGCAT CCAGGGCGGG
CCCCAGGAGC AAAAGCTAGT CCAACTGGCC CGCGAGCTCG CCCAGTCCTT CCGTAATCCG
GCTGCCATGC AGGGATTGGG TTCGTCTATC CTGGCCAGGC TGGTAAATGG TGTCCAGGAG
ATGGCAGCCT ATTGCCGGGA GATCCCCCAG GACGGCGGTC ATATCACCCT GAGCTATGCT
CTCAACAGCA AGCTTCAGGC CAGCGGCCAG GTAAAGGTCA CCGGCCGGGG GTGCTTCAAT
ACCGAGATTA TTGCCGGCGG CAAAGTAGAA ATCCATGGGG TATTCCGGGG CGGTAGCATT
TATGCCGGCG ATGAGGTCTA TGTGCAGGAA ATGGGTTCCA GCGGCGGCGC CAGAACTTTG
ATTCGAGTGG CGGAGGGCAA GGTCATCAAA GCCGGCAGGA TCTGGCCTAA TAGTACCCTG
CAGGTAGGCA AGCGCGTCCG CCAGATTGAT AACGAGGAGA ACCAGGTCAT GGCCTACCTG
AATAGTGAAG GCGACCTGGT TCTAGGTACT TTTTAA
 
Protein sequence
MTSGEIRVEG RNVAEAIQQA AAILEVLPEE LEVEIVQPAS QGFLGLGRRP AVIIARLAGT 
FPGAAGEDEN VTPATGKGGA GSVTGGPDAG AVAGGMAPGL GGVDEERPGY AWVEGGRVMV
GDGEPPATII AGEHFDLYVN GQHAPSEITV RAGDTIEVQS RVEVVKGQWK LAVSSDGLAA
RLTLKPGYQR TWRLKDQFPA GRLQLEGEEV VQALPALTRE DLLAELKRQQ VVFGVDEEAV
RQACASTEAG EIVVARGQPP QPGEDGRVEC LFSTAVADRV EVGEEERVDY REMMVRTSAA
IGDLLAVKVP PRPGKPGKTV TGKVISPPEP RDVELVAGKG TEVIDGLRCI AREEGRPQLR
QLRGRALIDI IPVLLHQGDV DLSSGNLKFK GGIHITGNVT ETMQVAASQD VEIGGDVTQA
TVNAGGSIIV RRNCIGSTLV AGGLNSVYQS AEPILSDLAS QLPLLQAALH QLEKGASRQR
YQVRPLDTGY MVSILVENKF QKLPFLAGKL EQLTKGIQGG PQEQKLVQLA RELAQSFRNP
AAMQGLGSSI LARLVNGVQE MAAYCREIPQ DGGHITLSYA LNSKLQASGQ VKVTGRGCFN
TEIIAGGKVE IHGVFRGGSI YAGDEVYVQE MGSSGGARTL IRVAEGKVIK AGRIWPNSTL
QVGKRVRQID NEENQVMAYL NSEGDLVLGT F
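
The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where several non-ATG codons can act as start codons. The Python sketch below illustrates this on the first 30 bases of the gene sequence. The codon table string follows the standard NCBI ordering (bases T, C, A, G), and the set of start codons is an assumption based on the NCBI definition of table 11; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_bases = "GTGACGTCCG GGGAAATAAG AGTCGAAGGC"
print(translate(first_bases))  # MTSGEIRVEG
```

Applying `translate` to the full gene sequence should give the 691-residue protein listed above, and `gc_fraction` can be compared against the GC content field in the same way.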