Gene Moth_2067 details

Gene Information

Locus tag: Moth_2067
Symbol:
ID: 3831098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2159458
End bp: 2160396
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 58%
IMG OID: 637829995
Product: hypothetical protein
Protein accession: YP_430905
Protein GI: 83590896
COG category: [L] Replication, recombination and repair
COG ID: [COG3285] Predicted eukaryotic-type DNA primase
TIGRFAM ID: [TIGR02776] DNA ligase D
            [TIGR02778] DNA polymerase LigD, polymerase domain
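
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2159458
end_bp = 2160396

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 939
print(protein_length_aa)  # 312
```

Both results agree with the Gene Length and Protein Length values listed in the record.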


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0552463
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCGGG AAATGACAAC AGGACAGCAG GTTAACGGGG AGCCCTATAT GCTGCCCCGG 
TTGCCGGGAC AGCAGCTCCG GCTGACCAAC CTGGACAAGG TCTTCTGGCC GGAGGGGCTG
ACCAAGTTTG ATCTCGTCGA ATATTATGTC GACATGGCCC CCGGCATCCT ACCTTACCTG
CGGGAACGTC CCCTGGTCCT GAAGCGCTAC CCGGACGGCA TTACAGGGGA GGCCTTTTAC
CAGAAAGAGT GCCCTGCCTA TGCCCCAGAG TGGGTGGCGA CCCTGCCTGT CTATCACACC
GATAGCGATA AAACCATCAA TTACGTTCTC TGCAATAACG AAGCAACCCT GGCCTGGCTG
GCCAACCAGG GGTGCATCGA GGTCCATGCC TGGCTCTCCC GGGCCGGTCG CCTGGAATAC
CCGGATATCG TTGTCATGGA CCTCGACCCT GCGGACGGCA CTACCCTTGT CGATGTGCTG
GAAATCGCCC TCTTGGTCAA CCGGGCTTTA AAGGAACTCC ACCTCACCGG CTACCCCAAA
AATTCAGGCG CCAGGGGCCT GCATATTTTC ATCCCCCTTT ATCCCCGCTG GACCTTCCGG
GAAGTTACGG CTGCCATGGG ATACCTGGCG CATCTCATTG TGCAGGTTTA CCCCCGCAAA
GCCACCACCG AGCACCTTAT CCACAGGCGC CGGGGCAAAG TCTACCTGGA TTACCTGCAA
AATGTACAGG GGCGGTCCAT GACCTTTCCC TACAGCCTAC GGCCCCTGCC CGGGGCCCCG
GTTTCCGCCC CCTTGACCTG GGAAGAAGTG GCGGCGAAAA AGATTTATCC CGGAGATTTC
AATATCATCA GGCGCCGCCT GGAAGAATGG GGCGACTTGT ACCGGGAACT CCTGGAGCGC
CCCAATGATT TAACACCCCT GTTAGAGCTG GCCATATAA
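
The GC content reported in the Gene Information fields can in principle be recomputed from the gene sequence above. The sketch below shows one way to do that; `gc_percent` is a hypothetical helper, and it assumes the 10-base blocks are separated only by whitespace. It is demonstrated on the first block only, not on the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence listed above.
first_block = "ATGGACCGGG"
print(gc_percent(first_block))  # 70.0
```

Passing the full gene sequence, with or without the spacing, would give a whole-gene figure to compare against the reported value.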
 
Protein sequence
MDREMTTGQQ VNGEPYMLPR LPGQQLRLTN LDKVFWPEGL TKFDLVEYYV DMAPGILPYL 
RERPLVLKRY PDGITGEAFY QKECPAYAPE WVATLPVYHT DSDKTINYVL CNNEATLAWL
ANQGCIEVHA WLSRAGRLEY PDIVVMDLDP ADGTTLVDVL EIALLVNRAL KELHLTGYPK
NSGARGLHIF IPLYPRWTFR EVTAAMGYLA HLIVQVYPRK ATTEHLIHRR RGKVYLDYLQ
NVQGRSMTFP YSLRPLPGAP VSAPLTWEEV AAKKIYPGDF NIIRRRLEEW GDLYRELLER
PNDLTPLLEL AI
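
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    # Walk the sequence in steps of three, ignoring any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above (60 bases).
first_line = "ATGGACCGGG AAATGACAAC AGGACAGCAG GTTAACGGGG AGCCCTATAT GCTGCCCCGG"
print(translate(first_line))  # MDREMTTGQQVNGEPYMLPR
```

The output matches the first 20 residues of the protein sequence above. The final codons of the gene, GCC ATA TAA, translate to the closing residues AI followed by the stop.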