Gene Moth_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2082 
Symbol 
ID3831832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2173097 
End bp2174017 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content57% 
IMG OID637830008 
Producthypothetical protein 
Protein accessionYP_430918 
Protein GI83590909 
COG category[L] Replication, recombination and repair 
COG ID[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02778] DNA polymerase LigD, polymerase domain 


Plasmid Coverage information

Num covering plasmid clones68 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCAGG GCCGGCAATA TATACGGTCC CAATTGCTGA AGCGACAGCT CCACTTGACC 
AACCTGGACA AGGTTTTCTG GCCGGAAGGT CTGACCAAGT TTGATCTTAT CAAGTATTAT
GTCGACATGG CTCCCTTTCT CTTGCCCTAC CTCCGGGATC GCCCCCTGGT CCTTAAGCGC
TACCCGGATG GCATAACAGG GGAGGCCTTT TACCAGAAAG AGTGCCCTGC CTATGCCCCG
GACTGGGTTA CGACCCTGCC TGTCTATCAT GCTGATAGCA GCAAGACTAT TAATTATGTT
CTCTGTAATA ATGAAGAAAC CCTGATCTGG CTGGCCAATC AGGGGTGTAT CGAGGTCCAT
GCCTGGCTCT CCAGGGCCGG CCGCCTGGAA TACCCCGATA TCGCCGTCAT GGATCTGGAC
CCAAGTGCGG GGGCAACTTT TAAAGATGTC TTGGATATCG CCCTCCTGGT CCACCAGGCT
TTAAAGGAGT TTAACCTCAG CGGCTATCCT AAAACCTCCG GTGCTACTGG GTTGCATATC
TTTATCCCCC TTGAACCTCG CTGGACCTTT CACCAGGTGA CAGCCGCCAT GGGGTACCTG
GCGCGGCTGG TCGCCGGGGT TTACCCCCGC AAGGCCACCA CCGAACGGTC GATCCCGAAG
CGTAAAGATC GGGTCTACCT GGACTACCTG CAGAACGTCC GCGGACGGTC CATGGCCTTC
CCCTACAGCC TGCGACCCTT ACCCGGGGCG CCGGTTTCAA CGCCCCTGAC CTGGGAGGAG
GTAAAGAGGG GGATGTTCAG CCCCAAAGAC TTCAACATCC ACACCGCCCG GGAGCGCCTG
CAGGCGTATG GCGACCTTTA TCGGGGTTTT CTGGCGCAAC CAAACGATCT GGAACCGCTG
CTTAAACTGG CCGGGGTTTA A
 
Protein sequence
MGQGRQYIRS QLLKRQLHLT NLDKVFWPEG LTKFDLIKYY VDMAPFLLPY LRDRPLVLKR 
YPDGITGEAF YQKECPAYAP DWVTTLPVYH ADSSKTINYV LCNNEETLIW LANQGCIEVH
AWLSRAGRLE YPDIAVMDLD PSAGATFKDV LDIALLVHQA LKEFNLSGYP KTSGATGLHI
FIPLEPRWTF HQVTAAMGYL ARLVAGVYPR KATTERSIPK RKDRVYLDYL QNVRGRSMAF
PYSLRPLPGA PVSTPLTWEE VKRGMFSPKD FNIHTARERL QAYGDLYRGF LAQPNDLEPL
LKLAGV