Gene Moth_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1052 
Symbol 
ID3831858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1082781 
End bp1083749 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content58% 
IMG OID637828980 
Productphosphoesterase, RecJ-like 
Protein accessionYP_429909 
Protein GI83589900 
COG category[R] General function prediction only 
COG ID[COG0618] Exopolyphosphatase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000287381 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000192246 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGATGA GTGAGATCGG GCAGATAGCG GCTATTCTGG CAACCGCCCG GGAAGTGGCG 
GTTGCCACCC ATATTATACC GGATGGCGAT TGCCTGGGGT CGATGCTCGG CCTTACCCTG
GCCCTCCGGA AACGGGGCAC CAGCGTAATA GCTATTAATG CCGACCCGGT TCCGGAGATG
TTTCAGTACC TGCCTGGCCA GGAAACCATC ATCGACCCGG ACCAGGTAAC CTCTATGCCG
CCGTTACTGG TCATGGTCGA CTGCACTGAT ATGGAACGGG CCGGTAAGGG TTTTAGCAAC
TGGCAACAGC GGGTAGAGAA AATAATTAAT ATCGATCATC ACGTCAGCAA CACCCGTTTC
GGCCACCTGA ACCTGGTTGA CAGCCGGGCG GCGGCCACGG CGGAATTGAT TTACGCCGTC
CTAGAACAAA TACCGGCAAC CTTTACGCCG GAGGTAGCAA CCTGCCTTTA TACAGCCCTG
GCCACTGATA CCGGCTCTTT CCAGTATGAA AATTGTACGG CCAGGACCCT GCGCCTGGCA
GCCAGCCTGC TGGAGAAAGG GGCCGATATG CCCTTAATCC GGGAGCACCT CTGGGAGAGT
AAGCCCTTAA ACAGCATCCG CCTTCTGGCG GCTACCCTAC CCACCCTGAC TCTGGCCTAT
GAAGGTCGGG TGGCCTGGAT GACAGTATCC AGGGCAGCCC TGGAGGCCAA TGGCGCCAGG
CCGGAACACG CCGAGGGCCT GGTAAATTAT CCTCGCAGTA TTGCCGGCGT AGAAGTAGGC
ATGCTTTTCC GGGAATTGCC GGATGGCAAG GTCAAAGTAA GCCTGCGTTC GAAAAAAATT
GTCGACGTCA ACAGGGTGGC GGCATTATTT GGCGGCGGCG GTCACCGCCG GGCCGCTGGC
TGTACCCTTG ACGGCGATCT GGATACAGTA GTCGCCAGGG TTGTGGCTGC AGCCGGTGAG
GCCCTGTAA
 
Protein sequence
MLMSEIGQIA AILATAREVA VATHIIPDGD CLGSMLGLTL ALRKRGTSVI AINADPVPEM 
FQYLPGQETI IDPDQVTSMP PLLVMVDCTD MERAGKGFSN WQQRVEKIIN IDHHVSNTRF
GHLNLVDSRA AATAELIYAV LEQIPATFTP EVATCLYTAL ATDTGSFQYE NCTARTLRLA
ASLLEKGADM PLIREHLWES KPLNSIRLLA ATLPTLTLAY EGRVAWMTVS RAALEANGAR
PEHAEGLVNY PRSIAGVEVG MLFRELPDGK VKVSLRSKKI VDVNRVAALF GGGGHRRAAG
CTLDGDLDTV VARVVAAAGE AL