Gene Moth_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0005 
Symbol 
ID3831877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp3229 
End bp4341 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content54% 
IMG OID637827932 
ProductDNA replication and repair protein RecF 
Protein accessionYP_428888 
Protein GI83588879 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000369508 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000145417 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCTGCTT TAAGCTTACA ACAGCTCCAG CTGATTAATT TCCGCAGTTA TAAATGCCTT 
ACCTGGGATT GCCGGCCCGG GCTAAATATT ATTTTTGGGC CCAATGCTGC CGGTAAAACT
AATCTCCTGG AAGCCATTGG TTACCTGGCC CTCGCCCGAT CCTTCAGGCA GCAACAGGAC
CAGCAATTGT TGACCTGGGG AGCAAGCTCC TTCCAGGTGC GAGGATTATG CCACAGTAAT
GGTGAAAAAA TTGAGCTGGT AATTAACTAC CAGCAGCACA ATAAAAGGTT GACAATTAAC
GGCAACCGCA ACCGCTTAAT CGAACTCCTG GGTATATTCC CCGTCATTTA CTTCGGACCT
GATGACCTGC ACCTCCTTAA GGGCGGCCCG GCCTACCGGC GCCATTTCCT GGATCGGGAG
ATCAGTATGG GCGATCGCCT TTACTGCCGC AATCTGCAAG ATTACCGGCG CATTCTCTTC
CAGCGTAATC TTTTGTTGCG GGCCATTAAG GCCGGCCGGG GGAAGGAGGG GGAGCTGGAA
CCCTGGGATA TCCAGTTATT AGCAACAGGT AAGGCTATCT GCGAAAAGCG ATCTTGTTTT
TTACAATCTT TGGCACCGAG GGTTGCCGCT ACCTACCGGG ATATGGCCGG GGGGGAAGAA
CTGGCCCTCA TTTATCGGCC CGGGGTTGCC AGCCAGGAAG AGTGGGCGGA AAGGCTAAAG
GTCGGCCGCG AAAGGGAGGT CCAGGCCGGT ATGACCCTCT GGGGTCCCCA CCGGGATGAC
TTTACCTTTA CCCTGGACGG TCACGAGGCC CGTTATTTTG CCTCCCAGGG CCAGCAAAGA
GCTATCGTTC TGGCCTTGAA ACTGGCTGAG GCCCGGTATT ACCGGGAACT TTTGCATGTA
ATGCCAGTTT TGCTCCTGGA TGACGTTTTT TCCGAACTTG ATGAAGCGCA CCAGGGGGCG
TTACTGGAGT TACTGGCAGG GGCCGACCAG GCTTTTTTAA CGACCACGGA GGTCGGCTTA
TTACCAGCCA GGCTTATACA ACGCTCCCAT CTCTGGGAAT TAGCCCGGGG AAGGGAACCC
CGGCTCACCT CCGGGCCTGT TGAGGCACAG TAA
 
Protein sequence
MPALSLQQLQ LINFRSYKCL TWDCRPGLNI IFGPNAAGKT NLLEAIGYLA LARSFRQQQD 
QQLLTWGASS FQVRGLCHSN GEKIELVINY QQHNKRLTIN GNRNRLIELL GIFPVIYFGP
DDLHLLKGGP AYRRHFLDRE ISMGDRLYCR NLQDYRRILF QRNLLLRAIK AGRGKEGELE
PWDIQLLATG KAICEKRSCF LQSLAPRVAA TYRDMAGGEE LALIYRPGVA SQEEWAERLK
VGREREVQAG MTLWGPHRDD FTFTLDGHEA RYFASQGQQR AIVLALKLAE ARYYRELLHV
MPVLLLDDVF SELDEAHQGA LLELLAGADQ AFLTTTEVGL LPARLIQRSH LWELARGREP
RLTSGPVEAQ