Gene Moth_0296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0296 
Symbol 
ID3832958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp302365 
End bp303384 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content52% 
IMG OID637828231 
ProductrecA protein 
Protein accessionYP_429173 
Protein GI83589164 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.197187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCG GCCGCGAACA AGCCCTTTCC AAAGCCCTGG CAGAGATTGA AAAAAAGCAT 
GGTAAAGGCG CCATCATGCG CCTGGGCCAG CAGGAACGCC TGAACGTCGA AGCCCTGCCC
ACAGGGATAC TCCCCCTGGA CCTGGCCCTG GGCGTAGGCG GCTTGCCCAG GGGCCGCATA
ATCGAAATCT TCGGCGAGGA AGGATCAGGA AAGACAACCA TTGCCCTGCA CGCCGTCACC
GAAATCCAAA AAGGCGGCGG CAATGCCGTT CTGATCGACG CCGAGCACGC CTTTGACCCC
AACTACGCTA AAACCATCGG GGTAGATATT GAGAATTTGC TTGTCAGCCA GCCTGAATAT
GGCGAGCAGG CTTTAGAAAT AGCCTCGGCC CTTATCCAGT CTTCCGCCGT GGATATAATC
GTGATAGATT CCGTTGCCGC CCTGGTACCG AAAAAAGAAT TAGAAGGGGA ATTCGGGGAT
GCTATTGTCG GGCTCCAGGC CAGGCTCATG TCCCAGGCCA TGCGGAAGCT TTCCGGGATA
GTCTCCAAGT CCAAAACGAT AGTCATTTTT ATCAATCAAC TAAGGGAAAA GATAAACACC
GGCGGCTCAT TCGGCAAACC GGATTTCGTT ACGACCGGCG GCAGAGCGCT TAAATTCTAT
TCATCTGTAA GGATTGAGGT GCAGCGAGGG GATCAGATTA AAAAAGGGAC GGAAGTGGTA
GGACATAAAA TGAAAACGCG GGTTGTAAAG AACAAGGTTG CGCCACCGTT CCGGGGATGT
GAGTTAGATT TGATCTATGG CCGGGGGATG TCCAAAGAAG GAGCGCTATT GGAAATGGCT
GTTGAGAAAG GAATAGTGAC TAAGAGCGGC GCTTGGTATT GCTGGCAAGG CGACCACTTA
GGTCAAGGTC ATGAAAATGC CTGCGAATTT TTACGGCAGC ATCCGGACGT TGCCAGGGAG
ATAGAAAGTG TGCTAAAAGA GAGCCTAAGC GGGGTATTAC CCCAGGGGAA GATTATTTAA
 
Protein sequence
METGREQALS KALAEIEKKH GKGAIMRLGQ QERLNVEALP TGILPLDLAL GVGGLPRGRI 
IEIFGEEGSG KTTIALHAVT EIQKGGGNAV LIDAEHAFDP NYAKTIGVDI ENLLVSQPEY
GEQALEIASA LIQSSAVDII VIDSVAALVP KKELEGEFGD AIVGLQARLM SQAMRKLSGI
VSKSKTIVIF INQLREKINT GGSFGKPDFV TTGGRALKFY SSVRIEVQRG DQIKKGTEVV
GHKMKTRVVK NKVAPPFRGC ELDLIYGRGM SKEGALLEMA VEKGIVTKSG AWYCWQGDHL
GQGHENACEF LRQHPDVARE IESVLKESLS GVLPQGKII