Gene Moth_2490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2490 
Symbol 
ID3831593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2595130 
End bp2596599 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content58% 
IMG OID637830412 
ProductDNA repair protein RadA 
Protein accessionYP_431315 
Protein GI83591306 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1066] Predicted ATP-dependent serine protease 
TIGRFAM ID[TIGR00416] DNA repair protein RadA 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.858464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0874227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCACCAT CGGTGAATGA AAATAGCGGG GCTGATAAAA GCAAGGGGGC AGAACTCCTG 
ACCAGGATCA AAGAACGCTT CGTTTGCCAG CAGTGCGGTT ATGAATCCCA GGGCTTCCTG
GGCCGATGCC CGGCGTGTGG TAGTTGGAAT AGCCTGGTGG CCGAGGCCAT AATTCCAGAG
GGTATAAAAA AAGCGGCAGG CTCCGGGGAA GTCCCCGTCC TGCTATCCCG GGTAAATGAT
ATCAGTGAAA AGAGGCTGGT GACTACCCTG GGCGAATGGG ACCGGGTCCT GGGAGGCGGG
CTAGTTCCCG GTTCCCTGGT CCTGGTTGGC GGGGCTCCCG GGATTGGTAA GTCTACCCTC
CTTCTCCAGG TGGCCCACTT ACTCTCTTCG AGGTACGGTA AAATACTCTA TGTCACCGGA
GAAGAATCCG CCAGCCAGAC CCGGTTAAGG GCTCGACGCC TGGGCGCCGA GGAAGGCGAG
ATCTACTTAC TGGCGGAAAC TAATATTGAA GGGATCCTCC TGCAGATAGA ACGGCTGCAG
CCGGTAGTAG TTATGGTGGA TTCTATCCAG ACGATGCTTC TTCCTGATAT CCAGGCTGCC
CCGGGCAGCG TTTCCCAGGT GCGGGAAGGA GCGGCCCGCT TTTTACGCCT GGCCAAGGAT
GGCGGCCCGG CAGTAATTCT GGTGGGTCAC GTCACCAAGG AAGGATTCCT GGCCGGCCCG
AAGGTCCTGG AACACCTGGT GGATTGTGTC CTCTACCTGG AGGGTGAACG CTACCAGGCC
TACCGCATTC TGCGGTCCGT TAAAAATCGC TTCGGCTCCA CCAATGAGAT TGGCGTTTTT
GAGATGACCG GCTCCGGTTT GCAGGAAGTA ACCAACCCCT CGGCCATGCT TATGGCCGAG
CGCCCGGCCG GAGTGGCGGG CTCCAGTGTC GTCGCCTGCC TGGAAGGCAC CCGGCCCCTT
CTACTGGAGA TCCAGGCCCT GGTGAGTAAG ACTGCCTTTG GAAACCCGCG GCGGCTAGCT
ACCGGTATTG ATTTCAACCG GGCCCTCCTG CTGGCAGCGG TCCTGGAGAA ACGGGCCGGC
CTGCCCCTGG GGGGCTACGA TATATACCTT AACGTGGCCG GTGGTATTGC CATCAATGAA
CCGGCAGCCG ACCTGGGTAT ATGCCTGGCC ATTGCCTCTG GTTTGAAGGA TCGTCCCCTG
GAATCCCGGA CCCTTGTCCT GGGGGAGGTT GGCCTTGCTG GAGAGGTAAG GGCCGTCACC
CAGCTGGAAA GGCGCGTTGA GGAAGCAGCC AGGCTGGGTT TTAACCGCTT TATAATTCCG
GCTGGCAATA GGGGGGGTCT TAAAGGGCAG AGCGGCTGCG AAATATATAA AGTATCTACA
ATAAATGAGG CCCTGCGACT GGCCCTCGTT AATACCGGCT CAGGGGCAGG CGATAATACG
TTGAGTAACC CGTTTTATAA ATACTCTTAG
 
Protein sequence
MPPSVNENSG ADKSKGAELL TRIKERFVCQ QCGYESQGFL GRCPACGSWN SLVAEAIIPE 
GIKKAAGSGE VPVLLSRVND ISEKRLVTTL GEWDRVLGGG LVPGSLVLVG GAPGIGKSTL
LLQVAHLLSS RYGKILYVTG EESASQTRLR ARRLGAEEGE IYLLAETNIE GILLQIERLQ
PVVVMVDSIQ TMLLPDIQAA PGSVSQVREG AARFLRLAKD GGPAVILVGH VTKEGFLAGP
KVLEHLVDCV LYLEGERYQA YRILRSVKNR FGSTNEIGVF EMTGSGLQEV TNPSAMLMAE
RPAGVAGSSV VACLEGTRPL LLEIQALVSK TAFGNPRRLA TGIDFNRALL LAAVLEKRAG
LPLGGYDIYL NVAGGIAINE PAADLGICLA IASGLKDRPL ESRTLVLGEV GLAGEVRAVT
QLERRVEEAA RLGFNRFIIP AGNRGGLKGQ SGCEIYKVST INEALRLALV NTGSGAGDNT
LSNPFYKYS