Gene Moth_2397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2397 
SymbolprfA 
ID3830764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2517644 
End bp2518714 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID637830316 
Productpeptide chain release factor 1 
Protein accessionYP_431222 
Protein GI83591213 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.456174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAAA AACTGGAACA GATCGAAGCC AGATACGAAG AACTGGGCCG GCTGATGGGT 
GACCCGGAAG TAATAGCCGA CCCCGAGCAA TTACAGAAAC ACGCCCGGGC CCACGCAGCC
CTGGAAGATA TAGTCACCAC CTTCCGCCGG TACCGCCAGG TCAGCAATGA GCTGGCAGAA
GATAAAGCCA TGCTGGAGGA AGAAAAAGAC CGGGAATTCC AGGAACTCCT CAAGGCCGAG
ATTGAACGCC TGACCGGGGA GCAGGAACGC CTGGAACAGG AGTTGAAGAT CCTCCTCCTG
CCCAAAGACC CCAATGATGA AAAGGACATC ATCATGGAGA TCCGCGCCGG TGCCGGGGGC
GAAGAGGCGG CCCTCTTTGC CGGCGATCTC TTCCGCATGT ACCAGCGCTA TGCCGAAAAG
AAACGCTGGC GGACGGAGAT TATCAGCTCC CACCCCACCG AACTGGGCGG TTTCAAGGAG
ATCATCTTCC AGGTCGAGGG GCAGGGGGTT TACAGCCGCC TGAAGTTTGA GAGCGGGGTA
CACCGGGTGC AGCGCATCCC GACCACGGAA TCCGGCGGGC GCATTCACAC GTCAACGGCT
ACCGTGGCCG TGTTGCCCGA GGCGGAAGAG GTAGACGTGG AGATCAAGCC CGAAGACCTG
CGGGTGGACA TCTTCTGTTC CAGCGGTCCC GGCGGCCAGT CGGTCAACAC CACCTACTCC
GCCGTCCGCA TTACCCACCT GCCGACGGGC CTGGTGGTCT CCTGCCAGGA CGAGAAGTCT
CAGTTAAAGA ATAAAGAAAA GGCCATGAGG GTGTTGCGCG CCCGGCTCCT GGATATGGCC
CGGGCTGAGC GGGAAGGGGA GCTGGCTGAA GAGCGGCGCT CCCAGGTAGG CAGCGGCGAC
CGCAGCGAGC GGATACGCAC CTATAACTTC CCCCAGAACC GGGTGACGGA CCACCGTATC
GGCCTGACCC TCCACCACCT GGACCAGGTC CTGGCAGGAG AACTGGACGA GATTATCGAC
GCCCTGGTCA CCACCGACCA GGCAGAACGC CTGAAAAACA TGGAGGCCTG A
 
Protein sequence
MLEKLEQIEA RYEELGRLMG DPEVIADPEQ LQKHARAHAA LEDIVTTFRR YRQVSNELAE 
DKAMLEEEKD REFQELLKAE IERLTGEQER LEQELKILLL PKDPNDEKDI IMEIRAGAGG
EEAALFAGDL FRMYQRYAEK KRWRTEIISS HPTELGGFKE IIFQVEGQGV YSRLKFESGV
HRVQRIPTTE SGGRIHTSTA TVAVLPEAEE VDVEIKPEDL RVDIFCSSGP GGQSVNTTYS
AVRITHLPTG LVVSCQDEKS QLKNKEKAMR VLRARLLDMA RAEREGELAE ERRSQVGSGD
RSERIRTYNF PQNRVTDHRI GLTLHHLDQV LAGELDEIID ALVTTDQAER LKNMEA