Gene Emin_1098 details


Gene Information

Locus tag: Emin_1098
Symbol:
ID: 6263542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1191150
End bp: 1192226
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 45%
IMG OID: 642611578
Product: peptide chain release factor 1
Protein accession: YP_001875987
Protein GI: 187251505
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0216] Protein chain release factor A
TIGRFAM ID: [TIGR00019] peptide chain release factor 1
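
The coordinate and length fields in this record can be checked against each other with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1191150
end_bp = 1192226

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1077 bp (a whole number of codons) and protein length of 358 aa.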


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.516502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAGAG AGCAAAAAAT GGATATCAGT AAATATTTGG AAGAATTTAA AACTTTAGAG 
GCCGAAATGT CTTCAGGCAA TATGCCCAGC GAGGAATTGC AAAAAAAATC CAAACGCCAT
GCTTTTTTAC GTCCGATAGT TGAAAAAGGT TTGGAACTTG AAGCGGCGCA AAAAGGAATT
AAAGACGCGC AGGAAATTAT TAAAGACGGG TCTGACAAAG AGATGTCGCA GATGGCGGCT
GAGGAAATGG AAACTCTTAA CGCTCAAATA CCTGTTTTGG AAGCTGAGCT GCGCGTGCTT
GTTATTCCGC CTGATCCGAA CGACTCAAAA AGCATTTACC TTGAACTGCG CCCGGGAGCC
GGGGGAGACG AATCTTCTAT TTTTGCGGCT GAAATGTTAC GCGTGTACCA GCGCTTTGCC
GACGCTAAAG GTTGGAAGAC AGAACTTTTG GAATACACGC CCACAGGTCT TAAGGGATGT
AAATACGCCA GTATGTTTAT TAAAGGCGAC GGGGCTTATT CCTGGCTTCG CGACGAATCA
GGCACACATC GCGTACAGCG CGTGCCTGAC ACGGAAACAA GCGGCCGTGT GCATACTTCA
ACAATAACAG TTGCTATTAT GCCCGAAGCG GAAGAAGTTG ACATACAGAT TAACCCCGCG
GATATTGAAA TGGAAACCTG CCGCGCCGGC GGAGCGGGCG GACAAAATGT TAATAAAGTT
GAAACGGCTG TCAGACTTAT CCACAAACCT ACCGGCGTTG TTGTCAGCTG CCGTGAGGAG
CGCAGCCAGG GCGCAAACAG AATTAAGGCC ATGAACATGC TTAGAGCGAA ACTTTACCAA
ATGGAAGAAG AAAAAAGAAA TAAAGAAATT TACGATACCC GTAAATCCCA GGTAGGCACA
GGCGACAGGA GCGAAAAAAT AAGAACTTAT AACTTTCCGC AAAGCCGCGT TACTGACCAC
AGGACGGAAA AATCCTACCA CAACATCACT GAAATCATGG AAGGACAGAT TGAGGAAATT
CTAAACGATT TAAGAACCTT AAGGCTTGAG GCTAAAATAA AAAATCTTGA GCTTTAA
 
Protein sequence
MLREQKMDIS KYLEEFKTLE AEMSSGNMPS EELQKKSKRH AFLRPIVEKG LELEAAQKGI 
KDAQEIIKDG SDKEMSQMAA EEMETLNAQI PVLEAELRVL VIPPDPNDSK SIYLELRPGA
GGDESSIFAA EMLRVYQRFA DAKGWKTELL EYTPTGLKGC KYASMFIKGD GAYSWLRDES
GTHRVQRVPD TETSGRVHTS TITVAIMPEA EEVDIQINPA DIEMETCRAG GAGGQNVNKV
ETAVRLIHKP TGVVVSCREE RSQGANRIKA MNMLRAKLYQ MEEEKRNKEI YDTRKSQVGT
GDRSEKIRTY NFPQSRVTDH RTEKSYHNIT EIMEGQIEEI LNDLRTLRLE AKIKNLEL
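
As a hedged cross-check of the two sequences above, the following sketch strips the 10-base block spacing, translates codons using the amino-acid assignments of NCBI translation table 11, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first line of the gene sequence is embedded to keep the example short. The full sequence could be pasted in its place.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
# '*' marks a stop codon. Table 11 also permits alternative start codons,
# which this sketch does not handle; this gene starts with ATG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTAAGAG AGCAAAAAAT GGATATCAGT AAATATTTGG AAGAATTTAA AACTTTAGAG"
print(translate(first_line))  # MLREQKMDISKYLEEFKTLE
```

The 20 residues produced from the first line match the start of the protein sequence listed above.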