Gene Emin_0848 details


Gene Information

Locus tag: Emin_0848
Symbol:
ID: 6262569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 936793
End bp: 937986
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 39%
IMG OID: 642611326
Product: PBSX family phage terminase large subunit
Protein accession: YP_001875740
Protein GI: 187251258
COG category: [R] General function prediction only
COG ID: [COG1783] Phage terminase large subunit
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
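
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Emin_0848.
length_bp = gene_length(936793, 937986)
print(length_bp)                  # 1194
print(protein_length(length_bp))  # 397
```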


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.00062306
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACAGA AAAACCTAAT TAAACCCAAA ATAGGACCTG TCTTTAAATT AAATAATAAA 
GCGGGCAAAA GAACCGTTAT AAACGTGGGC GGGGCAAGAA GCGGCAAAAG CCACGCCGTG
GCGCAGCTTT TAATAATGCG CGCTTTAAAC CTGCAGGGTA TTAACGTGGG TATAACGCGC
AAAACAATGC CCGCTTTAAA AATGACGGCG GCGCGGCTTG TAACGGACCT TCTTAAAGAA
TATGGCCTTT ACTCCGAAAA AAACCATAAT AAAATGGAGC ATTATTATAA TTTGGGTAAA
AGCAGAATAC AGTTTTTTTC TTTAGATAAC CCGGAGAAGA TAAAATCAGC TGAGTTTAAC
TATATTTGGA TGGAGGAGGC AACGGAATTT ACGTATGAGG ACTACGTTAC CCTTCTTACC
CGTCTTTCCG CTCCCATAAA AGAGCCTTAC AAAAACCAAA TATTTTTAAC GTTAAACCCT
TCGGACTCAA ATTCTTGGAT AGCAAAAAAA CTGCTTTCAG CACAAAACAC GCAAATTATA
AAAAGTTCTT ATAAGGACAA TCCTTTTTTA AGCAAAGATT ATATTAACAC TTTGCTGGGT
TTAAAAGATA TTGACGAGAA TTATTACCGT GTTTTCGCTT TGGGCCAATG GGGCGCTAAC
AAAAATATTG TTTATGACAA CTATACTTTT GTTGACGAAA TAAAAAACAC GGACAATGTT
ATCTGGGGTC TTGATTTCGG GTTTAACAAC CCGTCTGCGC TTGTTAAACT TTATATATCG
GACGAAGGTG TTTACACCGA GGAAAAACTT TACAAAAGTG GACTTACAAA CAGCGCGCTT
ATAAAAAATT TAGCAGAAAT TATACCCCCC TCACAAAGGC ACGAAAGTAT TTACGCCGAC
GCGGCCGAGC CTGCCCGCAT AGCCGAAATA AGTGAAGCCG GTTTTAACAT ACACCCGGCT
TTGAAAGATG TAAAAGCGGG TATTTTAAGC GTAAAAACCA AAAAACTTTT TATAAACAAA
AACTCATCAA ATCTTATAAA AGAGATTCAA GGTTACTGCT GGAAAACGGA CTTAAACGGC
AATGCGCTTG AAGAAGCGGT TAAATTTAAC GACCACGCGC TTGACGCCTT ACGTTACGCT
TTGCACACTC ATTTTTTTAT ATCTGGAAAG AAACCCGATG TAAGTTTTTT TTAA
 
Protein sequence
MKQKNLIKPK IGPVFKLNNK AGKRTVINVG GARSGKSHAV AQLLIMRALN LQGINVGITR 
KTMPALKMTA ARLVTDLLKE YGLYSEKNHN KMEHYYNLGK SRIQFFSLDN PEKIKSAEFN
YIWMEEATEF TYEDYVTLLT RLSAPIKEPY KNQIFLTLNP SDSNSWIAKK LLSAQNTQII
KSSYKDNPFL SKDYINTLLG LKDIDENYYR VFALGQWGAN KNIVYDNYTF VDEIKNTDNV
IWGLDFGFNN PSALVKLYIS DEGVYTEEKL YKSGLTNSAL IKNLAEIIPP SQRHESIYAD
AAEPARIAEI SEAGFNIHPA LKDVKAGILS VKTKKLFINK NSSNLIKEIQ GYCWKTDLNG
NALEEAVKFN DHALDALRYA LHTHFFISGK KPDVSFF
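
The gene and protein sequences above can be cross-checked against each other and against the reported GC content with a short script. The sketch below is an illustration only: `clean`, `gc_content` and `translate` are hypothetical helper names, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; translation table 11
# uses the same assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAACAGA AAAACCTAAT TAAACCCAAA"))  # MKQKNLIKPK
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) checks that the two are consistent; `gc_content` on the same input can be compared with the GC content field of the record.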