Gene Emin_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0697 
Symbol 
ID6263511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp771318 
End bp772517 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content41% 
IMG OID642611169 
ProductHK97 family phage portal protein 
Protein accessionYP_001875589 
Protein GI187251107 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family
[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.25481e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATATAA TACAAAAAAT AGTAAAACAA TTAACCCGTG ATAAAGACGA AAAATCTTTC 
ACGGCGCCTA CGGTATTTGA ACTTACCGGT AAATACAGCC GCCTGCCTTC TCCTTCGGGA
CGGAACTACG CTCCTTACGT AGAAGCTTAC GCCGATAAGC CTTGGATATA CTCAACAGTA
TCGGTAATAG CTGAAACAGT ATCCTCCACT GAATTTCTTT TAAAAAACGC TAAAGGGGAA
ATAATAACAA AACATCCTGT TCTTGAACTT ATGTATAAAC CAAACCCTCT TATGACGGGG
CGCCAGTTAA GACAATGGAT AACGGCTAGC CTTGAACTTA CGGGCAACGC TTATATATTA
AAAGACTCTT TACGCTCGGA CGGTTCCCCC GTGGAGCTGT TTCCTCTTTT AAGCCATTTG
GTTGAGGTTG TACCCGGTAC CATGGCGGCC GAACCTGTAC AAGGGTATAA ATACAGAGTA
GGCTCCAAAA CCGCTTATTA CAGGGCTAAG GATATTATTC ATATAAAATA TTTTAACCCT
TTTGATTTTT TTTACGGATT GTCGCCTTTG GCGGCAGCAA GAGGAGCGGC CGACGCTATA
GAATCAGCCG AAAATTATAA CAGAGCTTTT TTTGATAATT CCGCCACAAT ATCGGGTATA
CTTTCAACCG AAAATAAATT AGACGACGCT ACCAGAACAC GCATAAGCAA AGCGTGGAAC
GACAAATATA CTTCCGCCGC TAAAGCGCAT AAAGTAGCTT TATTAGAAGG CGGTTTAAAG
TGGCAGTCTA TAGGAATGAG CCAAAAAGAT ATGGATTTTA TAAGCGGCGT TAAAATCAAT
AGGGAAACAA TACTTTCCGT GTTTCATGTT CCTCCCGCGC TTGTAGGAAT TTTTGACCAC
GCCCCTCAGT TTAACACGCG CGAACAACAG CGTATTTTTT ACCAGACCTG CGTACTGCCT
AAACTTACTT TAATACTTGA ATCTTTAACG GAATTTTTAC TGCCTGATTT TGATTCCTCA
CGCGAATTAT ATTTAACGCC TGATATAAGC GCGGTATCGG TATTAAAAGA CGACGAAGTG
CAACGCGCCC AGGCGGCTAA ATTATATTTG GATATGGGTT TTGGGCGTGA TGAGGTTATT
AACGCGTTAG GCTTGCCGTT TAGCGTAAGC ACTGTAAAAA AAGCAAAGAG GAAATTTTAA
 
Protein sequence
MNIIQKIVKQ LTRDKDEKSF TAPTVFELTG KYSRLPSPSG RNYAPYVEAY ADKPWIYSTV 
SVIAETVSST EFLLKNAKGE IITKHPVLEL MYKPNPLMTG RQLRQWITAS LELTGNAYIL
KDSLRSDGSP VELFPLLSHL VEVVPGTMAA EPVQGYKYRV GSKTAYYRAK DIIHIKYFNP
FDFFYGLSPL AAARGAADAI ESAENYNRAF FDNSATISGI LSTENKLDDA TRTRISKAWN
DKYTSAAKAH KVALLEGGLK WQSIGMSQKD MDFISGVKIN RETILSVFHV PPALVGIFDH
APQFNTREQQ RIFYQTCVLP KLTLILESLT EFLLPDFDSS RELYLTPDIS AVSVLKDDEV
QRAQAAKLYL DMGFGRDEVI NALGLPFSVS TVKKAKRKF