Gene Emin_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1086 
Symbol 
ID6263441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1177206 
End bp1178408 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content44% 
IMG OID642611566 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001875975 
Protein GI187251493 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones99 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAA TAATGCAATC CATAAAAGAA TTAAGAAAAA CTTTAGAAAC TAAAATCGAT 
TCATGCATTA CTAAAGAAAA GGCTGAAAAC ATAGTTTCTG ATTTAGTAAA AAAAGTTCAC
CCCGAACAAA GAAAGGCTTT ACTTCCTACC TCGGCGGACG ATGTTCTGGA ACGTTTTGCC
GAATTTTCAA AATCCTCAAA ACACGCGCCT GAAAAACCTT GGACAAGCGA TTACGGCAGA
AAGTTTGGCA ACATGAAAAA CTTTCTTTTA GCGGCTAAAG ACAGGCACGC TTCTTTCGCG
GACAGCAAAT CAGTTTTAGC CGAATCAGGC GCTGGCGGCG GCGGATATCT TGTTCCCACG
GAATTTTCAA ACCATGTTGC CCGTATAATG GCCGATATTT CACCCATTAT GCAAATTGCC
AACGTTGTGC CCATGGGCTC TTGGAAAAGG CAAATACCTA AGCAGATTTC TAATTTAAGC
GTAGGCTGGG TTGCCGAGAA CGGTGTAAGG GGCATTAATA ACCCCGCGTT TGGGCAAATT
GAACAGGTAG CAAAAGTAAT GGCAACAGTT ATCAAATGCA CGGACGAGCT TATAAGAGAC
AGCGCCATTA ATTTAACGCA GTTCTTATCC GAGCTTGTGG CTGAAGCCAT GGCATTGGAA
GTTGAAAGAG TTTCCTTAGT TGGCGACACG GCCGCCGGCG ATCCTTTTAA CGGCGTTTAT
AACACGGCAG GATTACAAAA TGTAACAATG GGCGGGCAAA ACGTTTCTTT TGATGATATT
GCTAACTTAA TTTTTCAACT AAACGATGCT AACGCCGCAG GCAGTGTTTT GGTCCTTTCC
CGCACCGGTC TTAGCAAACT TTTAAAATTA AAAGACTCTA ACGGCAATTA TTTGTGGCAG
CCCCCCGCGG GCGGCGCTCC GGCTACAATT TGGAACACTC CTTATGTCGT AAGCTCTAAA
ATACCTAATG ATGTCGACGG TGATAAAACG ATAGCTCTTT TCGGACGTTT TAATAAACAC
CTTCTTATCT CACCCAGACA GGAAATGGCG GTAAAAGTTT CACAGGACGC CTCTTCATGG
AACGCGGCTT CCGAAACGGC CGACTCCGCA TTTATGTTAG ACCAAACCTG GCTGCGTTTT
ACTCAGGCAC TTTCAATAGA CGTTACATTC GGAAGCGCCT TTAGCTGCCT TAAATTCAAA
TAA
 
Protein sequence
MDEIMQSIKE LRKTLETKID SCITKEKAEN IVSDLVKKVH PEQRKALLPT SADDVLERFA 
EFSKSSKHAP EKPWTSDYGR KFGNMKNFLL AAKDRHASFA DSKSVLAESG AGGGGYLVPT
EFSNHVARIM ADISPIMQIA NVVPMGSWKR QIPKQISNLS VGWVAENGVR GINNPAFGQI
EQVAKVMATV IKCTDELIRD SAINLTQFLS ELVAEAMALE VERVSLVGDT AAGDPFNGVY
NTAGLQNVTM GGQNVSFDDI ANLIFQLNDA NAAGSVLVLS RTGLSKLLKL KDSNGNYLWQ
PPAGGAPATI WNTPYVVSSK IPNDVDGDKT IALFGRFNKH LLISPRQEMA VKVSQDASSW
NAASETADSA FMLDQTWLRF TQALSIDVTF GSAFSCLKFK