Gene Mlg_0795 details


Gene Information

Locus tag: Mlg_0795
Symbol:
ID: 4270558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 886105
End bp: 887244
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 61%
IMG OID: 638125545
Product: HK97 family phage major capsid protein
Protein accession: YP_741639
Protein GI: 114319956
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
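
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 886105
end_bp = 887244
reported_gene_length = 1140     # bp
reported_protein_length = 379   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions both reported lengths agree with the coordinates.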


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCATC AAGTGAAGGA AATGGTCGAG GGCGTGGCCC GAGACTTCGA GGGGTTTAAG 
TCCCGCCAGG ATCAAGCCCT GTCCGACCTG GACGAGCGCG TGAAGCGTGC CGAGACCCTG
GCCGCCCGCA AGGTGGGCAT GACCGTGGAC AGCGGCGCAG GCATGGGGCC GGAGACCAAG
GCGGTACGCG AGTGGCTGCG TGGTGGCGAC TTCGACCGCA AGGCCCTGAG CATCGAGGAT
GACGGCCAAG GCGTGACCGT GCGCAGCGAT TGGGCGGATC AAATCTTTAA GAAGATTCGC
GAATCCAGCC CCGTCCGCCA GGTGGCTAAT AACCTGTCCA CCAATTCCAA CGAATTGGAG
GTTCTGGTTG ACCGTGGTGA ACCGGACTCA GCGTGGGTGG CTGAAAAGGG CGACCGCGAT
CCGACTGCCG CTAGTTTCAT GTCTCGGCAT AAGATCGCCG TGCATGAGCA CTACTCGTAT
CCGAGCGTAA CCCAGCAATT CCTGGAGGAT AGCCGGCTCG ACCTGGAGCA GTGGTTGCAG
GATAAAATCG GCACCCGTTT CGGTCGGCAG GAAGCCGAAT CCTTTATCAA GGGTGACGGC
AACGGCAAGC CTCGCGGCAT TCTGGATTAC GACACCGTGC CGGATGGTGA TTTCGAGTGG
GGCGCTGATC CTGCCGATTA CACCATCGGG GCGATCTATA CCGGCGAATC GGGTGACTTC
CCGAGCAACA ATCCCGATAA CGTGCTCTAC GATGTTGTGG ACGCGCTCAA GTCGGATTAC
CTGGGCAATG CACGGTTCAT GATGAGCCGC GCCACGATGA ACAAAATTCG GAAGCTGCGG
GATGGTGACG ACCGTGCCTT GCTCCAGATG AGCCTGGCGG AAGGTCGGCC CAATACCCTG
CTGGGGTTCC CGGTGGTGAT TGCCGAGGAT ATGCCGGACC CGGCGGCGGA TTCCGAGTCG
ATCCTGTTCG GTGACTTCGG CCAGGCGTAC ACCATCGTTG ACCGGATCGG GGTAAGCGTG
CTGCGTGATC CCTACACCCT GCCCGGCTGG GTCCGCTGGT ACGTCCGCAA GCGTATCGGC
GGGGCGCTGA CCAACCCCGA AGCCCTGAAG GCCGTGGTGT TCGGCAGCGA GCCGAGCTGA
 
Protein sequence
MTHQVKEMVE GVARDFEGFK SRQDQALSDL DERVKRAETL AARKVGMTVD SGAGMGPETK 
AVREWLRGGD FDRKALSIED DGQGVTVRSD WADQIFKKIR ESSPVRQVAN NLSTNSNELE
VLVDRGEPDS AWVAEKGDRD PTAASFMSRH KIAVHEHYSY PSVTQQFLED SRLDLEQWLQ
DKIGTRFGRQ EAESFIKGDG NGKPRGILDY DTVPDGDFEW GADPADYTIG AIYTGESGDF
PSNNPDNVLY DVVDALKSDY LGNARFMMSR ATMNKIRKLR DGDDRALLQM SLAEGRPNTL
LGFPVVIAED MPDPAADSES ILFGDFGQAY TIVDRIGVSV LRDPYTLPGW VRWYVRKRIG
GALTNPEALK AVVFGSEPS
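
The gene and protein sequences above are related through translation table 11. As a minimal sketch, the following Python translates a space-separated block of the gene sequence. The helper names (`clean`, `translate`) are hypothetical. The codon table is built from the standard amino acid string in TCAG base order, which table 11 shares with the standard code; table 11 differs only in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGACGCATC AAGTGAAGGA AATGGTCGAG"
print(translate(first_block))  # MTHQVKEMVE
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would allow the whole protein to be compared.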