Gene Mlg_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0644 
Symbol 
ID4270833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp694910 
End bp696049 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content61% 
IMG OID638125392 
ProductHK97 family phage major capsid protein 
Protein accessionYP_741488 
Protein GI114319805 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.146587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATC AAGTGAAGGA AATGGTCGAG GGCGTGGCCC GAGACTTCGA GGGGTTTAAG 
TCCCGCCAGG ATCAAGCCCT GTCCGACCTG GGCGAGCGCG TGAAGCGTGC CGAGACCCTG
GCCGCCCGCA AGGTGGGCAT GACCGTGGAC AGCGGCGCAG GCATGGGGCC GGAGACCAAG
GCGGTCCGTG AGTGGCTGCG TGGTGGCGAC TTCGACCGCA AGGCCCTGAG CATCGAGGAT
GACGGCCAAG GCGTGACCGT GCGCAGCGAT TGGGCGGATC AGATTTTCAA GCGCATTCGG
GAATCCAGCC CGGTCCGGCA GGTGGCTAAT AACCTGTCCA CCAATTCCAA CGAATTGGAG
GTTCTGGTTG ACCGTGGTGA ACCGGACTCA GCGTGGGTGG CTGAAAAGGG CGACCGCGAT
CCGACTGCCG CTAGTTTCAT GTCTCGGCAT AAGATTGCGG TGCATGAACA TTATGCGTAT
CCGAGCGTAA CCCAGCAATT CCTGGAGGAT AGCCGGCTCG ACCTGGAGCA GTGGTTGCAG
GACAAGATCG GAACCCGTTT CGGTCGGCAG GAAGCCGAAT CCTTTATCAA GGGTGACGGC
AACGGCAAGC CTCGCGGCAT TCTGGATTAC GACACCGTGC CGGATGGTGA TTTCGAGTGG
GGCGCTGATC CTGCCGATTA CACCATCGGG GCGATCTATA CCGGTGAATC GGGTGACTTC
CCGAGCAACA ATCCCGATAA CGTGCTCTAC GATGTTGTGG ACGCGCTCAA GTCGGATTAC
CTGGGCAATG CACGGTTCAT GATGAGCCGC GCCACGATGA ACAAAATTCG GAAGCTGCGG
GATGGTGACG ACCGTGCCCT GCTCCAGATG AGCCTGGCGG AAGGTCGGCC CAACACGCTT
CTTGGGTTCC CGGTGGTGAT TGCCGAGGAT ATGCCGGACC CGGCTGCCGA TTCCGAGTCG
ATCCTGTTCG GTGACTTCGG CCAGGCTTAC ACCATCGTTG ACCGGATCGG AGTAAGCGTG
CTGCGTGATC CCTACTCCCT GCCCGGATGG GTCCGCTGGT ATGTGCGCAA GCGGATCGGT
GGGGCGCTGA CCAACCCCGA GGCCGTGAAG GCCGTGGTGT TCGGCGCTGA GCCGAGCTGA
 
Protein sequence
MTHQVKEMVE GVARDFEGFK SRQDQALSDL GERVKRAETL AARKVGMTVD SGAGMGPETK 
AVREWLRGGD FDRKALSIED DGQGVTVRSD WADQIFKRIR ESSPVRQVAN NLSTNSNELE
VLVDRGEPDS AWVAEKGDRD PTAASFMSRH KIAVHEHYAY PSVTQQFLED SRLDLEQWLQ
DKIGTRFGRQ EAESFIKGDG NGKPRGILDY DTVPDGDFEW GADPADYTIG AIYTGESGDF
PSNNPDNVLY DVVDALKSDY LGNARFMMSR ATMNKIRKLR DGDDRALLQM SLAEGRPNTL
LGFPVVIAED MPDPAADSES ILFGDFGQAY TIVDRIGVSV LRDPYSLPGW VRWYVRKRIG
GALTNPEAVK AVVFGAEPS