Gene Mlg_1758 details

Gene Information

Locus tag: Mlg_1758
Symbol:
ID: 4268818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2012570
End bp: 2014144
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 62%
IMG OID: 638126516
Product: phage tail sheath protein
Protein accession: YP_742594
Protein GI: 114320911
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
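
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2012570, 2014144)
print(length, protein_length_aa(length))  # prints "1575 524"
```

Under those assumptions the result matches the Gene Length (1575 bp) and Protein Length (524 aa) fields.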


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.53821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCT ATTTACACCC TGGGGTCTAT ATCGAGGAAA TACCCAGTGG TTCCAGGCCT 
ATTGACGCGG CAGGGACGTC CACCGCCGCT TTCATCGGCT ACACCACCCG GGGGCCGGTC
GACGAACCGA CCTTCATCAC CAGTTGGGAA GACTACGAGA ACATCTTCGG GGGCATCCGC
GAGGTAGAGG ACTCGGTTAG CGGTATTGAT TCGATGAAGA ATGCCAGGCT CGGCCAGGAG
ATCGACACCC TCGGCCTGTC GGTGTTCGCC TATTTTCAGA ATGGCGGGGG CAAGGCCTAC
ATCATCCGCA CCGCCAGTGA CACAAAGGTG GCCGACGGTG CACTGGAGGA CTCCGGCACC
GACCTGATTG GGTTCGAGGC GGTCAATCCC GGAACCTGGG GCAACCGGCT GCGGGTCCGC
CTCACAGCCA AGCCGGACGC CAGCGATTCC CGTTTCACCG TGGAGATCGG CCGCGGGGAT
GGCGATGACT TCGTCGCCGA CGAGACTTTC ACCGACGTCT CGCTGGAGAA GGGGGATAAC
GATTATCTCA CCACCGTCCT GAAGGAGGGC TCGGAGCTGC TAAGGGTCAG CGACGAGGAC
GATATCGCCG ATGCGGTCGC GGTCATTAAC GCTGCCGGAA CGGACGGTGT CAGCGTGGAG
ATGACCGGCG GCGAGGACGG GACACCCGGT GGCGCCAATG AATACACCGG TATCTTTTCG
AAGCTTCTGA AATACCGGGA TATCAACATC ATTCTGCTGC CCGATCAGAC CTGGGGAGGC
GCAGGGCAGG GGATCATTGA AAGCGCCATC AGTCATTGCG AGACCATGAA AAACCGCATG
GTCATCTTCG ATCTGCCTCC CGGCCAGGAG CTGGAAAAAG AAAAGGACGT GACCGACCTG
GCGCTGACCA CCTCGACGTA CGCGGCCACC TATTACCCCT GGGCGCTGGT CAGCAACCCG
CACTACAACC CCGACACCAA CCCGGGCGCG GAACGGATGG TGCTGGCGCC GGCGGGCGGC
TTTGCCGCAG GGCAGTGGGC ACGAACGGAC GGCCGTCGGG GGGTGTGGAA GGCCCCGGCG
GGGGTTGAGA CGAACCTTCT GGGCATCAGA AAGCTGCTCT ACACGGTCGA GGACGCCGAG
CAGCAGTACC TCAACCCGTT GGGTGTCAAC GCCCTGCGCC AGTTGCCCAA CTACGGATCG
GTGATTTGGG GCTCGCGCAC GCGGGCCACC CGCGCCAACC CGGAGTGGCG CTACATCCCG
GTGCGGCGCA CCGCCATTTT CATCGAGGAG AGCATCTTTC ACGGCATCCA TTGGGCGGTC
TTCGAGCCGA ACGACCACCG CCTGTGGTCG GCCCTACGCA CGAATATCGA ATCCTTCCTG
GGCGGGCTCC ACCGCTCGGG GGCCTTCCAG GGTGAGAAGG CCAGTGATGC CTACTTTGTG
CGCTGCGGTC TCGGGCAGAC CATGCGCCAG GGCGATATTG ATCGCGGCCA GGTGATCGTC
GAGGTGGGCT TTGCGCCGCT CAAGCCGGCG GAGTTCGTCA TCGTGCGCAT TCAGCAGAAA
GTCGGCCAAC AGTAA
 
Protein sequence
MASYLHPGVY IEEIPSGSRP IDAAGTSTAA FIGYTTRGPV DEPTFITSWE DYENIFGGIR 
EVEDSVSGID SMKNARLGQE IDTLGLSVFA YFQNGGGKAY IIRTASDTKV ADGALEDSGT
DLIGFEAVNP GTWGNRLRVR LTAKPDASDS RFTVEIGRGD GDDFVADETF TDVSLEKGDN
DYLTTVLKEG SELLRVSDED DIADAVAVIN AAGTDGVSVE MTGGEDGTPG GANEYTGIFS
KLLKYRDINI ILLPDQTWGG AGQGIIESAI SHCETMKNRM VIFDLPPGQE LEKEKDVTDL
ALTTSTYAAT YYPWALVSNP HYNPDTNPGA ERMVLAPAGG FAAGQWARTD GRRGVWKAPA
GVETNLLGIR KLLYTVEDAE QQYLNPLGVN ALRQLPNYGS VIWGSRTRAT RANPEWRYIP
VRRTAIFIEE SIFHGIHWAV FEPNDHRLWS ALRTNIESFL GGLHRSGAFQ GEKASDAYFV
RCGLGQTMRQ GDIDRGQVIV EVGFAPLKPA EFVIVRIQQK VGQQ
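
Both sequences are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC fraction calculation that could be compared with the GC content field; the helper names are hypothetical and the example input is only the final line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence by removing all spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


last_line = "GTCGGCCAAC AGTAA"  # final line of the gene sequence
seq = clean_sequence(last_line)
print(len(seq), seq[-3:])  # prints "15 TAA"
```

Passing the full gene sequence text to `clean_sequence` should give a string whose length equals the Gene Length field, and `gc_fraction` on that string gives a value to compare against the reported GC content.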