Gene Mlg_1545 details

Gene Information

Locus tag: Mlg_1545
Symbol:
ID: 4270550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1767893
End bp: 1769086
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 65%
IMG OID: 638126301
Product: hypothetical protein
Protein accession: YP_742382
Protein GI: 114320699
COG category: [S] Function unknown
COG ID: [COG3864] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
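
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 1767893
END_BP = 1769086
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks should agree with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1769086 - 1767893 + 1 gives 1194 bp, and 1194 / 3 - 1 gives 397 aa, matching the reported gene and protein lengths.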


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0659219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.827191
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGGG CCGAGGTCAA GGAAAAGGCG CTCGCGCTGT GGGAGAACGA CCGGGCGCGG 
CTGGTCTTCG AGCAGCCGTT CATCGGCATG CTGGCCATGC AACTGGAACT GCAACCGGTG
GTGGACGGGG CCCTGCCCAC CGCTGCCACC GATGGCCGCA CGGTTTTCGG CAATGCGAAC
TTTCTCTGCC GGCTCAGCGA CGAGGACCGG CTCTTCGTGC TGGCGCACGA GGTATGGCAC
TGCGCGGCCC TGCATCACCT GCGCCGCCAG GGTCGGGACA CGGTCCGGTG GAATCACGCG
GTGGACTACG AGGTGAATGG CCTGCTCGCG GAGACCGGCA TGACCGTGCC CAAAGGCGCG
CTGTACAGAC GGCACTGGCG CGGATGGAGC GCAGAGTCCA TCTACGAGGT TCTGCCGGAG
GACCTGACGG ATGAGGGCCG GGGCCGATTC CGGGACAGGC ACACCTGGCG CGCCCCGTCC
CTTGGCCGTA GCGATCCCGA CCTCTGCCCG CGACCCGACG AGCGTATCTG GAAAGGTTGG
CAGCAACGGG TGGTGGGGGC CTGGCAGCAA GTGCAGGCCC GGGGCCACGG TGGTCGGGGC
ATCGGGCGTC TGCCCGGCGT CATGGGCAGC CTGGTGCGCA GCCTGACCCG TCCGCAAGTG
CCCTGGCAGA CTGTCCTGAG ACGCTACCTT GTTCCCCGCC TGGACCCGGG CAGGCGTCAA
TGGACCACAC CGAACCGCAG GTACCTCAGT CGTGAGCTTT ACCTGCCCGG GCCGGCGCGG
GAGCGGGTGG ACCTGGCGGT GGCCATCGAC ACCAGCGGCA GCACGCAGGA CTACCTGCCC
GCGTTCCTGG CAGAGTTGCG GGGCATTGCG GGTCAGTGGC CGGATACGCG GATTCGGTTG
ATCCAAGCGG ATGCCGATAT CCAGAGCGAT GAGTACGTGA CTGCTGCGGA TCTGAGGCCT
GAGATGATTC TGAAGGGGGG TGGGGGGACG GACTTCAGGC CGGTGTTCGA GGCCTTGAAG
TCGGATCCGC CGAGGGTGCT GGTGTATTTC ACGGATGGGT TCGGTCAGCT ACCTGGCCGC
CACGAGGTGG AGGCTCTCGA TGTTGTATGG GTCATATGGC GTGATACGAT CACCATAAAA
TGCCGCTATG GAGCGGTTGT TACAACTGGA CAACCCTTGT CGCTATGGCA ATAA
 
Protein sequence
MKRAEVKEKA LALWENDRAR LVFEQPFIGM LAMQLELQPV VDGALPTAAT DGRTVFGNAN 
FLCRLSDEDR LFVLAHEVWH CAALHHLRRQ GRDTVRWNHA VDYEVNGLLA ETGMTVPKGA
LYRRHWRGWS AESIYEVLPE DLTDEGRGRF RDRHTWRAPS LGRSDPDLCP RPDERIWKGW
QQRVVGAWQQ VQARGHGGRG IGRLPGVMGS LVRSLTRPQV PWQTVLRRYL VPRLDPGRRQ
WTTPNRRYLS RELYLPGPAR ERVDLAVAID TSGSTQDYLP AFLAELRGIA GQWPDTRIRL
IQADADIQSD EYVTAADLRP EMILKGGGGT DFRPVFEALK SDPPRVLVYF TDGFGQLPGR
HEVEALDVVW VIWRDTITIK CRYGAVVTTG QPLSLWQ