Gene Mlg_2294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2294 
Symbol 
ID4268392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2602999 
End bp2606376 
Gene Length3378 bp 
Protein Length1125 aa 
Translation table11 
GC content68% 
IMG OID638127054 
Producthypothetical protein 
Protein accessionYP_743126 
Protein GI114321443 
COG category[S] Function unknown 
COG ID[COG4913] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.237724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCC AGGGACTGGA TTTCGCCGGC TCCGACCAGC GCACCGGTTT TCGCCTCCAG 
CGGCTGGAGG TGTTCAACTG GGGCACCTTC CACGACCGCG TCTGGTCGCT GCAGCCCGGC
GGCGACAACA GCCTGCTGAC CGGGGACATC GGCTCCGGCA AGTCCACCTT GGTGGACGCC
ATCACCACGC TGCTGGTGCC GGCTCAGCGG ATCACCTACA ACAAGGCCGC CGGCGCGGAG
GCCAAGGAGC GCAGCCTGCG CTCTTACGTC CTCGGCTACT ACAAGTCGGA GCGTAGCGAC
GCCGGCTTGG CCAGCAAGCC GGTGGGCCTG CGCGACCATA ACAACTACTC GGTCATCCTC
GGCCACTTCC ACAACGAGGG CTTCGGCCAC GATGTGACCC TGGCCCAGGT CTTCTGGATG
AAGGACACCC AGGGCCAGCC GGCGCGGTTT TTCGTGGTCG CCGACCGCCC GCTGAACATC
GCCGAGCACT TCGCCAATTT CGGCACCGAC CTGGCCAACC TGCGCAAGCG CCTGCGCGGG
CTGCCCCACG TGGAGTTGTA CGACACCTTT CCGCCCTACG GCGGTGCCTT CCGGCGGCGC
TTCGGCATCG ACAATGAGCA GGCCCTGGAG CTCTTTAACC AGACGGTCTC CATGAAGTCG
GTGGGTAACC TGACCGAGTT CGTGCGCAGT CACATGCTGG AGGCCTTCCC CGTCGAGGAG
CGGATCGAGG CGCTGATCCG CCATTTCGAC GACCTCAACC GGGCCCACGA GGCGGTGTTG
AAGGCCAAGT CGCAGGTGGC GCAGTTGATT CCCCTGGTGC GCGATGCCGA CGAGCACGAC
CAACGGGCCA CCCGGGCCCG GGAGCTGCGC GATTGCCGCC AGGCGTTGCG GCCCTGGTTC
GCCTCGCTGA AGGCCGAGCT GCTGGACCGC CGGTTGGACA ACCTGGGCCA GGAGCACGAG
CGGCTGCAGC AGCGCATTGA GGGGCTGAGC ACTCGGCTCA CCGAGCAGCG GGCCCGGCGC
GACGAGCTCA AGCAGGCGAT CGCCGAGAAC GGCGGCGACC GCCTGGAGCG GATCAAGGCG
GAGCTGGAGC GCAAGCGGCG GGAGAAGGCG GAGCGCAGCG AGAAGGCCGA GCGCTACCGG
CGGCTGGCGC ACCGGCTGGA GCTGCCCGAT GCCACCAGCG ACGAGGTCTT TCACGCCAAC
CGCGAGGCGC TCACCGTGGC CCGCGAGGCC ACCGAGACCG CCCGCAGCGA GCTGCAGAAC
CGGATTACGG AGGCCAGCGT GGAGCTGCAC CAGCTCAAGG GGCGGCACGA GGACCTGGAT
GCCGAGCTGG CATCGCTGCG CGGTCGACGC TCCAACATCC CGGCGGAGAT CCTCGCCATC
CGCACCGCCC TGTGCCGCGC GCTGGAGGTG CCCGCCGAGG ACCTGCCCTT CGCCGGTGAA
CTGCTGCAGG TGCGTGACGA CGAGCGCGAC TGGGAGGGCG CCATCGAACG GGTGCTGCAC
AACTTCGGCC TGTCGCTGCT GGTGCCCGAC AGCCACTACG CCCGCGTCGC CGAGTGGGTG
GACCGCACCC GGTTGCGGGG GCGGTTGGTC TACTACCGCG TGCGCGAAGC CCGCGACGCG
CCGCCGCCGT CGCTGCACGC CGACTCGCTG GTGCGCAAGC TGGCCATCCG GCCGGAGTCG
GCGTTTTATA CCTGGCTGGA GCAGGCGCTG GCGCAGCGCT TCGATTACGC TTGCTGCACC
GATCTGCAGC AATTCCGTCG TGAGAAGCGG GCGATCACCC GTGCCGGGCA GATAAAGGCC
GGCGGTGGCC GCCACGAGAA GGACGACCGC CACCGTATCG ATGACCGCAC CCGTTATGTG
CTGGGCTGGA GCAACGAGGC GAAGATCGCC GCGCTGGAGG CCGAGGCCGA CGGGCTGGCC
GAGCGCATGC AGGCCATCGC CAAGCGCATC GGTGGCTGGG AGGACGAACG GCGGGCCCAG
GAGGCCCGCC GCGACACCAT CCAGGAGCTG ACCCTGTTCC AGGATTTTCA GGCCCTGGAC
TGGCGTCCGC TGGCTGCCGA CATTGAGGCG CTGGAGCGCG AGTACCGGGA GCTGGCCGAG
GGCTCGGACG TGCTCCGGGC GCTGGAGCGG CAACTGGATG AGCTGGAAAA GACCATCAAG
GGCGTCGAGG GCGAGGTCCG CCAGGCGGAT CGCGAGATCA GCGCCAACGA GCTGAAACAG
GAGCAGGCCC GGGCCCAGTG GCAGGATTGC CACACGCTGT TGGAGTCGAC ACCGGAAGAG
GCGCAGCAGA CCTACTTTCC GCGCCTGGCC GCCATGCGCG ACGAGGCCCT GGGTGAGCAC
ACCCTGACGG TGGAGTCCTG CGACAACCGC GAGAAGGACA TGCGCGAATG GCTGCAGGGC
AAGATCGACG CGGAGGACAA GCGCCTGGCG CAGTTGCGCG ACCGCATCAT CGACGCCATG
CGCCGCTACC AGGACGCCTG GCCGCTGGAC AGCCAGGAGG TGGATGTGAG CGTCGAGGCC
GCCGGCGACT ATCGCGCCAT GCTCGAGCAG TTGCAGGCAG ATGATCTGCC CCGCTTCGAG
GCCCGCTTCA AGGAGCTGCT CAACGAAAAC ACCATCCGCG AGGTGGCCGC CTTCCAGAGC
CGGCTGCATC GGGAGCGCCA GGCGATCCTG GAGCGCATCG GCACCATCAA CCGCTCGCTG
CGTGAGATCG ACTACAACGA CAACCGTTAC ATCACCCTGA TGGCGGAAAG CGCCCCGGAC
GCGGAGATCC GCGACTTTCA GCAGAGCCTG CGCGCCTGCA CCGAGGGGGC GGTGACCGGC
TCGGACGACG ATCAGTACTC CGAGGCCAAG TTCCTGCAGG TGAAGGCCAT CATCGAGCGC
TTCCGCGGCC GGGAGGGAAC CACCGACCTG GACCGGCGCT GGACCCGCAA GGTCACCGAT
GTGCGCAACT GGTTCGTCTT CTCCGCCTCC GAACGCTGGC GCGAGGACGA CAGCGAGTAC
GAGCACTACG CCGATTCCGG GGGCAAGTCC GGGGGGCAGA AGGAGAAGCT GGCCTATACC
GTGCTGGCCG CGAGCCTGGC CTACCAGTTC GGGCTGGAGT GGGGCGAGAC CCGCTCGCGC
TCGTTCCGCT TCGTGGTCAT CGACGAGGCC TTCGGCCGCG GCTCGGACGA GTCCGCCCGT
TACGGCCTGG AGCTGTTCCA GCGGTTGAAC CTGCAGTTGC TGATCGTCAC CCCGTTGCAG
AAGATCCACA TCATCGAGCC CTTCGTGGCC AGCGTGGGGT TTGTGCACAA CGTGGAGGGG
CGCGAGTCTA TGGTGCGCAA TCTCACCATT GCGGAGTACC AGGCGGAGAA GGCCGCCCGC
CAGCGTATCG CGCCATGA
 
Protein sequence
METQGLDFAG SDQRTGFRLQ RLEVFNWGTF HDRVWSLQPG GDNSLLTGDI GSGKSTLVDA 
ITTLLVPAQR ITYNKAAGAE AKERSLRSYV LGYYKSERSD AGLASKPVGL RDHNNYSVIL
GHFHNEGFGH DVTLAQVFWM KDTQGQPARF FVVADRPLNI AEHFANFGTD LANLRKRLRG
LPHVELYDTF PPYGGAFRRR FGIDNEQALE LFNQTVSMKS VGNLTEFVRS HMLEAFPVEE
RIEALIRHFD DLNRAHEAVL KAKSQVAQLI PLVRDADEHD QRATRARELR DCRQALRPWF
ASLKAELLDR RLDNLGQEHE RLQQRIEGLS TRLTEQRARR DELKQAIAEN GGDRLERIKA
ELERKRREKA ERSEKAERYR RLAHRLELPD ATSDEVFHAN REALTVAREA TETARSELQN
RITEASVELH QLKGRHEDLD AELASLRGRR SNIPAEILAI RTALCRALEV PAEDLPFAGE
LLQVRDDERD WEGAIERVLH NFGLSLLVPD SHYARVAEWV DRTRLRGRLV YYRVREARDA
PPPSLHADSL VRKLAIRPES AFYTWLEQAL AQRFDYACCT DLQQFRREKR AITRAGQIKA
GGGRHEKDDR HRIDDRTRYV LGWSNEAKIA ALEAEADGLA ERMQAIAKRI GGWEDERRAQ
EARRDTIQEL TLFQDFQALD WRPLAADIEA LEREYRELAE GSDVLRALER QLDELEKTIK
GVEGEVRQAD REISANELKQ EQARAQWQDC HTLLESTPEE AQQTYFPRLA AMRDEALGEH
TLTVESCDNR EKDMREWLQG KIDAEDKRLA QLRDRIIDAM RRYQDAWPLD SQEVDVSVEA
AGDYRAMLEQ LQADDLPRFE ARFKELLNEN TIREVAAFQS RLHRERQAIL ERIGTINRSL
REIDYNDNRY ITLMAESAPD AEIRDFQQSL RACTEGAVTG SDDDQYSEAK FLQVKAIIER
FRGREGTTDL DRRWTRKVTD VRNWFVFSAS ERWREDDSEY EHYADSGGKS GGQKEKLAYT
VLAASLAYQF GLEWGETRSR SFRFVVIDEA FGRGSDESAR YGLELFQRLN LQLLIVTPLQ
KIHIIEPFVA SVGFVHNVEG RESMVRNLTI AEYQAEKAAR QRIAP