Gene Information |
Locus tag | Mlg_2294 |
Symbol | |
ID | 4268392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2602999 |
End bp | 2606376 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638127054 |
Product | hypothetical protein |
Protein accession | YP_743126 |
Protein GI | 114321443 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
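The coordinate and length fields above agree with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mlg_2294.
length = gene_length(2602999, 2606376)
print(length)                  # 3378
print(protein_length(length))  # 1125
```

Both results match the Gene Length (3378 bp) and Protein Length (1125 aa) rows of the record.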
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.237724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCC AGGGACTGGA TTTCGCCGGC TCCGACCAGC GCACCGGTTT TCGCCTCCAG CGGCTGGAGG TGTTCAACTG GGGCACCTTC CACGACCGCG TCTGGTCGCT GCAGCCCGGC GGCGACAACA GCCTGCTGAC CGGGGACATC GGCTCCGGCA AGTCCACCTT GGTGGACGCC ATCACCACGC TGCTGGTGCC GGCTCAGCGG ATCACCTACA ACAAGGCCGC CGGCGCGGAG GCCAAGGAGC GCAGCCTGCG CTCTTACGTC CTCGGCTACT ACAAGTCGGA GCGTAGCGAC GCCGGCTTGG CCAGCAAGCC GGTGGGCCTG CGCGACCATA ACAACTACTC GGTCATCCTC GGCCACTTCC ACAACGAGGG CTTCGGCCAC GATGTGACCC TGGCCCAGGT CTTCTGGATG AAGGACACCC AGGGCCAGCC GGCGCGGTTT TTCGTGGTCG CCGACCGCCC GCTGAACATC GCCGAGCACT TCGCCAATTT CGGCACCGAC CTGGCCAACC TGCGCAAGCG CCTGCGCGGG CTGCCCCACG TGGAGTTGTA CGACACCTTT CCGCCCTACG GCGGTGCCTT CCGGCGGCGC TTCGGCATCG ACAATGAGCA GGCCCTGGAG CTCTTTAACC AGACGGTCTC CATGAAGTCG GTGGGTAACC TGACCGAGTT CGTGCGCAGT CACATGCTGG AGGCCTTCCC CGTCGAGGAG CGGATCGAGG CGCTGATCCG CCATTTCGAC GACCTCAACC GGGCCCACGA GGCGGTGTTG AAGGCCAAGT CGCAGGTGGC GCAGTTGATT CCCCTGGTGC GCGATGCCGA CGAGCACGAC CAACGGGCCA CCCGGGCCCG GGAGCTGCGC GATTGCCGCC AGGCGTTGCG GCCCTGGTTC GCCTCGCTGA AGGCCGAGCT GCTGGACCGC CGGTTGGACA ACCTGGGCCA GGAGCACGAG CGGCTGCAGC AGCGCATTGA GGGGCTGAGC ACTCGGCTCA CCGAGCAGCG GGCCCGGCGC GACGAGCTCA AGCAGGCGAT CGCCGAGAAC GGCGGCGACC GCCTGGAGCG GATCAAGGCG GAGCTGGAGC GCAAGCGGCG GGAGAAGGCG GAGCGCAGCG AGAAGGCCGA GCGCTACCGG CGGCTGGCGC ACCGGCTGGA GCTGCCCGAT GCCACCAGCG ACGAGGTCTT TCACGCCAAC CGCGAGGCGC TCACCGTGGC CCGCGAGGCC ACCGAGACCG CCCGCAGCGA GCTGCAGAAC CGGATTACGG AGGCCAGCGT GGAGCTGCAC CAGCTCAAGG GGCGGCACGA GGACCTGGAT GCCGAGCTGG CATCGCTGCG CGGTCGACGC TCCAACATCC CGGCGGAGAT CCTCGCCATC CGCACCGCCC TGTGCCGCGC GCTGGAGGTG CCCGCCGAGG ACCTGCCCTT CGCCGGTGAA CTGCTGCAGG TGCGTGACGA CGAGCGCGAC TGGGAGGGCG CCATCGAACG GGTGCTGCAC AACTTCGGCC TGTCGCTGCT GGTGCCCGAC AGCCACTACG CCCGCGTCGC CGAGTGGGTG GACCGCACCC GGTTGCGGGG GCGGTTGGTC TACTACCGCG TGCGCGAAGC CCGCGACGCG CCGCCGCCGT CGCTGCACGC CGACTCGCTG GTGCGCAAGC TGGCCATCCG GCCGGAGTCG GCGTTTTATA CCTGGCTGGA GCAGGCGCTG GCGCAGCGCT TCGATTACGC TTGCTGCACC GATCTGCAGC AATTCCGTCG TGAGAAGCGG GCGATCACCC GTGCCGGGCA GATAAAGGCC 
GGCGGTGGCC GCCACGAGAA GGACGACCGC CACCGTATCG ATGACCGCAC CCGTTATGTG CTGGGCTGGA GCAACGAGGC GAAGATCGCC GCGCTGGAGG CCGAGGCCGA CGGGCTGGCC GAGCGCATGC AGGCCATCGC CAAGCGCATC GGTGGCTGGG AGGACGAACG GCGGGCCCAG GAGGCCCGCC GCGACACCAT CCAGGAGCTG ACCCTGTTCC AGGATTTTCA GGCCCTGGAC TGGCGTCCGC TGGCTGCCGA CATTGAGGCG CTGGAGCGCG AGTACCGGGA GCTGGCCGAG GGCTCGGACG TGCTCCGGGC GCTGGAGCGG CAACTGGATG AGCTGGAAAA GACCATCAAG GGCGTCGAGG GCGAGGTCCG CCAGGCGGAT CGCGAGATCA GCGCCAACGA GCTGAAACAG GAGCAGGCCC GGGCCCAGTG GCAGGATTGC CACACGCTGT TGGAGTCGAC ACCGGAAGAG GCGCAGCAGA CCTACTTTCC GCGCCTGGCC GCCATGCGCG ACGAGGCCCT GGGTGAGCAC ACCCTGACGG TGGAGTCCTG CGACAACCGC GAGAAGGACA TGCGCGAATG GCTGCAGGGC AAGATCGACG CGGAGGACAA GCGCCTGGCG CAGTTGCGCG ACCGCATCAT CGACGCCATG CGCCGCTACC AGGACGCCTG GCCGCTGGAC AGCCAGGAGG TGGATGTGAG CGTCGAGGCC GCCGGCGACT ATCGCGCCAT GCTCGAGCAG TTGCAGGCAG ATGATCTGCC CCGCTTCGAG GCCCGCTTCA AGGAGCTGCT CAACGAAAAC ACCATCCGCG AGGTGGCCGC CTTCCAGAGC CGGCTGCATC GGGAGCGCCA GGCGATCCTG GAGCGCATCG GCACCATCAA CCGCTCGCTG CGTGAGATCG ACTACAACGA CAACCGTTAC ATCACCCTGA TGGCGGAAAG CGCCCCGGAC GCGGAGATCC GCGACTTTCA GCAGAGCCTG CGCGCCTGCA CCGAGGGGGC GGTGACCGGC TCGGACGACG ATCAGTACTC CGAGGCCAAG TTCCTGCAGG TGAAGGCCAT CATCGAGCGC TTCCGCGGCC GGGAGGGAAC CACCGACCTG GACCGGCGCT GGACCCGCAA GGTCACCGAT GTGCGCAACT GGTTCGTCTT CTCCGCCTCC GAACGCTGGC GCGAGGACGA CAGCGAGTAC GAGCACTACG CCGATTCCGG GGGCAAGTCC GGGGGGCAGA AGGAGAAGCT GGCCTATACC GTGCTGGCCG CGAGCCTGGC CTACCAGTTC GGGCTGGAGT GGGGCGAGAC CCGCTCGCGC TCGTTCCGCT TCGTGGTCAT CGACGAGGCC TTCGGCCGCG GCTCGGACGA GTCCGCCCGT TACGGCCTGG AGCTGTTCCA GCGGTTGAAC CTGCAGTTGC TGATCGTCAC CCCGTTGCAG AAGATCCACA TCATCGAGCC CTTCGTGGCC AGCGTGGGGT TTGTGCACAA CGTGGAGGGG CGCGAGTCTA TGGTGCGCAA TCTCACCATT GCGGAGTACC AGGCGGAGAA GGCCGCCCGC CAGCGTATCG CGCCATGA
Protein sequence | METQGLDFAG SDQRTGFRLQ RLEVFNWGTF HDRVWSLQPG GDNSLLTGDI GSGKSTLVDA ITTLLVPAQR ITYNKAAGAE AKERSLRSYV LGYYKSERSD AGLASKPVGL RDHNNYSVIL GHFHNEGFGH DVTLAQVFWM KDTQGQPARF FVVADRPLNI AEHFANFGTD LANLRKRLRG LPHVELYDTF PPYGGAFRRR FGIDNEQALE LFNQTVSMKS VGNLTEFVRS HMLEAFPVEE RIEALIRHFD DLNRAHEAVL KAKSQVAQLI PLVRDADEHD QRATRARELR DCRQALRPWF ASLKAELLDR RLDNLGQEHE RLQQRIEGLS TRLTEQRARR DELKQAIAEN GGDRLERIKA ELERKRREKA ERSEKAERYR RLAHRLELPD ATSDEVFHAN REALTVAREA TETARSELQN RITEASVELH QLKGRHEDLD AELASLRGRR SNIPAEILAI RTALCRALEV PAEDLPFAGE LLQVRDDERD WEGAIERVLH NFGLSLLVPD SHYARVAEWV DRTRLRGRLV YYRVREARDA PPPSLHADSL VRKLAIRPES AFYTWLEQAL AQRFDYACCT DLQQFRREKR AITRAGQIKA GGGRHEKDDR HRIDDRTRYV LGWSNEAKIA ALEAEADGLA ERMQAIAKRI GGWEDERRAQ EARRDTIQEL TLFQDFQALD WRPLAADIEA LEREYRELAE GSDVLRALER QLDELEKTIK GVEGEVRQAD REISANELKQ EQARAQWQDC HTLLESTPEE AQQTYFPRLA AMRDEALGEH TLTVESCDNR EKDMREWLQG KIDAEDKRLA QLRDRIIDAM RRYQDAWPLD SQEVDVSVEA AGDYRAMLEQ LQADDLPRFE ARFKELLNEN TIREVAAFQS RLHRERQAIL ERIGTINRSL REIDYNDNRY ITLMAESAPD AEIRDFQQSL RACTEGAVTG SDDDQYSEAK FLQVKAIIER FRGREGTTDL DRRWTRKVTD VRNWFVFSAS ERWREDDSEY EHYADSGGKS GGQKEKLAYT VLAASLAYQF GLEWGETRSR SFRFVVIDEA FGRGSDESAR YGLELFQRLN LQLLIVTPLQ KIHIIEPFVA SVGFVHNVEG RESMVRNLTI AEYQAEKAAR QRIAP
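The gene sequence is displayed in space-separated blocks of ten bases, so any recomputation of a summary value such as the GC content has to strip that whitespace first. The sketch below shows one way to do this. The helper name `gc_fraction` is hypothetical, and the example string is only the first two blocks of the gene sequence, so its result is not the whole-gene figure reported in the record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring any whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence shown above.
example = "ATGGAAACCC AGGGACTGGA"
print(round(gc_fraction(example) * 100))  # 55
```

Passing the full gene sequence string to the same function would give a value that can be compared with the GC content row of the record.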