Gene Information |
Locus tag | Mlg_0046 |
Symbol | |
ID | 4270915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 49623 |
End bp | 51101 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638124771 |
Product | hypothetical protein |
Protein accession | YP_740893 |
Protein GI | 114319210 |
COG category | [S] Function unknown |
COG ID | [COG3517] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03355] type VI secretion protein, EvpB/VC_A0108 family |
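The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


length = gene_length(49623, 51101)
print(length)                  # 1479
print(protein_length(length))  # 492
```

Both results match the reported Gene Length (1479 bp) and Protein Length (492 aa).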
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAA TGGTAGCCGA GCACACCGCG TCAGCGGCAT CGGACGGCCC CCTGCTGGAC CAGATCATGG CCAAGACCCG CATGGCGCCC AGCGACGAGG GCTACGATAT CGCCCGGCAG GGTGTTGCGG CCTTCATCAG CGAGCTGCTC AAGGGTGAGG GGGAGGAGAC GCCGCAGGTC AATAAAGCGG TGGTCGACCG GATGATCGTC GAACTGGACC GCAAGATCGG GCAGCAGGTG GACGAGATCC TTCATCTGCC CGAATTCCAG CGCCTGGAGT CCGCCTGGCG TGGCCTCAAG CTGTTGGTCG ATCGCACGGA CTTCCGCGAG AATATCCGCC TGAACCTCCT CCATGCCAGC AAGGAGGAAT TGCTGGAGGA TTTCGAGTTC TCGCCGGAAC TGCCCCAGAG CGGGTTGTAC CAGCACGTCT ACGCCTCCGG TTACGGTCAG TTCGGCGGCG ACCCGGTGGC GGGGGTCATC GGGGTCTACG ACTTCCAGCC CACCACCCCG GACATCAAGC TCCTCCAGTA CAGCGCCGCC GTGGGGGCCA CCGCCCACGC CCCCTTCATC TCGTCGGTGG CCCCGGAGTT CTTCGGTATC GAGCGTTACC AGGACCTGCC CAACATCAAG GACCTGGCCG CCACCCTTCA GGGGCCCAAA TACGCCCGTT GGCGCAGCCT GCGTGAGAGC GAGGATGCCC GATACCTCGG CCTCACGGCC CCGCGCTTTC TGCTGCGCAT GCCCTACGAC CCGGTGGAGA ACCCGGTAAA GACCTTCAAC TACCGGGAGT CGGTCAGCGA GAGCCACGAG CACTACCTGT GGGGCAACAC GGCCTATCTG TTCGCCGAGC GGCTTACCGA CAGCTTCGCG CGTTACCGCT GGTGCCCGAA CATCATTGGC CCGCAGAGCG GCGGTGCCGT GGAGCACCTG CCGGTGCATA CCTTCGAGTC CATGGGCCAA CTGCAGGCCA AGATCCCTAC CGAGGTGCTG ATCACCGACC GGCGGGAGTA CGAACTGGCC GAGGAAGGGT TCATCGCCCT GACCATGCGC AAGGACAGCG ACAACGCCGC CTTCTTCTCG GCCAACTCGG TGCAGCGGCC CCGGCGCTTT CCCAACACGG CGGAGGGCCG CGCGGCGGAG ACCAACTTCA AGCTGGGCAC GCAACTGCCC TATCTGTTCG TCGTCAATCG CCTCGCCCAC TACATCAAGG TGCTGCAGCG CGAGCAGATC GGTGCCTGGA AGGAGCGCCA GGACCTGGAG CGGGAGCTGA ACGACTGGAT CCGGCAGTAC GTGGCGGACC AGGAAAACCC GCCGGCGGAG GTGCGCAGCC GGCGCCCCCT GCGCGCCGCC TCCATCCAGG TCTCCGACCT GGAGGGCGAT CCGGGCTGGT ATCAGGTCTC GCTGGCCGTG CGTCCCCACT TCAAGTACAT GGGGGCCAAC TTCGAGCTCT CCCTTGTCGG TCGACTGGAC AAGGAGTAA
|
Protein sequence | MNEMVAEHTA SAASDGPLLD QIMAKTRMAP SDEGYDIARQ GVAAFISELL KGEGEETPQV NKAVVDRMIV ELDRKIGQQV DEILHLPEFQ RLESAWRGLK LLVDRTDFRE NIRLNLLHAS KEELLEDFEF SPELPQSGLY QHVYASGYGQ FGGDPVAGVI GVYDFQPTTP DIKLLQYSAA VGATAHAPFI SSVAPEFFGI ERYQDLPNIK DLAATLQGPK YARWRSLRES EDARYLGLTA PRFLLRMPYD PVENPVKTFN YRESVSESHE HYLWGNTAYL FAERLTDSFA RYRWCPNIIG PQSGGAVEHL PVHTFESMGQ LQAKIPTEVL ITDRREYELA EEGFIALTMR KDSDNAAFFS ANSVQRPRRF PNTAEGRAAE TNFKLGTQLP YLFVVNRLAH YIKVLQREQI GAWKERQDLE RELNDWIRQY VADQENPPAE VRSRRPLRAA SIQVSDLEGD PGWYQVSLAV RPHFKYMGAN FELSLVGRLD KE
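The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the short input is just the first two blocks of the gene sequence, not the full record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First two display blocks of the gene sequence, used only as a small example.
example = "ATGAACGAAA TGGTAGCCGA"
print(len(strip_blocks(example)))       # 20
print(round(gc_fraction(example), 2))   # 0.45
```

Applied to the full gene sequence, the same function would be expected to give a value close to the 65% reported in the GC content field, although the exact figure depends on how the database rounds.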