Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1251 |
Symbol | |
ID | 4269173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1454823 |
End bp | 1455800 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638126001 |
Product | XRE family transcriptional regulator |
Protein accession | YP_742090 |
Protein GI | 114320407 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.92911 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCAG ACCACGACAG TCCGCTGTCC CCCGACCACG AGCGCGACCC CGGTCGCGGC GGCGAGACCC CGGGGCAGCT GCTCGCCAGG GCACGGGAGG CGCGAGGCCT GAGCCGGAAG GCCGCGGCCG ATGCGCTGAA CCTGCCCTTA CGGACCCTTG AGGCGCTCGA GGGGGACGAC TATGACAACC TGCCGCCGCT CACCTTTGTC AAAGGCTATC TCAGGGCCTA TGCCCAGCTC CTGGAGATTG ACTCGGACCG GCTGCGGGAG GCGCTGGCCC GGCTCGACCT GGAGCGGCCG GCCAGCGGCG CTCAGCAGCA GCCGGGCCAG GGCTCGACGT CAGTGACCGG TGGCGATACC TCTCGGGGGG GGCTCCAAGG CCGCCGGCTC CGGGGCTGGC TGCTGGGTGT GAGTAGCGTG GTCGTGGCCG CGGTCATTGC CGTGCTCGCC TGGCTCTGGC TGGTCGAGAA TGGGGTGTCC GGGGATAGCG AAGCACCACC GGTGGCGGAG GAGGGCACGG CGCCGTCCGA GGAGGCCCCG GCGGAGGCGG TGGTCGCCGA CGAGCTGGCC GAGCGGACAC TGCCTGAGGC GGTGGATGAG GTGGTCCCCG AGGCGTCGGT GCCGGACGAC ACCGGGGCTG ACGTCAGTGT CCCTGAGACG CTGGAACCCC CGGTGGTGCC CGAGGTCGAG GCGGAGCCGG AACCCGCTGT GGCGGAGCCC GAGCCCGAGC CCGCGCCGGA GCCGGCCGGT CTGGTCCTGC ATATTTCCGG GACCACCTGG CTGGAGGTCC GCGACGAGGA CGGTGAGCGT CTGATGGTAG GCAACTACCA GGGCGAGGAG ACCCACCAAC TGCAGGGCGA GGGGCCCTTC GCCCTGGTCA TCGGTAATGC GGACGCCGTT CGGGTGGAGT TCCAGGGTGA CGCCGTCGAT CTGGCGCCCC ACACCCGTGG CAATGTGGCG CGGCTGACCA TCCCTTAG
|
Protein sequence | MSSDHDSPLS PDHERDPGRG GETPGQLLAR AREARGLSRK AAADALNLPL RTLEALEGDD YDNLPPLTFV KGYLRAYAQL LEIDSDRLRE ALARLDLERP ASGAQQQPGQ GSTSVTGGDT SRGGLQGRRL RGWLLGVSSV VVAAVIAVLA WLWLVENGVS GDSEAPPVAE EGTAPSEEAP AEAVVADELA ERTLPEAVDE VVPEASVPDD TGADVSVPET LEPPVVPEVE AEPEPAVAEP EPEPAPEPAG LVLHISGTTW LEVRDEDGER LMVGNYQGEE THQLQGEGPF ALVIGNADAV RVEFQGDAVD LAPHTRGNVA RLTIP
|
| |