Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0031 |
Symbol | |
ID | 4268888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 33968 |
End bp | 35407 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638124758 |
Product | hypothetical protein |
Protein accession | YP_740880 |
Protein GI | 114319197 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGATC CGACGCTCAA CCCCTCCCCG GTCCGCCCGG CCGCTGCGGC CGCGCCGCGG GGCACCCCCG CACCCTCGCT TCTCTCTGAG CCCTTGCACG ATGCCACGAC CGGACCGCTG GCCGCCGTCA ACGAGCACCT GCACCGGCTT CACCCCCCAC CGCCGGGCGG CGGGGCGCCT GCCCGCTGGA GCACCACCGG CGGGCAGGCC CGGACGCGGC CCGACGGCGC GGGGGAGAGC AGCGTTGCGG CACTGATAAA GGCCCTGGAC GGTCTGCCCG CCGAGCCGGG TCTGCTGGGG CTGGATGTCA CCGATGTGCG CCAGTGCCGG GCGCTGCTCG ACTACTGCGC GCTGATTGCG GCCACCCTGG CCCGCGACCC GATGGGCCGC CGCTGGCTCT ATCGCCGGCT GCGGGATCGC AGCGGCCTGC TGGGCCTGGC CCCCTTTAAC TTCCAGCCGG CGCTGGCACG GCTGCTGGAC GCACTGGCCG GGCGCCTCCA GCGGCCCGCT GGCGGCGGAG GCAGCGGCCT CCCATCCGCC CACCCCGGTG GGGACCGGCC CGCTCCCCAG ACCGTGCTGG ACCTGAAGGC GTTGCAGGCC CATCCCCTCT ACCGGGCCCT GCCCGAGAGA ACCCGGGCCG CCATCCGCGC CCTGCGCCGG GTGGCAGGTG GACCGGAGCC GACCAGCTGG GAGGCCCTGG CCGGCTATCT GCTGCCGGCG CTCTGCCGGG AGGCGGACGG GCGGCACAGC CTCGCCTTCT CCTTACTGGT CCTCCTGACC CAGGGCTCCC GCACGATCGG GACGGCGCTC GAGCTCCAGC GCGACGATCA CCATGTGCAG TGCCACCGGG CCTGGCGACG CCACCCTTTC TTGCTCCAGG GCGCTGTCCA AAGCCGTGCG CAATGCCTCC AGGGCTGTGC GCCCGTGCGG GACGCCCCCT GGCAGGCCCG GGAGCACAGC CGCTCGCGGG CGCTGCTGAC CCTCTACCGG AAAAACCTTC GCCGCCTGCT GCTGGCCGCC CCCCAACGGT TCACGGTGGT ACGACCCGGC GCACTGTCGC GCGACCCCGG GGCCCGGGCG CGGGCCCTGC TGGCAGTCCT GGACAGAACC GAGGGCAGTG CCGTGGCCGC ATCCATTGGC GCCGATAAGC CCACCCACCC GGCCTTACGC CACTGGCTGG AGGAGGGTGC GGAGGCCCCG CCTTTGCTGA TGGCCGGAAT GGCACTGACC AACGCCTGCG CCAATCGCGG CCTGCTGGGT GACAGCACCG CGGAGCGGGC CCTGGGCCGG CAAGTGATCG ACAGCGGCTG GGCCACCCTG GCGCTATGGT TGGCTCGCCT GTGCGCCAAT CACGGGGCGA GGGAGACGGG CTCCCCGGCA TGGATGGTCG CCGGGGCTGC GATGGCAACC ATCACCGGCC TGGGGCGGGA GCGGGCGTAG
|
Protein sequence | MDDPTLNPSP VRPAAAAAPR GTPAPSLLSE PLHDATTGPL AAVNEHLHRL HPPPPGGGAP ARWSTTGGQA RTRPDGAGES SVAALIKALD GLPAEPGLLG LDVTDVRQCR ALLDYCALIA ATLARDPMGR RWLYRRLRDR SGLLGLAPFN FQPALARLLD ALAGRLQRPA GGGGSGLPSA HPGGDRPAPQ TVLDLKALQA HPLYRALPER TRAAIRALRR VAGGPEPTSW EALAGYLLPA LCREADGRHS LAFSLLVLLT QGSRTIGTAL ELQRDDHHVQ CHRAWRRHPF LLQGAVQSRA QCLQGCAPVR DAPWQAREHS RSRALLTLYR KNLRRLLLAA PQRFTVVRPG ALSRDPGARA RALLAVLDRT EGSAVAASIG ADKPTHPALR HWLEEGAEAP PLLMAGMALT NACANRGLLG DSTAERALGR QVIDSGWATL ALWLARLCAN HGARETGSPA WMVAGAAMAT ITGLGRERA
|
| |