Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1746 |
Symbol | |
ID | 4270853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2003003 |
End bp | 2004184 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638126504 |
Product | hypothetical protein |
Protein accession | YP_742582 |
Protein GI | 114320899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.681924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACA GCCCCGAGAT CGAGCTGCAG TTCCAGGAAG GCTGCGACGA CTGCGGTCGC CGGGAGGTCC GCCTGCCGCC ACGGCTGCCG GCGCTGGGCG ATGACTTCGA CTGGGACCTG CGGGACTACG ACGGCTTCCG CCTGTTCATG CTCGAGGAGC TGGCCGCGCG CTTCCCGGAG CGCAAGCGCT GGACCCCGGC CGACCTGGAG GTGGTGCTGG TGGAGGCGTT GGCCGCCGTG CTCGACCAGC TCTCCGACAC CCTGGACCGG GTGGCCGGCG AGGCCTACCT GGAGACCGCC CGGCGCCCCG AGTCGGTGCG CCGGCTGCTG TTGATGATCG GCTACGACGC GCTGGGGCTG CGCCGGCGCC AGGGCCTGCC GCCCTTCGAC GGGGAGCACG ACGGCGACCC GATTGCGGCC ATCGAACGCC TGGAACAGTA CTGGCTGGAC CATCCGGAGG ACATGGAGCG GGACCGCCAG GAGGGGCCGC GCCAGATCCA CCGCCAGCAC CGCATTGTCA CCACCGCGGA CTTTGTCACC CGGCTCGAGG CCCACCCGGT GGTGGAGCGC GCCCAGGCGG CCGAGACCTG GAACGGGAGC TGGTCGCTCA TCCAGGTCGC CGTCATCCCC TGGGCCCGGG TGGGCCTGGA CGCCCCGCAG GACTACGACG ATGCGCTTTG GACGCGCATC GAGCAATTCC ACGCCGAGCG CGACCTCTAC CTGCCCGGGC GCGACGGCCG GCCGCCGGTG CGCAGCCTGC TGCGCCACTA CCTGGACGAT TACCGCATGG TCGGCCAGGA GGTGCAGTTG GTGCCGGCCG AGGAGGTGGG CTTGTCGCTG TCGCTCTCCA TCCAGGTCGC CCCTCACTAC TTCCAGTCGG AGGTCCGCCG GGCGGTGGAG CAGGCCCTGG GAACCGGCCC GGGGGGGTTC TTCGAGCCGG GCCGGCTGCG CTTCGGCGAG GATGTCTGGG CCGGCGACCT GTTCCAGTAC CTGATGGCGC TGGACGGGGT GGAGAACCTC TGCCTCAACC GCTTCAAGCG CATCGGTACC CGCTTCCCGG ACATGAGTGG GACCGGGCGC ATCGCCCTCA ACGGCCTGGA GCTGGCCGTC TGCGACAACG AACCCGAACA CCCGGAGCGG GGCTATTTCC ACCTGCGGCT GCACGGCGGG AGGCGGGGCT GA
|
Protein sequence | MADSPEIELQ FQEGCDDCGR REVRLPPRLP ALGDDFDWDL RDYDGFRLFM LEELAARFPE RKRWTPADLE VVLVEALAAV LDQLSDTLDR VAGEAYLETA RRPESVRRLL LMIGYDALGL RRRQGLPPFD GEHDGDPIAA IERLEQYWLD HPEDMERDRQ EGPRQIHRQH RIVTTADFVT RLEAHPVVER AQAAETWNGS WSLIQVAVIP WARVGLDAPQ DYDDALWTRI EQFHAERDLY LPGRDGRPPV RSLLRHYLDD YRMVGQEVQL VPAEEVGLSL SLSIQVAPHY FQSEVRRAVE QALGTGPGGF FEPGRLRFGE DVWAGDLFQY LMALDGVENL CLNRFKRIGT RFPDMSGTGR IALNGLELAV CDNEPEHPER GYFHLRLHGG RRG
|
| |