Gene Information |
Locus tag | Mlg_2524 |
Symbol | |
ID | 4270163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2866702 |
End bp | 2867913 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638127283 |
Product | hypothetical protein |
Protein accession | YP_743354 |
Protein GI | 114321671 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
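
The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(2866702, 2867913)
print(length, protein_length(length))  # 1212 403
```

Both results match the Gene Length (1212 bp) and Protein Length (403 aa) fields of the record.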
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.585114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGC AACGCCACCC GGCCCTCGAC CTGCCCGAGC CCGATGCCAG CGCCCGCGCC CACAGCGAGG CCCTGCAGGC GCGGATCCGC GACGCCATCC GATCCGCCGG CGGCTGGCTG CCGTTCGACC GCTACATGGG CATGGCCCTG TACGAGCCCG GACTGGGTTA CTACAGCGCC GGCGCGCCGC GCTTCGGCGA AGGTGGCGAC TTCACCACCG CACCGCTCAT CTCGCCACTT TTCAGCCGCA CCCTGGCCCA CACCGTACAG CGCGCCCTGC AGGCCCTGGA GCTCGCCACC GGCCAGGGCG AGGTGCTGGA ACTGGGCGCC GGCAGCGGAC GGATGGCCGC CGACATCCTG CTGGAGCTGG AGCGGCTGGG GCAGCTTCCC GCCCGCTACC TCATCCTCGA GGTCAGCGCC GCCCTGCGCC AGGAACAGCA CCGCACCCTG GGTGAACACG CCCCCCACCT GCTCGACCGG GTGGAGTGGC TGGAACAGCT CCCGGAACAC CCCATTACCG GCGCCCTCCT CGCCAACGAA GTCCTCGACG CCCTGCCCTT TCGCTGCTTC GAGCGCGGGC GCGACGACAT CCTGGAACGC GGCGTGGCGC TGGACGACGA CGACCACCCG CAGTGGGCCA CCCGTCCCGC CGATGAGCCC CTGGCCGGCC ACGTCCGCCA CATCGAGGCC GAGACCGGCC GGCGGCTGCC CCCCGGTTAC CGCAGCGAGT GCCTGCCGCA ACTGGCCGAT TGGCTGCGCG ACACCACCCG CTGCCTGGCG CGGGGCCTGG TACTCTACAT CGATTACGGC TACCCCCGGC GCGAGTACTA CCTGCCCGAC CGCCACATGG GCACCCTGCT CTGCCACTAC CGCCACCGCG CCCACGAGGA CCCTTTCCTC TGGCCCGGGC TGCAGGACAT CACCGCCTTC GTCGACTTCA CGGCCGTGGC CGAGGCGGCA CTGGCCGCCG ACCTGGACGT GCTCGGCTTC ACCAGCCAGG CCCAATACCT GCTCGCCGCC GGCCTGGCGC ACCTGGCCGA CGAGGCCATG GCGCAGCACG ACGACGACAT GCACCGCCTT CAGATCGCGC AACAGGTCCG CCGCCTCACC CTGCCCTCCG AACTGGGCGA GCGCTTCAAG GTCCTGCCCC TGGGCCGCGA CCTGGCCCCC CTGCCGGAAT TCATCCGCAC CGACCAGCGC CACCGCCTTT GA
|
Protein sequence | MTRQRHPALD LPEPDASARA HSEALQARIR DAIRSAGGWL PFDRYMGMAL YEPGLGYYSA GAPRFGEGGD FTTAPLISPL FSRTLAHTVQ RALQALELAT GQGEVLELGA GSGRMAADIL LELERLGQLP ARYLILEVSA ALRQEQHRTL GEHAPHLLDR VEWLEQLPEH PITGALLANE VLDALPFRCF ERGRDDILER GVALDDDDHP QWATRPADEP LAGHVRHIEA ETGRRLPPGY RSECLPQLAD WLRDTTRCLA RGLVLYIDYG YPRREYYLPD RHMGTLLCHY RHRAHEDPFL WPGLQDITAF VDFTAVAEAA LAADLDVLGF TSQAQYLLAA GLAHLADEAM AQHDDDMHRL QIAQQVRRLT LPSELGERFK VLPLGRDLAP LPEFIRTDQR HRL
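
The sequences above are printed in space-separated blocks of ten characters. Before computing anything from them (for example a GC fraction to compare with the GC content field), the blocks have to be joined. The sketch below shows one way this might be done in Python. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). The demonstration uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence shown above, for illustration only.
fragment = clean_sequence("ATGACCCGGC AACGCCACCC")
print(len(fragment), f"{gc_fraction(fragment):.2f}")  # 20 0.70
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_fraction` and `ends_with_stop` would allow the reported GC content and the terminal stop codon to be checked against the record.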