Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1472 |
Symbol | |
ID | 4269264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1679762 |
End bp | 1682110 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638126228 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_742311 |
Protein GI | 114320628 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.528687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATA TCCCCCATCA GATTGCCGAA GAACTCGGCG TCCAGCCGCG CCAGGTGGAG GCCAGTGTGG CGCTCCTGGA TGGCGGCGCC ACGGTCCCGT TCATCGCCCG CTACCGCAAG GAGGCCACCG GCGGCCTGGA CGACAGCCAG TTGCGCCGCC TGGAGGAGCG GTTGACCTAC CTGCGCGAGC TGGCGAGCCG GCGGGAGACC GTCCTCAAGA CCATCCGGGA CCAGGGCAAG CTTACCCCGC AACTGGAGAC GGACATCCGC GCCGCCGACA GCAAGACCCG GCTGGAGGAC CTCTACCGCC CCTATCAGCC CAAGCGTCGC ACCAAGGCCC AGATCGCCCG GGAGCTCGGC CTGGAGCCGC TGGCCGACAG CCTGCTGGAG GACCCCTCGC AGGCGCCGGA GGCGCTGGCC GCGGACTATC TGGTGCCCGC GGGCCCGGAC GGCAAAGGCG GCGTCCCCGA TGCCCAGGCG GCACTGGACG GCGCCCGTCA GATCCTGATG GAGCGCTTCG CCGAGGATGC CGAGCTGGTC GGCCGGCTGC GCGAGTGGGC CTGGGGCAAC GCCCGGCTGC GCAGCCGGGT TATCGAGGGC AAGGAGCGGG AGGGCGCCAA GTTCCGTGAC TACTTCGAGC ACGAGGAGCC GCTGGCCCGC ATCCCCTCCC ACCGTGCGCT GGCCCTGTTC CGCGGCCAGA GCGAGGATAT CCTGCAATTG AGCCTGATCT GGGAAGGGGA GGACGGCGAG AGCCCCACCG CGGAGGGCAT GATCGCGCGC CGCTTCGGTA TCCGTCATGG GGACCGTCCC GCCGACGACT GGCTGGCCCG CACCGTGCGC ATGAGCTGGC GCGTCAAGCT GTCGCTGCAG CTGGAACTGG CCCTGAAGCG CCGGCTGCGC GAGGCCGCCG AGGAGGAGGC CATCCGCGTC TTCGGGCGTA ACCTCAACGA CCTGCTGCTG GCCGCCCCGG CCGGTGCGGT CGCCACCATG GGGCTGGACC CGGGCATCCG CACCGGGGTC AAGGTGGCCG TGGTCGACGG CACCGGCAAG GTGGTAGACA CCGCGACGGT GCACCCCTTC CCCCCGCGCA ATGACCGCAA CGGTGCCCTG GCCACCCTCG CCGGGCTGGC CAAGCGGCAC GCCGTCCGCC TGGTGGCCAT TGGCAATGGC ACCCATTCGC GCGAGACCGA TGCCCTGGTG GCGGAACTGA TGCAGGCCAT GCCGGAACTC CCACTCACCC GGGTCACCGT CTCCGAGGCC GGGGCCTCGG TCTACTCGGC GTCTGAGTAC GCCAGCCGTG AACTGCCGGA ACTGGACGTG GCCCTGCGCG GGGCGGTCTC CATCGCCCGG CGCCTGCAGG ACCCGCTGGC CGAGCTGGTG AAGATCGAGC CCAAGGCCAT CGGCGTCGGC CAGTACCAGC ACGATGTCAG CCAGGTGAAG CTGGCGCGCA CCCTGGACAA CGTGGTGGAG GACTGCGTGA ACGCCGTGGG CGTGGACGTG AACACCGCCT CCGCCCCGCT GCTCGCCCGG GTCGCCGGCC TCACCCCCGC GCTGGCCGAG AACGTGGTCA GCCACCGCGA CCGGCACGGG CCCTTTCCCG ACCGGAAGAC CCTGCTCGAG GTGCAGCGCC TGGGCCCCAA GGCCTACGAG CAGTGCGCCG GTTTCCTTCG GATCCGCGGT GGCCGCAATC CCCTGGATGC CAGCGCCGTC CACCCGGAGG CCTATCCGGT GGTCGAGCGG ATCCTCAAGG ACAGCGGCAA GTCCATTGAC CAGCTCATCG GCGACAGCCC ATTCCTGCAG CGGCTCGAAC CGGCCCGCTA CACCGATGAC CAGTTCGGCG AGCCGACGGT GCGCGACATC CTCGCCGAGT TGGACAAACC CGGTCGTGAC CCGCGCCCGG AGTTTCGCAC CGCCCGCTTC CGCGAGGGGG TGAACACCCT GCAGGACCTG GCGCCGGGCA TGGTGCTGGA GGGAACGGTC ACCAACGTGA CCAACTTCGG CGCCTTCGTC GATATCGGTG TCCACCAGGA CGGTCTGGTG CACATCTCGG CGCTGGCCGA GCGCTTCGTG CGCGACCCGC ACGAGGTGGT CAAGGCCGGC GACATCGTCA AGGTGAAGGT CATGGAGGTG GACCTCGAGC GCAAGCGCAT CGGCCTGTCC ATGCGCCTGG ATGACGAACC CGGAGAGGGC CGGTCGCGCC GCGGCAGCAA GCCGGCCGAT GGCGCCGCGG CCCGGCGGGG TAAGGGCGGC AAGGACAACC GGCCCCAGGG GGGGCGCGGC GCCCGTCGCG AGGAGAAGCC CATGGGTGCC ATGGCCGAGG CACTGGCGGC CCTGAAAAAG GGGCAATGA
|
Protein sequence | MTDIPHQIAE ELGVQPRQVE ASVALLDGGA TVPFIARYRK EATGGLDDSQ LRRLEERLTY LRELASRRET VLKTIRDQGK LTPQLETDIR AADSKTRLED LYRPYQPKRR TKAQIARELG LEPLADSLLE DPSQAPEALA ADYLVPAGPD GKGGVPDAQA ALDGARQILM ERFAEDAELV GRLREWAWGN ARLRSRVIEG KEREGAKFRD YFEHEEPLAR IPSHRALALF RGQSEDILQL SLIWEGEDGE SPTAEGMIAR RFGIRHGDRP ADDWLARTVR MSWRVKLSLQ LELALKRRLR EAAEEEAIRV FGRNLNDLLL AAPAGAVATM GLDPGIRTGV KVAVVDGTGK VVDTATVHPF PPRNDRNGAL ATLAGLAKRH AVRLVAIGNG THSRETDALV AELMQAMPEL PLTRVTVSEA GASVYSASEY ASRELPELDV ALRGAVSIAR RLQDPLAELV KIEPKAIGVG QYQHDVSQVK LARTLDNVVE DCVNAVGVDV NTASAPLLAR VAGLTPALAE NVVSHRDRHG PFPDRKTLLE VQRLGPKAYE QCAGFLRIRG GRNPLDASAV HPEAYPVVER ILKDSGKSID QLIGDSPFLQ RLEPARYTDD QFGEPTVRDI LAELDKPGRD PRPEFRTARF REGVNTLQDL APGMVLEGTV TNVTNFGAFV DIGVHQDGLV HISALAERFV RDPHEVVKAG DIVKVKVMEV DLERKRIGLS MRLDDEPGEG RSRRGSKPAD GAAARRGKGG KDNRPQGGRG ARREEKPMGA MAEALAALKK GQ
|
| |