Gene Information |
Locus tag | Mlg_1865 |
Symbol | |
ID | 4268083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2124484 |
End bp | 2127168 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638126621 |
Product | UTP-GlnB uridylyltransferase, GlnD |
Protein accession | YP_742699 |
Protein GI | 114321016 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
|
|
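The coordinate, length, and protein-length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene span includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table (Mlg_1865).
START_BP = 2124484
END_BP = 2127168


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2685, matching "Gene Length"
print(protein_length(length_bp))  # 894, matching "Protein Length"
```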
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.678957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC ACACCGCAGT AACGGCGGGC GAGTTGTTCG ACGCCCACGC CCTGGACACC GCCCTGGAGC GGGGCGAGCC GGCGATCCCG GCCCTGCGCC GGGCCCTCGA GGAGGGGGAC CGGGCACTGG CCGGGCAATT CCATGCCGGC ACCCCCGCTG CCGAACTGGT GCCGCGCCGC GCCTACCTGA TGGACCGCAT CATTCGCCGG CTGTGGGCCC AACACCTACA GCATGCCGAG GACCGCTGCG CCCTGGTGGC GGTGGGTGGC TACGGCCGCG GCGAGTTGCA CCCGGGCTCG GATGTGGATG TCATGATCCT GATCGAGCCC GATGCCCGCG AGGCGTTGGG CGATGCCCTG GAGGGGATGA TCACCGCCCT CTGGGACCTG GGCCTGGAGG TGGGCCACAG CGTGCGCAGT ATCGAGGACT GCGTGCGCGA GGCGGAGATG GACATCACCG TGGCCACCAA TCTGATGGAG GCACGCCTGC TCTGCGGCAG CGCCGCCCTC TTCCAGACCA TGCGCCAGGT GACCGGCCCC GACCGGCTGT GGCCCTCGCG GGCCTTTTTC GAGGCCAAGT GGGCCGAGCA GGTGGCCCGT CACCAGAAGT ATCACGACAC CGCCTACAAC CTGGAGCCCA ACATCAAGAG CAGCCCCGGC GGGTTGCGCG ACATCCAGAT GGTGGGCTGG GTGGCCAAGC GCCATTTCCG CGCCAACACC CTGCGCGCCC TGGTCGATCA GCGGTTTCTC ACCGAGCAGG AGTTTGACGC GCTGATCGCC GGGCAGAACT TCCTGTGGGA TATCCGCTAC GCCCTGCACC TGCTGGCTGG CCGTCGCGAG GACCGACTGC TGTTCGACCA CCAGACCCAA CTCGCCGAGC AGTTCGGCTA CCGCGACCGC GAGCACAATC TCGCGGTCGA GCAGTTCATG CAGCGCTACT TCCGTACGGT GATGGAGCTC TCCCGGCTCA ACGAGATGCT CCTGCAGCTG TTCCAGGAAG CCATTCTCTA CGCCGGCGAG CACGAGGCCC CGGTGCTGCT CAACAAGCGC TTCCAGTCCC GGCGCGGGTT TCTGGAAGTC ACCCATAAGA ACGTCTTCCG GCGCTACCCC TTCGCCCTGC TGGAAGTGTT CCTGGTGATG CAGCAGCACC CGGAACTGAA AGGGGTGCGC GCCTCCACCA TCCGGCTGAT CCGCCAATAC CGCAATCTGA TCGACCAGCG CTTTCGCAAG GACCTGCGCG CCCGCAGCCT GTTCATGGAG ATCATGCGCC AGCCCCGCGG GCTCACCCAC GAGCTGCGCC GAATGCACCG CTACGGGGTG CTGGGCCGCT ACATCCCGGC CTTCGGTCAA ATCACCGGGC GCATGCAGTA CGACCTGTTC CATGTCTACA CCGTGGACAG CCACACCCTG TTCGTGGTGC GCAATCTCCG GCGCTTCGCC ATCCCCCGGC ACGCCCACGA GTTCCCGCAG TGCAATACCA TCATGGCGCG GCTGCCCAAG CCGGAGCTGC TCTACCTGGC GGCGCTGTTC CACGACATCG CCAAAGGGCG CGGCGGCGAC CACTCCGAGC TCGGCGCGCG GGATGCCTAC GATTTCTGCA TCCAGCACGG GCTGGGGGCG TACGATGCCC GCCTGGTGAG CTGGCTGGTG CGCAGCCACC TGCTGATGTC CATGACCGCC CAGCGCAAGG ACATCTCCGA CCCGGAGGTG ATCACTGAGT TCGCCCGGCG GGTGGGCGAC CAGATCCACC TGGACTACCT CTATCTGCTC ACCGTGGCGG ACATCCGCTC CACCAACCCG 
TCGCTGTGGA ACTCCTGGCG GGACGCGCTG CTGGCCGAGC TCTATGTGCT CACCAAGCGG GCCCTGCGCC GGGGTCTGGG CAACCCCATC GACAAACGGG AGCTGATCAA CGAGACCCAG GCCCAGTCCC GCCGCCGGCT GCGTCAGCGC GGCCTCCACC ACATGACCGT GCGCGCCATC TGGCGCCGGC TGGCGGACGA CTACTTCCTG CGCCACTCGG CCGAGGAGAT CGCCTGGCAC ACCGAGGCCA TCGCCGCCGC CCGGCCAGAG GACCTGCCCA TCGTTTTGGT GAAACAGCGC GGTCCCCGTG GCGGAACCGA GTTGTTCATC TACGCCCGGG ACAACCGCTA CGTGTTCGCC CGCACGGTCT CCACCCTGGA TCGGCTGGGG TTGAACATCC AGGACGCGCG GATCATCACC ACCGACCAGG GCTACACCCT GGACAGCTAT CTGGTGCTGG AGGACAACGG CGAGCCGGTT ACTGACGAGG GCCGTTGCCG GGAGATGGTC GAGCGCCTGC GCACCAGCCT GGCCGATGCC CACCGGCCAC CGGACTTGGC CGAGCACCGC CTGCCGCGGC GGCTCAAGCA CTTTTCCACG CCCACACAGA TCAACTTCAG CACCGACGGG CCCAACCAGC GCACCGTGCT GGAACTGATC ACCGGCGACC GGCCCGGCCT GCTGGCCCAG GTGGGTCAGG CCTTCAGTCA GTGCCGGGTC AAGCTGAAGA ACGCCAAGAT CGCCACCATC GGCGAGCGGG CCGAGGACGT CTTCTTTATC ACCGACGACC AGGATGAACC GCTGGCCGAC CCAGTGCAGT TCCGCTGTCT GCGCGACGTC CTGTCCGACT GCCTGGAGAA TACCGGCGAG GGCACCTCCC AATGA
|
Protein sequence | MTVHTAVTAG ELFDAHALDT ALERGEPAIP ALRRALEEGD RALAGQFHAG TPAAELVPRR AYLMDRIIRR LWAQHLQHAE DRCALVAVGG YGRGELHPGS DVDVMILIEP DAREALGDAL EGMITALWDL GLEVGHSVRS IEDCVREAEM DITVATNLME ARLLCGSAAL FQTMRQVTGP DRLWPSRAFF EAKWAEQVAR HQKYHDTAYN LEPNIKSSPG GLRDIQMVGW VAKRHFRANT LRALVDQRFL TEQEFDALIA GQNFLWDIRY ALHLLAGRRE DRLLFDHQTQ LAEQFGYRDR EHNLAVEQFM QRYFRTVMEL SRLNEMLLQL FQEAILYAGE HEAPVLLNKR FQSRRGFLEV THKNVFRRYP FALLEVFLVM QQHPELKGVR ASTIRLIRQY RNLIDQRFRK DLRARSLFME IMRQPRGLTH ELRRMHRYGV LGRYIPAFGQ ITGRMQYDLF HVYTVDSHTL FVVRNLRRFA IPRHAHEFPQ CNTIMARLPK PELLYLAALF HDIAKGRGGD HSELGARDAY DFCIQHGLGA YDARLVSWLV RSHLLMSMTA QRKDISDPEV ITEFARRVGD QIHLDYLYLL TVADIRSTNP SLWNSWRDAL LAELYVLTKR ALRRGLGNPI DKRELINETQ AQSRRRLRQR GLHHMTVRAI WRRLADDYFL RHSAEEIAWH TEAIAAARPE DLPIVLVKQR GPRGGTELFI YARDNRYVFA RTVSTLDRLG LNIQDARIIT TDQGYTLDSY LVLEDNGEPV TDEGRCREMV ERLRTSLADA HRPPDLAEHR LPRRLKHFST PTQINFSTDG PNQRTVLELI TGDRPGLLAQ VGQAFSQCRV KLKNAKIATI GERAEDVFFI TDDQDEPLAD PVQFRCLRDV LSDCLENTGE GTSQ
|
| |