Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2102 |
Symbol | |
ID | 4270080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2382428 |
End bp | 2385214 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126858 |
Product | ribonucleoside-diphosphate reductase, alpha subunit |
Protein accession | YP_742934 |
Protein GI | 114321251 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0209] Ribonucleotide reductase, alpha subunit [COG1328] Oxygen-sensitive ribonucleoside-triphosphate reductase |
TIGRFAM ID | [TIGR02504] ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent [TIGR02506] ribonucleoside-diphosphate reductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.503892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.151271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCA AGGCTGCCGA GGCCGCACCA ACGGCCCAGA GCGATACCCA GCAGGTCATC CGCCGCAACG GCGCGCTGAC CGCCTTCGAC CCCGACAAGA TCCAGCTGGC CATGAAGAAG GCCTTCCTGG CCGTGGAGGG CGAGCGATCG GCCGATGCCG CCCGCATCCA GCAGGTCACC GCCGAACTCA CCGCGCAGGT GGTCCAGGCC CTGACCCGGC GCCCCACCGC CACCCCGATC CACATCGAGG ATATCCAGGA CCAGGTGGAG CTGGCGCTGA TGCGCGCCGG CGAGCACAAG GTGGCCCGCG CCTACGTACT CTACCGCGAG GAACACGCTC GCAAGCGCCG CCAGGAGGCC ACGGACGAGC CGCCGCGGCT GCACATGACC CGCACCGACG GTGAGCAGGT ACCGCTGGAT GAGTCGCTGC TGCGCCGGGT GCTGCACCAC GCCTGCCATG AGCTGGCGGA TACCGACCCC GAGCGCGTGG CCCGCGAGGC CTGGCGGAAC CTCTACGACG GGGTCACCGA GCAGGAGGTG CACAAGGCGC TGATCCTGAG CGCCCGCAGC CTGATCGAGC AGGAGCCGGC GTATGGCCAC GTGGCCGCGC GCCTGCTGCA ACACCAGCTC AACGGCGAGG CCCTGCGCTT TCTCGATTTT CCCTACGACG GCGCACCCGG CGCCGACCCT GGCTACACCG ACTACTTCGC CCGCTACATC CGGCGCGGGG TGGAACTGGA ACTGCTGGAC GAACAGCTGC TGACCTTCGA CCTGGAGCGG CTGGCCGAGG CCCTGATGCC GGAGCGCGAC CTGCAGTTCA ACTACCTGGG CCTGCAGACC CTGTACGACC GCTACCTGCA GCACTGGGAC GGCACCCGCT TCGAGCTGCC CCAGGCCTTC TTCATGCGCG TGGCCATGGG CCTGACCCTG CAGGAGGTGG AGCGCGAGGA GCGGGCCATC GAGTTCTACC GGCTGCTCTC CAGCTTCGAC TTCATGAGTT CCACCCCCAC GCTGTTCAAT AGCGGCACCC GCCGCCCGCA GCTCTCCAGC TGCTACCTGA CCAGCGTGCC CGACGACCTG GGCGGTATCT ACGGCGCCAT CCGCGACAAT GCCCTGCTGT CCAAGTTTGC CGGCGGCCTG GGCAACGACT GGACCCGCGT TCGGGCCATG GGCGCCCACA TCAAGGGCAC CAACGGCCGC TCCCAGGGCG TGGTGCCCTT CCTCAAGGTG GCCAGCGACA CCGCCGTGGC GGTGAACCAG GGGGGCAAGC GCAAGGGTGC GGTCTGCGCC TACCTGGAGA CCTGGCACCT GGACGTGGAG GAGTTCCTGG AGCTGCGCAA GAACACCGGC GACGACCGCC GCCGCACCCA CGACATGAAC ACTGCCCACT GGGTGCCCGA CCTGTTCATG CAGCGGGCCG AGGCCGACGC CGACTGGACC CTCTTCTCGC CGGACGATGC CGCTCACCTG CACGAGCTTT ACGGCCAGGC GTTCAAGGCC GCCTACGAAG ACCTGGAGGC CCGGGCGGCG CGCGGTGAGA TCCGCAACTA CAAGGTGGTC TCCGCCAAGC AGCTCTGGCG CCGCATGCTG GGCATGCTGT TCGAGACCGG CCACCCCTGG ATCACCTTCA AGGACCCGTG CAACCTGCGC TCCCCGCAGC AGCACGCGGG TGTGGTGCAC AGCTCCAACC TGTGCACGGA GATCACCCTG AACACCTCGG ACGAAGAGAT CGCCGTCTGC AACCTGGGCT CGGTGAACCT CGCGGCCCAT ACCACCCCCG ATGGTCTGGA CCACGAGCGG CTGCGCAACA CGGTCCGCAC CGCCATGCGC ATGCTGGATA ACGTCATCGA TATCAACTAC TACAGCGTGC CCCAGGCCCG CCGCGCCAAC CTGCGCCACC GCCCGGTGGG GCTGGGCGTG ATGGGGTTCC AGGACGCGCT TTACGCCCAG GACCTGCCCT ACGCCAGCGA CGAGGCGGTC GCCTTCGCCG ACCGCAGCCA GGAGGCGATC AGCTACTACG CCATCGAGGC CTCGGCGGAC CTGGCGCGGG AACGGGGGGC CTACCCCAGT TTCGAGGGCT CGCTCTGGCA GCGCGGTGAG CTGCCGCTGG ATTCCATTCA GCGGGTGGTG GAGGCACGGG ATGGCGATTG CACCATGGAC ACCTCGTCCA GCCTGGACTG GGCCGCCCTG CGGGAGAAGG TGCGCACCGG CATGCGCAAC TCCAACTGCC TGGCGATCGC CCCCACGGCC ACTATCGCCA ACATCGTCGG GGTCTCTCAG GGGATCGAGC CGGCGTTCAA GAACCTGTAC GTCAAATCCA ACCTCTCCGG CGAGTTCACC GTGGTGAACC CGGCCCTGGT CCGGGCGCTG AAGGCGTACG GCTTGTGGGA TGCGGTGATG GTGAATGACC TGAAGTATTA CGACGGCAGC GTGCAGCCCA TCGGCCGTGT ACCGGAGGAA CTGAAACAGC GCTTCGCCAC CGCCTTCGAA CTGGACTCGG AGTGGCTGGT CCAGGCCGGC AGCCGGCGGC AGAAGTGGCT GGACCAGTCC CAGTCGCTGA ACCTCTACAT GGCCGAGCCC TCGGGGCCGA AGCTGGATGC GCTCTACCGC CAGGCCTGGC GCTTGGGGCT GAAGACCACC TACTACCTGC GCAGCACCGG GGCCACCCAG GTGGAGAAGA GCACCATGGA CCCGGCGCGG GCCAACCGGT TGAACGCGGT GAGCGCGGCG CCGGGGGGTG GGCAGAGCTG TTCGGTTGAT GATCCGGAGT GTGAGGCGTG TCAGTAG
|
Protein sequence | MSAKAAEAAP TAQSDTQQVI RRNGALTAFD PDKIQLAMKK AFLAVEGERS ADAARIQQVT AELTAQVVQA LTRRPTATPI HIEDIQDQVE LALMRAGEHK VARAYVLYRE EHARKRRQEA TDEPPRLHMT RTDGEQVPLD ESLLRRVLHH ACHELADTDP ERVAREAWRN LYDGVTEQEV HKALILSARS LIEQEPAYGH VAARLLQHQL NGEALRFLDF PYDGAPGADP GYTDYFARYI RRGVELELLD EQLLTFDLER LAEALMPERD LQFNYLGLQT LYDRYLQHWD GTRFELPQAF FMRVAMGLTL QEVEREERAI EFYRLLSSFD FMSSTPTLFN SGTRRPQLSS CYLTSVPDDL GGIYGAIRDN ALLSKFAGGL GNDWTRVRAM GAHIKGTNGR SQGVVPFLKV ASDTAVAVNQ GGKRKGAVCA YLETWHLDVE EFLELRKNTG DDRRRTHDMN TAHWVPDLFM QRAEADADWT LFSPDDAAHL HELYGQAFKA AYEDLEARAA RGEIRNYKVV SAKQLWRRML GMLFETGHPW ITFKDPCNLR SPQQHAGVVH SSNLCTEITL NTSDEEIAVC NLGSVNLAAH TTPDGLDHER LRNTVRTAMR MLDNVIDINY YSVPQARRAN LRHRPVGLGV MGFQDALYAQ DLPYASDEAV AFADRSQEAI SYYAIEASAD LARERGAYPS FEGSLWQRGE LPLDSIQRVV EARDGDCTMD TSSSLDWAAL REKVRTGMRN SNCLAIAPTA TIANIVGVSQ GIEPAFKNLY VKSNLSGEFT VVNPALVRAL KAYGLWDAVM VNDLKYYDGS VQPIGRVPEE LKQRFATAFE LDSEWLVQAG SRRQKWLDQS QSLNLYMAEP SGPKLDALYR QAWRLGLKTT YYLRSTGATQ VEKSTMDPAR ANRLNAVSAA PGGGQSCSVD DPECEACQ
|
| |