Gene Mlg_2102 details


Gene Information

Locus tag: Mlg_2102
Symbol:
ID: 4270080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2382428
End bp: 2385214
Gene Length: 2787 bp
Protein Length: 928 aa
Translation table: 11
GC content: 68%
IMG OID: 638126858
Product: ribonucleoside-diphosphate reductase, alpha subunit
Protein accession: YP_742934
Protein GI: 114321251
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0209] Ribonucleotide reductase, alpha subunit
        [COG1328] Oxygen-sensitive ribonucleoside-triphosphate reductase
TIGRFAM ID: [TIGR02504] ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent
            [TIGR02506] ribonucleoside-diphosphate reductase, alpha subunit
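
The coordinate and length fields listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2382428, 2385214)
print(length)                  # 2787
print(protein_length(length))  # 928
```

Both results match the Gene Length (2787 bp) and Protein Length (928 aa) fields of this record.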


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.503892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.151271
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCA AGGCTGCCGA GGCCGCACCA ACGGCCCAGA GCGATACCCA GCAGGTCATC 
CGCCGCAACG GCGCGCTGAC CGCCTTCGAC CCCGACAAGA TCCAGCTGGC CATGAAGAAG
GCCTTCCTGG CCGTGGAGGG CGAGCGATCG GCCGATGCCG CCCGCATCCA GCAGGTCACC
GCCGAACTCA CCGCGCAGGT GGTCCAGGCC CTGACCCGGC GCCCCACCGC CACCCCGATC
CACATCGAGG ATATCCAGGA CCAGGTGGAG CTGGCGCTGA TGCGCGCCGG CGAGCACAAG
GTGGCCCGCG CCTACGTACT CTACCGCGAG GAACACGCTC GCAAGCGCCG CCAGGAGGCC
ACGGACGAGC CGCCGCGGCT GCACATGACC CGCACCGACG GTGAGCAGGT ACCGCTGGAT
GAGTCGCTGC TGCGCCGGGT GCTGCACCAC GCCTGCCATG AGCTGGCGGA TACCGACCCC
GAGCGCGTGG CCCGCGAGGC CTGGCGGAAC CTCTACGACG GGGTCACCGA GCAGGAGGTG
CACAAGGCGC TGATCCTGAG CGCCCGCAGC CTGATCGAGC AGGAGCCGGC GTATGGCCAC
GTGGCCGCGC GCCTGCTGCA ACACCAGCTC AACGGCGAGG CCCTGCGCTT TCTCGATTTT
CCCTACGACG GCGCACCCGG CGCCGACCCT GGCTACACCG ACTACTTCGC CCGCTACATC
CGGCGCGGGG TGGAACTGGA ACTGCTGGAC GAACAGCTGC TGACCTTCGA CCTGGAGCGG
CTGGCCGAGG CCCTGATGCC GGAGCGCGAC CTGCAGTTCA ACTACCTGGG CCTGCAGACC
CTGTACGACC GCTACCTGCA GCACTGGGAC GGCACCCGCT TCGAGCTGCC CCAGGCCTTC
TTCATGCGCG TGGCCATGGG CCTGACCCTG CAGGAGGTGG AGCGCGAGGA GCGGGCCATC
GAGTTCTACC GGCTGCTCTC CAGCTTCGAC TTCATGAGTT CCACCCCCAC GCTGTTCAAT
AGCGGCACCC GCCGCCCGCA GCTCTCCAGC TGCTACCTGA CCAGCGTGCC CGACGACCTG
GGCGGTATCT ACGGCGCCAT CCGCGACAAT GCCCTGCTGT CCAAGTTTGC CGGCGGCCTG
GGCAACGACT GGACCCGCGT TCGGGCCATG GGCGCCCACA TCAAGGGCAC CAACGGCCGC
TCCCAGGGCG TGGTGCCCTT CCTCAAGGTG GCCAGCGACA CCGCCGTGGC GGTGAACCAG
GGGGGCAAGC GCAAGGGTGC GGTCTGCGCC TACCTGGAGA CCTGGCACCT GGACGTGGAG
GAGTTCCTGG AGCTGCGCAA GAACACCGGC GACGACCGCC GCCGCACCCA CGACATGAAC
ACTGCCCACT GGGTGCCCGA CCTGTTCATG CAGCGGGCCG AGGCCGACGC CGACTGGACC
CTCTTCTCGC CGGACGATGC CGCTCACCTG CACGAGCTTT ACGGCCAGGC GTTCAAGGCC
GCCTACGAAG ACCTGGAGGC CCGGGCGGCG CGCGGTGAGA TCCGCAACTA CAAGGTGGTC
TCCGCCAAGC AGCTCTGGCG CCGCATGCTG GGCATGCTGT TCGAGACCGG CCACCCCTGG
ATCACCTTCA AGGACCCGTG CAACCTGCGC TCCCCGCAGC AGCACGCGGG TGTGGTGCAC
AGCTCCAACC TGTGCACGGA GATCACCCTG AACACCTCGG ACGAAGAGAT CGCCGTCTGC
AACCTGGGCT CGGTGAACCT CGCGGCCCAT ACCACCCCCG ATGGTCTGGA CCACGAGCGG
CTGCGCAACA CGGTCCGCAC CGCCATGCGC ATGCTGGATA ACGTCATCGA TATCAACTAC
TACAGCGTGC CCCAGGCCCG CCGCGCCAAC CTGCGCCACC GCCCGGTGGG GCTGGGCGTG
ATGGGGTTCC AGGACGCGCT TTACGCCCAG GACCTGCCCT ACGCCAGCGA CGAGGCGGTC
GCCTTCGCCG ACCGCAGCCA GGAGGCGATC AGCTACTACG CCATCGAGGC CTCGGCGGAC
CTGGCGCGGG AACGGGGGGC CTACCCCAGT TTCGAGGGCT CGCTCTGGCA GCGCGGTGAG
CTGCCGCTGG ATTCCATTCA GCGGGTGGTG GAGGCACGGG ATGGCGATTG CACCATGGAC
ACCTCGTCCA GCCTGGACTG GGCCGCCCTG CGGGAGAAGG TGCGCACCGG CATGCGCAAC
TCCAACTGCC TGGCGATCGC CCCCACGGCC ACTATCGCCA ACATCGTCGG GGTCTCTCAG
GGGATCGAGC CGGCGTTCAA GAACCTGTAC GTCAAATCCA ACCTCTCCGG CGAGTTCACC
GTGGTGAACC CGGCCCTGGT CCGGGCGCTG AAGGCGTACG GCTTGTGGGA TGCGGTGATG
GTGAATGACC TGAAGTATTA CGACGGCAGC GTGCAGCCCA TCGGCCGTGT ACCGGAGGAA
CTGAAACAGC GCTTCGCCAC CGCCTTCGAA CTGGACTCGG AGTGGCTGGT CCAGGCCGGC
AGCCGGCGGC AGAAGTGGCT GGACCAGTCC CAGTCGCTGA ACCTCTACAT GGCCGAGCCC
TCGGGGCCGA AGCTGGATGC GCTCTACCGC CAGGCCTGGC GCTTGGGGCT GAAGACCACC
TACTACCTGC GCAGCACCGG GGCCACCCAG GTGGAGAAGA GCACCATGGA CCCGGCGCGG
GCCAACCGGT TGAACGCGGT GAGCGCGGCG CCGGGGGGTG GGCAGAGCTG TTCGGTTGAT
GATCCGGAGT GTGAGGCGTG TCAGTAG
 
Protein sequence
MSAKAAEAAP TAQSDTQQVI RRNGALTAFD PDKIQLAMKK AFLAVEGERS ADAARIQQVT 
AELTAQVVQA LTRRPTATPI HIEDIQDQVE LALMRAGEHK VARAYVLYRE EHARKRRQEA
TDEPPRLHMT RTDGEQVPLD ESLLRRVLHH ACHELADTDP ERVAREAWRN LYDGVTEQEV
HKALILSARS LIEQEPAYGH VAARLLQHQL NGEALRFLDF PYDGAPGADP GYTDYFARYI
RRGVELELLD EQLLTFDLER LAEALMPERD LQFNYLGLQT LYDRYLQHWD GTRFELPQAF
FMRVAMGLTL QEVEREERAI EFYRLLSSFD FMSSTPTLFN SGTRRPQLSS CYLTSVPDDL
GGIYGAIRDN ALLSKFAGGL GNDWTRVRAM GAHIKGTNGR SQGVVPFLKV ASDTAVAVNQ
GGKRKGAVCA YLETWHLDVE EFLELRKNTG DDRRRTHDMN TAHWVPDLFM QRAEADADWT
LFSPDDAAHL HELYGQAFKA AYEDLEARAA RGEIRNYKVV SAKQLWRRML GMLFETGHPW
ITFKDPCNLR SPQQHAGVVH SSNLCTEITL NTSDEEIAVC NLGSVNLAAH TTPDGLDHER
LRNTVRTAMR MLDNVIDINY YSVPQARRAN LRHRPVGLGV MGFQDALYAQ DLPYASDEAV
AFADRSQEAI SYYAIEASAD LARERGAYPS FEGSLWQRGE LPLDSIQRVV EARDGDCTMD
TSSSLDWAAL REKVRTGMRN SNCLAIAPTA TIANIVGVSQ GIEPAFKNLY VKSNLSGEFT
VVNPALVRAL KAYGLWDAVM VNDLKYYDGS VQPIGRVPEE LKQRFATAFE LDSEWLVQAG
SRRQKWLDQS QSLNLYMAEP SGPKLDALYR QAWRLGLKTT YYLRSTGATQ VEKSTMDPAR
ANRLNAVSAA PGGGQSCSVD DPECEACQ
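
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the sequence text has been copied into a Python string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three ten-base blocks of the gene sequence (illustrative input only).
example = "ATGTCCGCCA AGGCTGCCGA\nGGCCGCACCA"
seq = clean_sequence(example)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 2))  # 0.7
```

Applied to the full gene sequence, the same two helpers give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.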