Gene Mlg_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1471 
Symbol 
ID4269263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1676030 
End bp1679317 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content66% 
IMG OID638126227 
Productribonucleoside-diphosphate reductase, adenosylcobalamin-dependent 
Protein accessionYP_742310 
Protein GI114320627 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0209] Ribonucleotide reductase, alpha subunit 
TIGRFAM ID[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region
[TIGR02504] ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.298183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.181073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAG CAGCCAAGGT GCGCGCCCTG CCGAATGCCA CGGCCGACAT TCCGTTACAG 
AGCGCGTCCG CGGACATCTG GGATAAGAAA TACCGCCTCA AGTCCAAGGA CGGCCAGGTG
ATCGACGAGA CCGTCGATCA CACCTGGCAG CGCGTGGCGC GTGCCCTGGC CGAGGTGGAA
AAGACCAAGA AGCTGCGTGA GCACTGGTAC GAGCGCTTCC TCTGGGCCCT GCGCCGCGGC
GCCATTCCCG CCGGGCGCAT TACCTCCAAC GCCGGTGCCT GGGCGCACAA GCCCGCCACC
TCCACCATCA ACTGCACGGT CTCCGGCACC ATCCAGGACT CCATGGACGA CATCTTGCAG
AAGGTGCACG AGGCGGGCCT GACCCTGAAG GCCGGCTGCG GTATCGGCTA CGAGTTCTCC
ACCCTGCGCC CGCGCGGCGC CTATGTCTCC GGGGCCGGCG CCTACACCTC CGGATCGCTG
TCCTTCATGG ACATCTACGA CCGCATGTGC TTCACCGTCT CCTCCGCCGG CGGCCGCCGC
GGGGCGCAGA TGGCCACCTT CGATGTGGGC CACCCGGACG TGCTCGACTT CATCCGCGCC
AAGCGCGAGG ACGGCCGGCT GCGCCAGTTC AACCTCAGCC TGCTGATCAC CGAGGAGTTC
ATGCAGGCGG TGGAGGATGA TGCCCAGTGG CCGCTGGCCT TCCCCCTCAC CAAGCACGAG
GCGGAGACGG ACAACATCGA CCTCACCGAC AGCGAGCAGG TCATCTGGCG CGAGTGGCCG
ACCACCGAGC CCTATATCAG CGACGAGCAG GGGCTGGTGG CCTGCCGCAT CTACCGCACC
GTCAAGGCCC GGCGGCTGTG GGACATGATC ATGACCTCCA CCTACGACTT CGCGGAGCCG
GGGTTCATCC TGATCGACCG GGTCAACGAG ATGAACAACA ACTGGTTCTG CGAGGACATC
CGCGCCACCA ACCCGTGCGT CACCGCCGAC ACCTGGGTGC AGACCGCCGA CGGGCCGCGG
CGGGTGGCCG CGCTGGTCGG TCGGCGCTTT TCTGCCCGGG TCGACGGCAC GGACCACCCC
AGTGGTGACG AAGGGTTCTT CCGGACGGCG ACCAAAGAGG TAGTCGAGCT GCGTACGGCC
GAGGGCTATC GGCTCAAGCT CACCGCCGAC CACCGTGTAC GCCGGGTCAG TGCCTTTACC
CGTTACCGCA CTGAGACCGA ATGGTGCGAA GCGGGGCAGC TCCAGGCCGG TGATCGCGTC
CTGCTCAACG ACCACCGCAC CAACGCCGAG TGGGCCGGCC GCCATGGCCG GAATGAAGGC
TACCTGATGG GGCTGCTGCT GGGTGACGGC ACCCTCAAAT CGGATAAGGC CGTGCTGTCG
GTCTGGCGCA CCGCGGCTGC GGCCAACGGG GTGCCGGCCG GACTGCCGGG CGGCGTCGAG
GAGGTGATGG CCGAGGGCCT CGAGGCCGCC CGCTCGCTGC CGCACCGGGC CGATTTCACC
GGCTGGTGTG AGGTCCCGGG CCGTGGCGAG TACCGCCTGG CCACCTCCGC GATCCGCGAT
CTCGCCCTTG AGCTGGGCAT ACGGCCAGGC AGCAAGCGGA TCACCGATGG TGTGGAGCAG
GGTTCCAGCG ACTTCTACAA GGGTTTCCTG CGCGGCCTGT ACGACGCCGA CGGCTCCGTC
CAGGGCAGCC AGCGCAAGGG CGTGTCGGTG CGCCTGGCCC AGTCCGACAG CGACAATCTG
GAGGCGGCGC AGCGCATGCT GCTGCGCCTG GGTATCGCCT CCACCCTCTA CCGTGATCGT
CGTCCGGCCG GCAACACCGT GTTGCCCGAC GGCAACGGCG GGAGCGCTGA GTACGCCACC
CGTGCTCAGC ACGAGCTGGT GATCAGTGGT GAGAACCTTG CGCGGTTCCG CGACCTCATC
GGCTTTGCCA ATAGTGACAA GCAGGCACGC CTCGAGGCGC TGCTGGACCG TTACCAGCGC
ACCCTCAATC GCGAGCGGTT TGTGGCCCGC GTGGTGGCGG TGGAGGCCGC CGGCACCGAG
GAGGTCTACG ACGTCCAGGT ACCCGGTATC AACACCTTCG ATGCCAATGG CCTTCATGCC
CACAACTGCG GGGAGCAACC TTTACCCCCC TACGGCGCCT GCCTTTTGGG CTCGGTTAAC
CTGACCAAGT TCGTGCGCGA TCCCTTCACC GAGAAAGCGC GCTTCGACTG GCAGGAGTAT
GCCGAGGTGG TCTCGGTCTT CACCCGCATG CTGGACAACG TGGTAGAGAT CAACGGCCTG
CCGTTGGATG AGCAGCGCCG GGAGATCGAG CGCAAGCGCC GCCATGGCAT GGGCTTTCTC
GGGTTGGGCT CCACCCTGAC CATGCTGCGC ATGAAGTATG GTGAGGCCGA ATCGCTGGCC
TTCACCGAGA AGGTCTCCCG CGAGATGGCC CTGGCCGGCT GGCGCACCGG TGTGGAACTG
GCCGAGGAAA AGGGCCCGGC GCCGATCATG GACGAGACCT ACACCGTGGA CGAGGCCATG
CTGGCCAAGC GCCCGGAGAT GGCCGTGGAC GGCTACAAGG TGGGCGACAA GGTGAAGGGC
CGGGTGCTGC TGGCCAGGTA CAGCCACTAC ATGCAGCGGG TGGCCGAGGT GGACGCCGAG
CTGGTGGAGG CCATCGCCGA GAAGGGCGCG CGCTTCTCCC ACCACACCTC GATCGCGCCC
ACCGGCACGA TCTCACTGTC GCTGGCCAAC AACGCCAGCA ACGGCATTGA GCCGAGCTTT
GCGCATCATT ACTTTCGCAA CGTGATCCGC GAGGGGCGCA AGTCCAAGGA GAAGGTCGAT
GTCTTCTCCT TCGAGCTGCT GGCCTACCGC CACTTCATTG ACCCGGAGGC GGTCCCGTCC
ACCGACCCGG AGACGCGCAA CCTCCCGGAC TACTTCATCA CCGCCGACTC GGTGACCCCG
AAGCAGCACG TGGACATCCA GGCGGCGGCC CAGAAGTGGG TGGACTCCTC CATCTCCAAG
ACCGCCAACG TGCCCACGGA CTACCCCTAT GAGGACTTCA AGGACATCTA CCGCTATGCC
TATCAGCAGG GGCTGAAGGG CTGCACCACC TTCCGCTTCA ACCCCGAGGC GTTCCAAGGG
GTGCTGGTCA CCGAGAAGGA CCTGACCAAT ACCTTCTACA AGTTCAAGCT GGCAGACGGC
ACCGAGGTGG AGTTGCGGGG CGACGAGGAG GTGGAGTACG ACGGCGAGAC CCACACCGCC
GCCAACCTCT ATGACGCCCT CAAGGAAGGC TACTACGGTA AGTTCTGA
 
Protein sequence
MTTAAKVRAL PNATADIPLQ SASADIWDKK YRLKSKDGQV IDETVDHTWQ RVARALAEVE 
KTKKLREHWY ERFLWALRRG AIPAGRITSN AGAWAHKPAT STINCTVSGT IQDSMDDILQ
KVHEAGLTLK AGCGIGYEFS TLRPRGAYVS GAGAYTSGSL SFMDIYDRMC FTVSSAGGRR
GAQMATFDVG HPDVLDFIRA KREDGRLRQF NLSLLITEEF MQAVEDDAQW PLAFPLTKHE
AETDNIDLTD SEQVIWREWP TTEPYISDEQ GLVACRIYRT VKARRLWDMI MTSTYDFAEP
GFILIDRVNE MNNNWFCEDI RATNPCVTAD TWVQTADGPR RVAALVGRRF SARVDGTDHP
SGDEGFFRTA TKEVVELRTA EGYRLKLTAD HRVRRVSAFT RYRTETEWCE AGQLQAGDRV
LLNDHRTNAE WAGRHGRNEG YLMGLLLGDG TLKSDKAVLS VWRTAAAANG VPAGLPGGVE
EVMAEGLEAA RSLPHRADFT GWCEVPGRGE YRLATSAIRD LALELGIRPG SKRITDGVEQ
GSSDFYKGFL RGLYDADGSV QGSQRKGVSV RLAQSDSDNL EAAQRMLLRL GIASTLYRDR
RPAGNTVLPD GNGGSAEYAT RAQHELVISG ENLARFRDLI GFANSDKQAR LEALLDRYQR
TLNRERFVAR VVAVEAAGTE EVYDVQVPGI NTFDANGLHA HNCGEQPLPP YGACLLGSVN
LTKFVRDPFT EKARFDWQEY AEVVSVFTRM LDNVVEINGL PLDEQRREIE RKRRHGMGFL
GLGSTLTMLR MKYGEAESLA FTEKVSREMA LAGWRTGVEL AEEKGPAPIM DETYTVDEAM
LAKRPEMAVD GYKVGDKVKG RVLLARYSHY MQRVAEVDAE LVEAIAEKGA RFSHHTSIAP
TGTISLSLAN NASNGIEPSF AHHYFRNVIR EGRKSKEKVD VFSFELLAYR HFIDPEAVPS
TDPETRNLPD YFITADSVTP KQHVDIQAAA QKWVDSSISK TANVPTDYPY EDFKDIYRYA
YQQGLKGCTT FRFNPEAFQG VLVTEKDLTN TFYKFKLADG TEVELRGDEE VEYDGETHTA
ANLYDALKEG YYGKF