Gene Mlg_1944 details

Gene Information

Locus tag: Mlg_1944
Symbol:
ID: 4268112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2210799
End bp: 2212961
Gene Length: 2163 bp
Protein Length: 720 aa
Translation table: 11
GC content: 66%
IMG OID: 638126698
Product: polynucleotide phosphorylase/polyadenylase
Protein accession: YP_742776
Protein GI: 114321093
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase)
TIGRFAM ID: [TIGR03591] polyribonucleotide nucleotidyltransferase
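
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (the listed gene sequence does end in TGA). The function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_length = span_length(2210799, 2212961)          # 2163
protein_length = expected_protein_length(gene_length)  # 720
print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2163 bp) and Protein Length (720 aa) fields.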


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0229407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCCGGG TGTCCCGCAG GGGCGGGGCA CCTCGTCAGT CATTTCAAGA GCAGCCGGAT 
ATTATCGTGA AATCAGTCAA GAAGAGCTTT CAGTACGGAA ACCACACGGT CACTCTGGAA
ACCGGTGGCG TCGCACGGCA GGCCGATGGC GCCGTGCTGG TCAACATGAG TGATACGGTG
GTGCTCGTCA CCGCTGTCGG TCGCAAGGAG GCGGACCCGG GCAAGGGCTT CTTCCCCCTG
ACCGTCAATT ACCAGGAGCG GACCTATGCG GCGGGCAAGA TCCCGGGAGG CTTCTTCAAG
CGTGAGGGCC GCCCCTCCGA GAAGGAGACC CTCACCTGCC GTCTGATCGA CCGGCCCATC
CGGCCGCTGT TCCCGGAGGG GTTCTATAAC GAGGTGCAGG TGGTGGCCAC CGTGCTCTCC
ATGAACCCTG AGGTGGATGC CGACATCCCG GCATTGATCG GGGCCTCCGC GGCACTGTCT
ATTTCCGGTA TCCCCTTCGA TGGCCCCATC GGCGCCGCCC GTGTTGGCTA TAAGGACGGC
GAGTACCTGC TGAATCCCAC CTTCGAGGAG ACCGCCGCCT CCGACCTGGA CCTGGTGGTC
GCGGGCACGG AGAACGCCGT GCTGATGGTG GAGTCGGAGG CCAACCAGCT CCCCGAGGAG
GCCATGCTTG GCGCCGTGCT GTACGGCCAC GAGCAGATGC AGGTGGCTAT CCAGGCGATC
AACGAGCTCA CCGCCGAGGC GGGCAAGCCG CGATGGGACT GGCACCCGCC GCAAGGCGAC
GCTGCCCTGG AGACGGCGAT CAAGGACCTG GTGGGCGACG ACCTGGCCGC CGCCTACCAG
ATCCCGGAAA AGCAGGAGCG CCAGAACCGG ATCGGCGAAC TGCGGCAGCG GGCCGTCGAG
GCGCTGGGTG AGAACCGTGA GGAAGAGGGC GGTTGGCCCG AGAAGGACGT GGGCGACGCC
TTTAAGGGGC TGGAGAAGGA CATCGTCCGC GGGCGCATCC TGGCGGGTGA GCGCCGTATC
GACGGGCGGG ATACCCGGAC CGTCCGGCCC ATCGACATCG AGGTGGGGAG CCTGCCGCGT
ACGCACGGTT CGGCGATCTT TACCCGCGGC GAGACCCAGG CTGTGGTGGT GACCACCCTC
GGGACCGGCC GTGATGCCCA GATCATCGAT GCCATCGAGG GCGAGCGCAA AGAGCAGTTC
ATGCTGCACT ACAACTTCCC GCCCTACTGT GTGGGCGAGA CCGGCTTCAT GGGCACGCCC
AAGCGCCGCG AGATCGGTCA CGGTAAGCTC GCCAAGCGGG GCATTGAAGC GGTCATGCCG
GCCGCGGACG ATTGCCCCTA CGTGATCCGC GTGGTCTCCG AGATCACCGA GTCCAACGGC
TCCTCCTCCA TGGCCACCGT CTGCGGCACC TCCCTGTCGC TGATGGACGC CGGCGTGCCA
GTGAAAGCAC CGGTGGCCGG TATCGCCATG GGCCTGATCA AGGAGGACGA GCAGTTCGCC
GTGCTCTCCG ACATCCTCGG CGATGAGGAC CACCTGGGCG ACATGGACTT CAAGGTCGCC
GGGACCGAGA GCGGCGTGAC CGCGCTGCAG ATGGACATCA AGATCCAGGG GATTACCCGC
GAGATCATGG AGCAGGCGCT GGAGCAGGCC CGGGAAGGCC GCCTGCACAT CCTTGGTGAG
ATGAACAATG CCATCAGCGG CCCGCGGAGC GAGATGTCCG AGTACGCTCC GCGCCTGCTC
ACCATCCGGA TCGACCCGGA CAAGATCCGT GATGTCATCG GCAAGGGTGG CGCCACCATT
CGCGCGTTGA CCGAGGAGAC CGGCACCACT ATCGACATCT CCGACGATGG CAAGGTGACC
ATCGCCTCCG CGGACAAGGC CGCGGCCGAC GAGGCCCGCC GGCGCATCGA GCTGCTCACC
GCCGACGTGG AGGTGGGGAC GGTCTACGAG GGGAAGGTCT CGAAGCTGAT GGATTTCGGC
GCCTTCGTCA ACATCCTGCC CGGCCGGGAT GGCCTGGTGC ACATCTCCCA GATCTCCAAC
GAGCGCGTGG AGCGGGTGGG TGACTACCTC AAGGAAGGTG ACACCGTGCG CGTCAAGGTG
CTGGAGGTGG ACCGCCAGGG CCGTATCCGG CTGAGCATGA AGGCGGTGCA GGACGGCGAG
TGA
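
A GC content figure such as the one in the Gene Information fields is the fraction of G and C bases in the sequence. The sketch below shows one way to compute it from a block formatted like the one above; it assumes whitespace is only a layout aid and that the sequence contains no ambiguity codes. The function name is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input, not the gene above: 4 of 6 bases are G or C.
print(round(gc_content("GGCC AT") * 100))
```

Passing the full gene sequence as a single string would give the value to compare against the reported percentage.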
 
Protein sequence
MRRVSRRGGA PRQSFQEQPD IIVKSVKKSF QYGNHTVTLE TGGVARQADG AVLVNMSDTV 
VLVTAVGRKE ADPGKGFFPL TVNYQERTYA AGKIPGGFFK REGRPSEKET LTCRLIDRPI
RPLFPEGFYN EVQVVATVLS MNPEVDADIP ALIGASAALS ISGIPFDGPI GAARVGYKDG
EYLLNPTFEE TAASDLDLVV AGTENAVLMV ESEANQLPEE AMLGAVLYGH EQMQVAIQAI
NELTAEAGKP RWDWHPPQGD AALETAIKDL VGDDLAAAYQ IPEKQERQNR IGELRQRAVE
ALGENREEEG GWPEKDVGDA FKGLEKDIVR GRILAGERRI DGRDTRTVRP IDIEVGSLPR
THGSAIFTRG ETQAVVVTTL GTGRDAQIID AIEGERKEQF MLHYNFPPYC VGETGFMGTP
KRREIGHGKL AKRGIEAVMP AADDCPYVIR VVSEITESNG SSSMATVCGT SLSLMDAGVP
VKAPVAGIAM GLIKEDEQFA VLSDILGDED HLGDMDFKVA GTESGVTALQ MDIKIQGITR
EIMEQALEQA REGRLHILGE MNNAISGPRS EMSEYAPRLL TIRIDPDKIR DVIGKGGATI
RALTEETGTT IDISDDGKVT IASADKAAAD EARRRIELLT ADVEVGTVYE GKVSKLMDFG
AFVNILPGRD GLVHISQISN ERVERVGDYL KEGDTVRVKV LEVDRQGRIR LSMKAVQDGE
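
The protein begins with M although the gene begins with GTG, because translation table 11 accepts GTG as an alternative start codon and an initiating codon is translated as methionine. A minimal sketch of that translation follows. It builds the codon table in the standard NCBI TCAG order, and the start-codon set is the one I understand table 11 to define, so treat it as an assumption. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, reading the first codon as Met and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCGCCGGG TGTCCCGCAG GGGCGGGGCA"))  # MRRVSRRGGA
```

The output matches the first ten residues of the listed protein sequence.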