Gene Mlg_1430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1430 
Symbol 
ID4269240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1634203 
End bp1637424 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content70% 
IMG OID638126186 
Productribonuclease E 
Protein accessionYP_742269 
Protein GI114320586 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID[TIGR00757] ribonuclease, Rne/Rng family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.260181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAA TGCTGATCAA CGCAACTCAG CCAGAGGAGT TGCGTGTTGC CCTCGTGGAC 
GGGCAACAGC TCTACGACCT GGATATCGAA ACCCCGGCCC GGGAGCAGAA GAAGGCCAAC
ATCTACAAGG GCACCATCAC CCGCGTGGAG CCCAGCCTGG AGGCGGCCTT CGTACAGTAC
GGCGCGGAGC GCCACGGGTT TCTGCCGTTT AAGGAGATTG CCCCCAGCTA CTACCGCGAG
GGTCTCAAGG CCGACGGGGG GCGCCCCAGC ATCCGCGATG CGGTGCGTGA GGGCCAGGAG
GTGGTGGTTC AGGTGGACAA GGAGGAGCGC GGCACCAAGG GTGCGGCGTT GACCACCTAC
ATCAGCCTGG CCGGGCGCTA CCTGGTGCTC ATGCCCAACA ACCCGCGCGC CGGCGGCGTC
TCGCGCCGCA TTGAGGGCTC CGACCGGGCC GAAATCCGCG AGGCGCTGCG ACAACTCGAC
GTCCCGGAGG GCATGGGTCT GATCGTCCGC ACCGCCGGCG TGGGCCGGAC GGTGGAGGAA
CTGCAGTGGG ACCTCGATTA CCTGCTCAAG CTCTGGAGCG CTATCCAGAA GGCCGCCGAG
GCCCGGTCCG CCCCCTTCCT GATCTACCAG GAGAGCGACG TCATCATTCG CGCGCTGCGC
GACTACCTGC GCTCGGATAT CGGGGAGATT CTCATCGATG ACCCGCAGGT CTTTGAAAAG
GCCGTGGAGT TCGTCGAGCA GGTCATGCCG TATAACCGGC AGAAGCTGAA GTACTACGAC
GACCGGGTAC CGCTGTTCAC CCGTTACCAG ATCGAAAGCC AGATCGAATC CGCCTTCCAG
CGGGACGTTC GTCTGCCCTC CGGCGGTGCC ATCGTGATCG ACCACACCGA GGCGCTGATC
TCCATCGACA TCAACTCGGC CCGGGCCACC AAAGGCTCGG ACATCGAGGA GACGGCCCTG
CACACCAACC TCGAAGCCGC CGACGAGATC GCCCGGCAGC TCCGATTGCG GGATCTGGGC
GGGCTGATCG TGATCGACTT CATTGACATG GGCCCGAACC GAAACCAGCG GGAGGTGGAA
AACCGACTCC GCGAGGCGGT CAAGGCGGAC CGAGCCCGGG TTCAGATCGG GCGAATCTCC
CGCTTTGGTT TGCTGGAGAT GTCCCGCCAG CGGCTTCGTT CGTCCCTGGG CGAGTCGCAC
CAGGAGGTCT GCGCCCGCTG TGGTGGCCAG GGTACCGTGC GCAGCGTGGA GTCGCTGGCC
CTTTCCATCC TGCGCATCGT CGAGGAAGAG GCGATGAAGG AGAAGACCGG GACGGTGCTG
GCGCAGCTCC CGGTGGATGT CGCCACCTTC CTTTTGAACG AGAAACGGCC CTCCATCAGT
GAGATCGAGG ATCGCCACAG GGTGGCCGTG GTGCTGATCC CCAATCGCAC CATGGAGACA
CCCCACTACG ACGTCCAGCG CCTGCGTAGC GACGACGAGT CCACCGAGGA CCCCAGTTAC
CGGCTCGCCT CCGAGGAACA TCCAGAGCCC TCCCCCGAGT GGTTGGAGCG GGAGGCCGCG
CCGCGCTCGG AGGAACCGGT GGTCCAGCGG GTACAGCCGC CCTCCGCACC GCCGCCGACG
GCGCCCGTCG AAGAGAGTCC AAAGGAAGAA GCGCCACCAC CGCAGCGCGA CGCGCCCGAA
GGTAGCGGCG CATCGGACGT CGGGATCAGC GGCCTCACCG GCCTGTTCCA GCGGCTCCGT
GGCTTCTTTG GGCGGGAAGC GGACAACGAC AGCGCCGGTC AGGAGGGTCC AGTCGGCGAG
AAGGCCCCTG CTGCCTCCGG GCGACGCGGC AGCGATCAGA AGGGAGCCAC CAACGGGCGC
CGGGGCGGTG GTAGTAACGG CGGCAAGGGT GGCCGGACAC GGCAAAAGGC GCGCCCCGAG
GACGGTCGAT CGGCCTCGCA ACCGGAGGCT ACCGGCAAGC CGGAGAGTGA TGCCCGGCGC
GATGCGGATG AGACCGCCGA GGGCGCCGGT CGCAGCGGTC GCAGCCGGCG CGGCCGGCGG
GGCGGACGGC GCCGCCGGCG TAGCGGGAGC ACAGGCGGTG GCCAGCCGGA GGAATCGGCC
GCACGCCAGG ATCAGAAGGG GCAGACCCCG CCGGCGGCAG AGGCCGCGGA CGACGCTCCC
CGGGATCGCG CCGCCGGCAA GGCCGAGGCC GCCGCCGAGC GGCCCAGGGA GGGAGACCGC
GCGGCGGCCG AGAAACCGGC GGCCAGCGAA AAGGTAAAGG CTGACAAGCC GGCGCTGCCA
ACGATCACCG AGGAGGAGAT CCAGGGCAGC GCCACCCCGC AGCTCAAGGA CTGGCCACCG
CGGGGCCAGG TCGCCACCGA GGCCGCCAGT CCCGAATCGG CGCCGGCGGA CGACGAGAGC
GGCACCGCCG CCGGGTCGCC GAAGGCCACG CAGGCCCAAC GACCGGCGAC GGCCGACACC
GAGGCCTCCG ACACCAAGGC AGATGCGCCG GCGTCCTCCG GTGAGACGCC CGCCGACCAA
CCGACCGGTG CGCCACCGGC GGCTGGCGGT GACGGGGCCA AGGCGCCGAC CGCTGAGAAG
GCCGCTGGCG AGAAGCCGGG GGCGCGGCGC AAGCGCTCGA AGCCCAGTGT CCAACCGGGC
ACGCCGCCGG TCCTGCCGCA GGATATCGGC CCGGATGAAC CGTCGGTGCG CTCCGGGCGC
CCGCGTATCA GCGCAGCGGC CCCCACCGAG GGCAACGAGG CCGACACCGC CGGCTCGGCT
GACGTCGCCC CGCGTGAGCC GACACCGGCT AACACGGGGA CCGATCAGCC ACCGAAACCG
GCGCCGGAAC CAGCGCCGGC CAAGGCGCCG GAGCCGGCCC CGGGGTCCGA TAGTGAGCCA
GCGGCCGCCG ACCAGGACGC CGCGCCTGCC GAGGAGGTGC CCAACGCCGC CGACGACGGT
GCTGCCGGGC CACAGGCAAC CCCTTGGCTG GAGCACGAGC CTGAGAGCAG TCGGGAGGCC
ATTGGCGCCC ATAGCGCGGG TACGCCGGAC GCTCCTGATG CGCAGAAGGC CCCGGCCGAC
ACCGCGGATG CCGCGGACAC CGCGGGCGAC GAAGGCGGGG ATCAGACGCA GGAGACCGAT
GAGGCCTCCC GAAAGGGCGG TTCCCGGAGA AAGGGGCCGG CCGGGGAGAG TGAGGCGCGG
CCCGAATCCC CGGACCAGGG CAGGTCACGA GAGGATGACT GA
 
Protein sequence
MKRMLINATQ PEELRVALVD GQQLYDLDIE TPAREQKKAN IYKGTITRVE PSLEAAFVQY 
GAERHGFLPF KEIAPSYYRE GLKADGGRPS IRDAVREGQE VVVQVDKEER GTKGAALTTY
ISLAGRYLVL MPNNPRAGGV SRRIEGSDRA EIREALRQLD VPEGMGLIVR TAGVGRTVEE
LQWDLDYLLK LWSAIQKAAE ARSAPFLIYQ ESDVIIRALR DYLRSDIGEI LIDDPQVFEK
AVEFVEQVMP YNRQKLKYYD DRVPLFTRYQ IESQIESAFQ RDVRLPSGGA IVIDHTEALI
SIDINSARAT KGSDIEETAL HTNLEAADEI ARQLRLRDLG GLIVIDFIDM GPNRNQREVE
NRLREAVKAD RARVQIGRIS RFGLLEMSRQ RLRSSLGESH QEVCARCGGQ GTVRSVESLA
LSILRIVEEE AMKEKTGTVL AQLPVDVATF LLNEKRPSIS EIEDRHRVAV VLIPNRTMET
PHYDVQRLRS DDESTEDPSY RLASEEHPEP SPEWLEREAA PRSEEPVVQR VQPPSAPPPT
APVEESPKEE APPPQRDAPE GSGASDVGIS GLTGLFQRLR GFFGREADND SAGQEGPVGE
KAPAASGRRG SDQKGATNGR RGGGSNGGKG GRTRQKARPE DGRSASQPEA TGKPESDARR
DADETAEGAG RSGRSRRGRR GGRRRRRSGS TGGGQPEESA ARQDQKGQTP PAAEAADDAP
RDRAAGKAEA AAERPREGDR AAAEKPAASE KVKADKPALP TITEEEIQGS ATPQLKDWPP
RGQVATEAAS PESAPADDES GTAAGSPKAT QAQRPATADT EASDTKADAP ASSGETPADQ
PTGAPPAAGG DGAKAPTAEK AAGEKPGARR KRSKPSVQPG TPPVLPQDIG PDEPSVRSGR
PRISAAAPTE GNEADTAGSA DVAPREPTPA NTGTDQPPKP APEPAPAKAP EPAPGSDSEP
AAADQDAAPA EEVPNAADDG AAGPQATPWL EHEPESSREA IGAHSAGTPD APDAQKAPAD
TADAADTAGD EGGDQTQETD EASRKGGSRR KGPAGESEAR PESPDQGRSR EDD