Gene Mlg_2387 details

Gene Information

Locus tag: Mlg_2387
Symbol:
ID: 4269384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2710724
End bp: 2712766
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 69%
IMG OID: 638127145
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_743217
Protein GI: 114321534
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
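
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2710724
END_BP = 2712766
GENE_LENGTH_BP = 2043
PROTEIN_LENGTH_AA = 680


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the interval spans 2043 bp, which is 681 codons, one of which is the stop codon.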


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.900198
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.000000289366
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAGATGC GCCTCTTCAC TGCATGGATG GCCGCCGCGC TGGTCGCGGC ACTGTTGCTG 
CCCCCGCTGC TGAGCGGAGG CCTGGAACCG CCGGAGGACG ACCGGCGGGT GGCCGCGCTC
TCAGCGGAGC GGTTGGCGGC CCTGCCGGCT TCGGCCCGCA CCCTGATCGA GACGGTCCGG
GACAACCCGG ACTGCTGTGT GCCGGGGGAG TTGCTGATCC GGCCGGCCGG GGACGCGGCG
GAGACCGAGC CGCGTATGCG GGCGGCTCAT GCCCTGGTGG GCGCCAATGT GGTCCGGCGC
TTTCGCACCG GTAACGCGGA GCATGTTCGG CTGCCGTCCT GGATGAGCCT GGAGGAGGCC
ATCGCCGCCT ACGCCGAGGA CCCGGCGGTG GCCTATGTCG AGCCCAACTA CCGGGTCTAC
GCCGCCCAGC TCGACCCGGA GCCCGAGTTC TATGACCAGC AATGGAACCT GCCGGCCATC
GATGTGCCGC AGGCCTGGGA CCTGACCACC GGTGACCCGG CGGTGCGACT GGGCCAGGTG
GACACCGAGA TCGATACGGC CCATCCCGAT CTGCAGCAGA ACGTGGCGGG GACCGGTTCC
CAGTTCGACG ATTCTCAGCC GGAGGATCAT GGCACCCATG TCGCCGGCGT GATCGCGGCT
GAGGGGGAGG GGGTGGCCGG CGTGAATCTC CAGGTCTCGC TGTACAGCCA CGCCGCGCTG
GAGGTCGCTG GCCCGAACCG GGTCGAAGGG TCCTTTGCCG ACGTGATTGC GGCCGTGGAG
GCGTTGGTCG ATGAAGGGGT GAGGGTCATC AACGCCAGTT TCGGCACCGA TGAAAGTCCG
CTCCGGGATG ACGGCGAGGA CGACACCCTC AAGGCGGCCC TGCGGGCAGC CGGTGAGCAG
GGGGTCCTGG TGGTGGCCGC GGCGGGGAAC GCGGGTGAGG AGAATGACAC GGAATCGGAC
GAGCCGACGG GCTTCTGGCC GGCCAGTTAC CGGCTGGACA ACATCATCTC GGTCGCGGCC
TCAGACGAGG ACGACGAGCG CGCGAGTTTC TCCAATTTTG GGGAGACCGA CGTCCACCTG
GCCGCGCCCG GCGTTGATAT CCTGAGTGCC GTGGTGGACC GGCCGGACCG GCCAGACCCC
TACGAATTCC GTTCGGGCAC CTCTATGGCT GTACCCCACG TGGTCGGCAT CGCCGGCCTG
GTGCTGGCCC AGCACGGCGT CGATACCCCT TACCAGGCGG TGCGGGAGCG CCTCCTGATG
TCCGCCCGGC TGAATGCCAG CCCGGACTGG GAGGGGCTGA CCGCCACCGG CGGCATCGTG
AACGCCTACG ACGCCCTCAC CGTCGACCTG GCCGGTCTGC CGCCCTTCGC CCCGAGCCGG
CTGCGGATCA ACCCGCTGCC CGATACCTCG GAGCTGGAAT TGACCTGGCT GAACAACAGC
TATCAGCTCG ATGCGCTGGC GCTGGAGCAC TGCGAGGGCG ACGGCTGCGA CGATGCGGAC
GCCGACTTCC AGGCCACCGG CGGGGTCACC CTGGAGAGCG GGGACCAGGA GGCCGTCGTC
TCCCCGGCAC TGGACGAGGG CGAGACCATC ACCTACCGGG TCTGCGCCGA GCGGGCCAGT
GTGCGCGCCT GCTCGGGTTC GGCGAGCTAC GTGGCGACCA GCGACGACAA TGGCGACAAT
GGCAACGACG ACAACGGCAA CGGCAACGAC GACAACGGCA ACGGCAACGA CGATAACGGC
AATGGCGACA ACGGCGCCCC GGACGGAGAC AACGGCACTG GGGGTGGTGG CGGAGGTGGC
GGCTGCTTCA TTGCCACCGC CGCCTGGGGC AGTGAATGGG ATGCGCCCGT GGCCACCCTG
CGCCAGTTCC GCGACGAGGC CCTGCTGACC AGCCGTGCCG GGCAGGAGCT GGTCAGCCTC
TATTACCACT TCAGCCCGGC CATCGCCGAT CGGATCGCCG AGGACGAGGA GCTCCGCGCG
CGGGTGCGGC AATGGCTGGC GCCCTTCGCC GCCGTCGCCG AGCAGGCGGT GGAGGCGGAG
TAG
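
A GC content figure like the one listed in Gene Information can be recomputed from a sequence laid out in space-separated blocks of ten bases. The following is a minimal sketch, not the database's own method; `gc_percent` is a hypothetical helper, and the example input is only the first line of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence from this record (60 bases).
first_line = (
    "ATGAAGATGC GCCTCTTCAC TGCATGGATG GCCGCCGCGC TGGTCGCGGC ACTGTTGCTG"
)
print(round(gc_percent(first_line), 1))
```

To reproduce the reported figure, pass the full gene sequence instead of the first line.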
 
Protein sequence
MKMRLFTAWM AAALVAALLL PPLLSGGLEP PEDDRRVAAL SAERLAALPA SARTLIETVR 
DNPDCCVPGE LLIRPAGDAA ETEPRMRAAH ALVGANVVRR FRTGNAEHVR LPSWMSLEEA
IAAYAEDPAV AYVEPNYRVY AAQLDPEPEF YDQQWNLPAI DVPQAWDLTT GDPAVRLGQV
DTEIDTAHPD LQQNVAGTGS QFDDSQPEDH GTHVAGVIAA EGEGVAGVNL QVSLYSHAAL
EVAGPNRVEG SFADVIAAVE ALVDEGVRVI NASFGTDESP LRDDGEDDTL KAALRAAGEQ
GVLVVAAAGN AGEENDTESD EPTGFWPASY RLDNIISVAA SDEDDERASF SNFGETDVHL
AAPGVDILSA VVDRPDRPDP YEFRSGTSMA VPHVVGIAGL VLAQHGVDTP YQAVRERLLM
SARLNASPDW EGLTATGGIV NAYDALTVDL AGLPPFAPSR LRINPLPDTS ELELTWLNNS
YQLDALALEH CEGDGCDDAD ADFQATGGVT LESGDQEAVV SPALDEGETI TYRVCAERAS
VRACSGSASY VATSDDNGDN GNDDNGNGND DNGNGNDDNG NGDNGAPDGD NGTGGGGGGG
GCFIATAAWG SEWDAPVATL RQFRDEALLT SRAGQELVSL YYHFSPAIAD RIAEDEELRA
RVRQWLAPFA AVAEQAVEAE
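
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases. This is a sketch under stated assumptions: the codon table is built by hand in the usual TCAG ordering, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are allowed). `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = (
    "ATGAAGATGC GCCTCTTCAC TGCATGGATG GCCGCCGCGC TGGTCGCGGC ACTGTTGCTG"
)
print(translate(first_line))  # MKMRLFTAWMAAALVAALLL
```

The output matches the first 20 residues of the protein sequence listed above.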