Gene Mlg_2213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2213 
Symbol 
ID4268685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2518981 
End bp2520207 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content70% 
IMG OID638126969 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_743045 
Protein GI114321362 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0322293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGC CGCGCCTGCT GCGATTCCTG CTCAGTTACA CCGCCATCGG CCTGCTGGTG 
GCGGCGGTCA TCATCGGCCT GCGCCCGGAC CTGGTGGGCA TGCAGACCGG TAACAACGCC
GCGCCGAACG ACCGCAATGG CAACGGGGCG GCTGCGCCCG CCACCCAGAC GCTAGCGCCG
CCGATACAGC CCCGCACGGG GCCGGTGTCC TACGCCGACG CCGTGGAGCA GGCCCAACCG
GCCGTGGTCA ACATCTACAC CGCCAAGACC GTGCAGGAGG CGCCGCACCC ACTGTTCGAC
GACCCCTTCT TCCGCCGCTT TTTCGGCGAT GTCGCGCCCC ACCGGCCGCG GGAGCGCACG
CAGACCAGCC TGGGTTCGGG GGTGATCTTC AGCGAACAGG GCTACGTCAT CACCAATAAC
CACGTCATCG AGGACGCCGA CCAGATCCAG GTGCTGCTGG CGGATGGCCG GGAGGCCCTG
GCCAGTGTGG TGGGCCGCGA CCCGGAGACC GATCTCGCGG TGCTGCGCAT CGAACTGGAC
CGGCTGCCGG TGATCCAGTT GGCGGACGAC CGGGCGCTGC GCGTGGGCGA CGTGGTGCTG
GCCATCGGCA ACCCCTTCGG CGTCGGCCAG ACGGTGACCA TGGGCATTGT CAGCGCCACC
GGCCGCGATC AGCTCGGCCT GACCACCTTC GAGAACTTTA TCCAGACCGA CGCGGCCATC
AACCCGGGCA ACTCCGGCGG CGCACTGATC AACGCCGAGG GCCGGCTGGT GGGCATCAAC
ACCGCCATCT TCAGCCGCAC CGGGGGCCAC CAGGGCATCG GCTTCGCCAT CCCGGCCCAC
CTGGCGGTCT CGGTGCTGCA AAGCATCGTC GAGGAGGGTC GCGTGGTGCG CGGCTGGATC
GGGGTCCAGG CCCAGAGCCT GACCCCCATG CTGGCCGAGT CCTTCGACCT GGCGGCGGCA
CAGGGCATTG TCATCTCCGG CGTGTTGCGC GGGGGCCCGG CGGACCGTGC CGGCCTGCGC
CCCGGGGATA TCATCACCCA CATCGAGGGC GAACCGGCGG CCGATGCCCA GGCGCTGCTG
GAGCGGGTCA CCGACAAGCG GCCCGGGAGC GAGCTGCGGC TGGATCTGCT GCGGGATGGC
GAGGCGCGCA CGGTCACCGT GGCGGTGGGA GAACGCCCGG CCCAGGACGA GCGGCAGCCG
GCCCCACGGC AGCCGCGGTT ACCCTGA
 
Protein sequence
MKVPRLLRFL LSYTAIGLLV AAVIIGLRPD LVGMQTGNNA APNDRNGNGA AAPATQTLAP 
PIQPRTGPVS YADAVEQAQP AVVNIYTAKT VQEAPHPLFD DPFFRRFFGD VAPHRPRERT
QTSLGSGVIF SEQGYVITNN HVIEDADQIQ VLLADGREAL ASVVGRDPET DLAVLRIELD
RLPVIQLADD RALRVGDVVL AIGNPFGVGQ TVTMGIVSAT GRDQLGLTTF ENFIQTDAAI
NPGNSGGALI NAEGRLVGIN TAIFSRTGGH QGIGFAIPAH LAVSVLQSIV EEGRVVRGWI
GVQAQSLTPM LAESFDLAAA QGIVISGVLR GGPADRAGLR PGDIITHIEG EPAADAQALL
ERVTDKRPGS ELRLDLLRDG EARTVTVAVG ERPAQDERQP APRQPRLP