Gene Mlg_1975 details


Gene Information

Locus tag: Mlg_1975
Symbol:
ID: 4268518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2244278
End bp: 2246197
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 65%
IMG OID: 638126731
Product: FtsH peptidase
Protein accession: YP_742807
Protein GI: 114321124
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
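
The start and end coordinates, the gene length, and the protein length in this record are arithmetically related. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record; names are hypothetical, chosen for illustration.
START_BP = 2244278
END_BP = 2246197


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1920 639
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.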


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.182239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.00765382
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCAAGA ATCTGATCCT CTGGATCATC ATCGCCGTGG TCCTGATGTC CGTGTTCAGC 
AACTTCCAGG AACGCGGTGC CCCGGGGCCG GAGCAGATGA AATACTCCGA GTTCCTCCGC
CATGTGGAGG CCGGAGAGGT CCGTGAGGTG ATGATCCAGG GCGATAAGAT CACCGTGGAG
CGCGCCGATG GCACGCGGGC CCACACCTAC AACCCGGAGT CGGACAACCG GGCGCTGATT
GGCACCCTGC TGGAGCACGA CGTCACCATC GATGCCCGCG AGCCGGAAGG CCGCAACATG
TTGGTGCAGA TTCTCATCTC CTGGTTCCCG TTCCTGCTGC TGATTGCCGT GTGGATCTAC
TTCATGCGCC AGATGCAGGG GGGTGGCGCG GGTCGTGGGG CCATGTCCTT CGGTAAGAGC
AAGGCCAAGC TCATGACCGA GGAGCAGAGC AAGGTCACCT TCAACGACGT GGCCGGCTGC
GACGAGGCCA AGGACGACGT GGCGGAGCTG GTGGACTTCC TGCGGGATCC GTCCAAGTTC
CAGCGCCTGG GCGGGACCAT CCCCCGCGGG GTGTTGATGG TCGGCCCCCC CGGCACCGGT
AAAACGCTGC TCGCCAAGGC GATCGCCGGT GAGGCCAAGG TGCCGTTCTT CAGCATCTCC
GGCTCCGATT TCGTCGAGAT GTTCGTCGGC GTGGGTGCTT CCCGCGTGCG CGATATGTTC
AGCCAGGCCA AGAAGCACAG CCCCTGCATC ATCTTCATCG ACGAGATCGA CGCCGTGGGC
CGCCAGCGCG GTGCCGGCCT GGGCGGGGGG CACGACGAGC GTGAGCAGAC CCTGAACCAG
CTGCTGGTGG AGATGGACGG TTTCGAGGGC AATGAGGGCG TCATCGTCAT CGCCGCCACC
AACCGCCCCG ACGTGCTCGA CCCGGCCCTG CTGCGCCCGG GCCGTTTTGA TCGCCAGGTG
GTGGTGCCGT TGCCGGACGT ACGCGGCCGC GAGCACATCC TCAAGGTGCA CATGAAGAAG
GTGCCCCTGG ACGACGATGT CACCCCCGCG ATCCTGGCCC GCGGCACCCC GGGCTTCTCG
GGCGCTGACC TGGCCAACCT GGTCAACGAG GCCGCCCTGT TCGCCGCCCG CGCCAACAAG
CGGGTGGTGG ACCAGGAGGA CTTCGAGAAG GCCAAGGACA AGATCATGAT GGGCTCCGAG
CGTAAGTCCA TGGTGATGAA GGAGGATGAG AAGCGCCTGA CCGCCTACCA CGAGGCCGGC
CACGCCATCG TCGGGTTGGT CACCCCGGAG CACGATCCGG TGCACAAGGT GACCATCATC
CCCCGCGGCC GGGCCTTGGG CGTGACCATG TTCCTACCGG AAGAGGACCG TTACAGCTAC
ACCAAGCAAC GCCTGAACAG CATGATCGCC AGCCTGTTCG GCGGCCGTAT CGCCGAGGAG
CTGATCTTCG GCCACGAGCG GGTGACCACC GGCGCCTCCA ATGACATCCA GCGGGCCACC
GAGATTGCCC GCAACATGGT CACCAAGTGG GGGCTGTCCG CCCGCCTGGG CCCCCTGGCC
TACGGAGATG AGGAGGGCGA GGTCTTCCTC GGTCATTCGG TGACCCAGCA TAAGGACGTT
TCGGAAGAGA CCCAGCACGC CATCGACGAG GAGGTGCGGG CCATCATCGA CGCCAACTAC
ACGGCGGCGG AGAAGATCAT CCGCGAGCAC ATGGATCAGC TTCACGTGAT GGCCGATGCA
CTGATGCGCT ACGAGACCAT CGATCGTGCT CAGATCGACG ACATCATGGC CGGGCGCGAG
CCGCGGCCGC CGAAGGGCTG GTCCGGGCCT TCGGACGATG ACGATGACCG TGCCGAGGGC
GAGAAGGTGG GTAAGGACGA GCGCGGCCGT ATTGGCGGCC CCGCCACCAG CGAACACTGA
 
Protein sequence
MAKNLILWII IAVVLMSVFS NFQERGAPGP EQMKYSEFLR HVEAGEVREV MIQGDKITVE 
RADGTRAHTY NPESDNRALI GTLLEHDVTI DAREPEGRNM LVQILISWFP FLLLIAVWIY
FMRQMQGGGA GRGAMSFGKS KAKLMTEEQS KVTFNDVAGC DEAKDDVAEL VDFLRDPSKF
QRLGGTIPRG VLMVGPPGTG KTLLAKAIAG EAKVPFFSIS GSDFVEMFVG VGASRVRDMF
SQAKKHSPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ LLVEMDGFEG NEGVIVIAAT
NRPDVLDPAL LRPGRFDRQV VVPLPDVRGR EHILKVHMKK VPLDDDVTPA ILARGTPGFS
GADLANLVNE AALFAARANK RVVDQEDFEK AKDKIMMGSE RKSMVMKEDE KRLTAYHEAG
HAIVGLVTPE HDPVHKVTII PRGRALGVTM FLPEEDRYSY TKQRLNSMIA SLFGGRIAEE
LIFGHERVTT GASNDIQRAT EIARNMVTKW GLSARLGPLA YGDEEGEVFL GHSVTQHKDV
SEETQHAIDE EVRAIIDANY TAAEKIIREH MDQLHVMADA LMRYETIDRA QIDDIMAGRE
PRPPKGWSGP SDDDDDRAEG EKVGKDERGR IGGPATSEH
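
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the database's own method. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and it hard-codes those assignments in the usual TCAG ordering. `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids listed in the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGCCAAGA ATCTGATCCT CTGGATCATC"))  # MAKNLILWII
print(translate("AGCGAACACTGA"))  # SEH
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `translate` gives the complete comparison.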