Gene GM21_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1916 
Symbol 
ID8137250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2223132 
End bp2224928 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content63% 
IMG OID644869530 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_003021727 
Protein GI253700538 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCA CGAGCAAGAA GAAGTTACTG ATAATAGTTG CCGCTTTGAG CATTGTGGCG 
GGAAGCTGCT ACGGTATCTA CGCCTGGAAG AACTCTCATT CGGATGAGGG TAAACAGATC
AGCTACACGG CTTTCATGGA GAAGGTCAAT GCCGGCGGCA TAGCGAAGGT GAAGATCGCC
GGCGACCAGA TCGACGCTGT TGGAAAATCC GGCGAGAAAT TTCGGTTATT TCTTCCTGCG
GGAGCCGAAC TGGCCGACGC CCTGGTAGCC AAGAAGATTG ACTTCTCCTC GAATCCCGCC
GCGTCCGAGC CCAAATGGTT CGAGATCAGC ATCATACTCC TGGTCGCGCT ATTTCTCGTC
ATGGTGCTCA AGAGATACGG CGGAGTGGGG CGGAGCAAGG CGAGGATCAT CGACTGCTCG
GAGTCTCTCA CCCGGTTTAA CGATGTAGCG GGCGCCGAGG AGGCAAAGGC CGAGCTTCTC
GATACGGTCG AGTTCCTGAA AGACCCCGCG AAATTCAGCG CCCTGGGCGG CAAGATGCCG
ACCGGCGTGC TTTTGGTGGG CCCTCCCGGC ACCGGCAAGA CGCTCCTCGC CAGGGCTGTA
GCCGGCGAGG CCGACGTTCC CTTCTTCTCC ATATCGGGCT CCGAGTTCGT CGAGATGTAC
GTCGGCGTCG GCGCTTCCAG GGTGAGGGAC CTTTTCGCGC AGGCAAAAAA GGCGGCTCCC
TGCATCGTCT TCATCGACGA GATCGACGCG GTCGGCCGCA AGCGCGATGC CGCGGTGGGG
GGCGGCGCCA GCGACGAGCG CGACCAGACC CTGAACCAGC TCCTGGTGGA GATGGACGGC
TTCGCCGTCA ACTCCGGCAT AGTGGTTCTC GCCGCGACCA ACCGTCCCGA GATACTCGAC
GCTGCGCTGC TTCGCTCCGG CCGCTTCGAC CGCGAGGTCA CCGTGGGCGC GCCCGATATA
AGGGGACGCG AGGCGATACT GAAGGTCCAC TCGAAGAACG TACCCCTGAG CCCGGAGGTC
GACCTGATGG TGATCGCCCG CGGGACGCCG GGCATGTCCG GCGCCGATCT GGCCAACGTG
GTCAACGAGG CCGCTATCCT GGCGGCAAGG TCAAACAAGG GGTGTGTCGA GATGCTCGAC
TTCGACAACG CGAAGGACAA GGTGCTGATG GGGGCCGAGA AGAAGTCGAT GGTGCTCTCC
GACAAGTCCA AGCTCTCGAC CGCCTACCAC GAGGCGGGGC ACGTGCTGGT GGCGAAGCTG
GTGCCGGGAT GCGACCCGGT GCACAAGGTC TCCATCATCC CTCGCGGCAG GGCCATGGGG
GTCACGCTGC AGATCCCCGA GGAGGACATC TACTGCTACA CCAAAGAGAT GCTGCTGGCC
CACCTCAAGG TGCTCATGGG CGGGCGCGCC GCCGAGGAGA TCATCTTCCA CACCACGACC
ACCGGTGCCG GCAACGACCT GGCGCGCGCC ACCGATACGG CCAGGAAGAT GGTGAGCGAG
TGGGGCATGT CCAGGGCCTT CGGTCCGGTC GCTTTCGGCC ATCAGGAGAA CACCGACGGC
GGCGGCAAGA AAGGGTTCAG CGACGCTACC GCGCTGGAGA TGGACAACGA GATCAGGTCC
ATCGTCACCA CCTGCTATGC CGACGTGCGG ACGCTCCTGG AGGAGAACCT GGACGTCCTT
GAGCGGCTCA CCCAGGAACT GGTCGTCAAG GAGACGCTGG ACGCCGCCGA GATCGACGCC
ATCCTGGGCC TCGCCACGGC GGACGACGCC GAAGCATCCT GCTGCGCCGC CGCATGA
 
Protein sequence
MTSTSKKKLL IIVAALSIVA GSCYGIYAWK NSHSDEGKQI SYTAFMEKVN AGGIAKVKIA 
GDQIDAVGKS GEKFRLFLPA GAELADALVA KKIDFSSNPA ASEPKWFEIS IILLVALFLV
MVLKRYGGVG RSKARIIDCS ESLTRFNDVA GAEEAKAELL DTVEFLKDPA KFSALGGKMP
TGVLLVGPPG TGKTLLARAV AGEADVPFFS ISGSEFVEMY VGVGASRVRD LFAQAKKAAP
CIVFIDEIDA VGRKRDAAVG GGASDERDQT LNQLLVEMDG FAVNSGIVVL AATNRPEILD
AALLRSGRFD REVTVGAPDI RGREAILKVH SKNVPLSPEV DLMVIARGTP GMSGADLANV
VNEAAILAAR SNKGCVEMLD FDNAKDKVLM GAEKKSMVLS DKSKLSTAYH EAGHVLVAKL
VPGCDPVHKV SIIPRGRAMG VTLQIPEEDI YCYTKEMLLA HLKVLMGGRA AEEIIFHTTT
TGAGNDLARA TDTARKMVSE WGMSRAFGPV AFGHQENTDG GGKKGFSDAT ALEMDNEIRS
IVTTCYADVR TLLEENLDVL ERLTQELVVK ETLDAAEIDA ILGLATADDA EASCCAAA