Gene GM21_0793 details

Gene Information

Locus tag: GM21_0793
Symbol:
ID: 8136109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 946616
End bp: 948073
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 52%
IMG OID: 644868411
Product: ATP-dependent Lon-type protease-like protein
Protein accession: YP_003020625
Protein GI: 253699436
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4930] Predicted ATP-dependent Lon-type protease
TIGRFAM ID: [TIGR02688] conserved hypothetical protein TIGR02688
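
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(946616, 948073)
    print(length, "bp")                  # value listed as Gene Length
    print(protein_length(length), "aa")  # value listed as Protein Length
```

With the values from this record, the two functions reproduce the listed gene length of 1458 bp and protein length of 485 aa.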


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 102
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGATA CAGATAAATT GCGGGCGGCA TTTCCAGACA CAACGGTTTT TAAGGCCCCG 
AACATCGTGG CGATCTTCAA AGCTGCGTCG ATTCCGTCTT TTTTGCGTGA CTGGATTTTG
AAGCGCAAGG CCGAATCGGA CGGGAGAATC CACAATGCCG AGGCATTGCG CAATTATATC
TACGATATTA TCCCCCGCCG GGAGGATTTA CTGAATTTGC AGACTGCCGC GCGAAGTGAG
GGGCGCACAA AGAAATTTCT GGCTAAGATT GAAATCCAAT TCAGCGTACG GTCCAACGAG
TACACGTTTG CGATTCCGGA ATTGGGTCTG GGGCATGCGG AAACGCTGAT TGAGGATTAT
GTATGGACTC GGATCAAAGA TGATGTTGTA AATACTGCTG GTGGCTGGGG ACTAGTGCAA
CTTGGATACC GGAGTCCGGA CGACGAGAAT GCACGTGGAT GTTTCACACT CTTAGAGTAC
AAGAATTTCT GCCCGTACAC GATAGATCTG GACGCCTATA GAGAAGCACG CTGCCAGTTC
ACAACCGAGG AATGGATCGA CGTTGTGTTG GGGGCAATCG ACTACAATCC GGAGGGTTAC
GAAGACTGGG TACAGAAGCA TACGGTACTG ACGCGCCTGC TGCCATTCAT CGAACCGCGC
CTTAATTTGA TTGAGCTAGC ACCCAAGGGG ACAGGCAAGT CATATATGTT CGGGCGGGTA
GGGAAATACG GCTGGCTGGT CAGCGGGGGG ACACTGACGC GCGCGAAGAT GTTCGGCGAT
ATCAATGGGA AGAGTCCCGG CCTGATCGCG TCGAACGACT TTGTAGCGCT GGACGAAATC
CAGTCCATCA ACTTTCCCGA TCCGAGTGAG ATGCAGGGCG GGCTAAAAGC CTACATGGAA
AGTGGGGAGA TCACGGTCGG CAAAAACCGT ATCATTGGCG GTGCCGGTGT CATACTACTG
GGAAACATCC CACAAACAGA TATGGACGAG ACCAAAGACA TGTTCCAGAG GCTGCCGCAA
GTGTTTCACG AGTCAGCATT GCTGGACCGG TTTCACGGTT TTATCCGTGG GCGCGACATA
CCGCGCATGA GTGAGAACTT GAAAATTAAC GGTTGGGCTC TGAACACGGA GTATTTTTCC
GAAATAATGC ATCTGCTTCG CCAGCCCGCA GAAACAATGA TTTATCGGCA TGTCGTAGAA
AGACTAGTAG ACTACCCTTC CGGGGCGGAT ACCCGCGATA CGGAAGCGGT TTTACGGTTG
TGCACGGCAT ACCTTAAACT GCTGTTTCCG CACGTTACAG CACCCGGCAG AATCGACAAG
GGGGAGTTCA AGCGATACTG TCTGCGTCCT GCCGTTCAGA TGCGCACGGT GATCCGACAG
CAGTTGCAGA GTATAGACCC CCTTGAATTC GGAGGGAAGA ACGTAGCAGC CTACACGTTA
CGCGAGGTAA ATGAATGA
 
Protein sequence
MVDTDKLRAA FPDTTVFKAP NIVAIFKAAS IPSFLRDWIL KRKAESDGRI HNAEALRNYI 
YDIIPRREDL LNLQTAARSE GRTKKFLAKI EIQFSVRSNE YTFAIPELGL GHAETLIEDY
VWTRIKDDVV NTAGGWGLVQ LGYRSPDDEN ARGCFTLLEY KNFCPYTIDL DAYREARCQF
TTEEWIDVVL GAIDYNPEGY EDWVQKHTVL TRLLPFIEPR LNLIELAPKG TGKSYMFGRV
GKYGWLVSGG TLTRAKMFGD INGKSPGLIA SNDFVALDEI QSINFPDPSE MQGGLKAYME
SGEITVGKNR IIGGAGVILL GNIPQTDMDE TKDMFQRLPQ VFHESALLDR FHGFIRGRDI
PRMSENLKIN GWALNTEYFS EIMHLLRQPA ETMIYRHVVE RLVDYPSGAD TRDTEAVLRL
CTAYLKLLFP HVTAPGRIDK GEFKRYCLRP AVQMRTVIRQ QLQSIDPLEF GGKNVAAYTL
REVNE
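
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence for comparison with the listed protein. It is an illustration rather than the method the database uses: the helper names are hypothetical, and the codon table is the standard 64-codon assignment that translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG).

```python
# Codon table built from the compact 64-letter form, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence as printed in this record.
    first_line = ("ATGGTCGATA CAGATAAATT GCGGGCGGCA "
                  "TTTCCAGACA CAACGGTTTT TAAGGCCCCG")
    print(translate(first_line))  # MVDTDKLRAAFPDTTVFKAP
    print(round(gc_percent(first_line), 1))
```

The translated first line matches the first twenty residues of the protein sequence in this record. Passing the complete gene sequence to the same functions gives the full translation and an overall GC percentage to compare with the listed values.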