Gene Information |
Locus tag | GM21_0793 |
Symbol | |
ID | 8136109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 946616 |
End bp | 948073 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644868411 |
Product | ATP-dependent Lon-type protease-like protein |
Protein accession | YP_003020625 |
Protein GI | 253699436 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4930] Predicted ATP-dependent Lon-type protease |
TIGRFAM ID | [TIGR02688] conserved hypothetical protein TIGR02688 |
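The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record for GM21_0793.
gene_length = span_length(946616, 948073)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length (1458 bp) and Protein Length (485 aa) fields.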
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 102 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGATA CAGATAAATT GCGGGCGGCA TTTCCAGACA CAACGGTTTT TAAGGCCCCG AACATCGTGG CGATCTTCAA AGCTGCGTCG ATTCCGTCTT TTTTGCGTGA CTGGATTTTG AAGCGCAAGG CCGAATCGGA CGGGAGAATC CACAATGCCG AGGCATTGCG CAATTATATC TACGATATTA TCCCCCGCCG GGAGGATTTA CTGAATTTGC AGACTGCCGC GCGAAGTGAG GGGCGCACAA AGAAATTTCT GGCTAAGATT GAAATCCAAT TCAGCGTACG GTCCAACGAG TACACGTTTG CGATTCCGGA ATTGGGTCTG GGGCATGCGG AAACGCTGAT TGAGGATTAT GTATGGACTC GGATCAAAGA TGATGTTGTA AATACTGCTG GTGGCTGGGG ACTAGTGCAA CTTGGATACC GGAGTCCGGA CGACGAGAAT GCACGTGGAT GTTTCACACT CTTAGAGTAC AAGAATTTCT GCCCGTACAC GATAGATCTG GACGCCTATA GAGAAGCACG CTGCCAGTTC ACAACCGAGG AATGGATCGA CGTTGTGTTG GGGGCAATCG ACTACAATCC GGAGGGTTAC GAAGACTGGG TACAGAAGCA TACGGTACTG ACGCGCCTGC TGCCATTCAT CGAACCGCGC CTTAATTTGA TTGAGCTAGC ACCCAAGGGG ACAGGCAAGT CATATATGTT CGGGCGGGTA GGGAAATACG GCTGGCTGGT CAGCGGGGGG ACACTGACGC GCGCGAAGAT GTTCGGCGAT ATCAATGGGA AGAGTCCCGG CCTGATCGCG TCGAACGACT TTGTAGCGCT GGACGAAATC CAGTCCATCA ACTTTCCCGA TCCGAGTGAG ATGCAGGGCG GGCTAAAAGC CTACATGGAA AGTGGGGAGA TCACGGTCGG CAAAAACCGT ATCATTGGCG GTGCCGGTGT CATACTACTG GGAAACATCC CACAAACAGA TATGGACGAG ACCAAAGACA TGTTCCAGAG GCTGCCGCAA GTGTTTCACG AGTCAGCATT GCTGGACCGG TTTCACGGTT TTATCCGTGG GCGCGACATA CCGCGCATGA GTGAGAACTT GAAAATTAAC GGTTGGGCTC TGAACACGGA GTATTTTTCC GAAATAATGC ATCTGCTTCG CCAGCCCGCA GAAACAATGA TTTATCGGCA TGTCGTAGAA AGACTAGTAG ACTACCCTTC CGGGGCGGAT ACCCGCGATA CGGAAGCGGT TTTACGGTTG TGCACGGCAT ACCTTAAACT GCTGTTTCCG CACGTTACAG CACCCGGCAG AATCGACAAG GGGGAGTTCA AGCGATACTG TCTGCGTCCT GCCGTTCAGA TGCGCACGGT GATCCGACAG CAGTTGCAGA GTATAGACCC CCTTGAATTC GGAGGGAAGA ACGTAGCAGC CTACACGTTA CGCGAGGTAA ATGAATGA
|
Protein sequence | MVDTDKLRAA FPDTTVFKAP NIVAIFKAAS IPSFLRDWIL KRKAESDGRI HNAEALRNYI YDIIPRREDL LNLQTAARSE GRTKKFLAKI EIQFSVRSNE YTFAIPELGL GHAETLIEDY VWTRIKDDVV NTAGGWGLVQ LGYRSPDDEN ARGCFTLLEY KNFCPYTIDL DAYREARCQF TTEEWIDVVL GAIDYNPEGY EDWVQKHTVL TRLLPFIEPR LNLIELAPKG TGKSYMFGRV GKYGWLVSGG TLTRAKMFGD INGKSPGLIA SNDFVALDEI QSINFPDPSE MQGGLKAYME SGEITVGKNR IIGGAGVILL GNIPQTDMDE TKDMFQRLPQ VFHESALLDR FHGFIRGRDI PRMSENLKIN GWALNTEYFS EIMHLLRQPA ETMIYRHVVE RLVDYPSGAD TRDTEAVLRL CTAYLKLLFP HVTAPGRIDK GEFKRYCLRP AVQMRTVIRQ QLQSIDPLEF GGKNVAAYTL REVNE
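The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns codons to amino acids in the same way as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and `clean` assumes the sequence is pasted in the space-separated 10-base blocks shown above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGGTCGATA CAGATAAATT GCGGGCGGCA"))  # MVDTDKLRAA
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.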
|
| |