Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1195 |
Symbol | |
ID | 8136520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1390508 |
End bp | 1391956 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868809 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003021014 |
Protein GI | 253699825 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 127 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGGA CTTTGATGTG TTATGCGTTC GTGGCCCTCT TGTTGATCCC TTCCCTGGCC CAGGGGGCCG AGAAAAAGGT GATCATCGGC TTTCGTAAAA CCTCAGCACT GACGGAACTG GAAAAGCAGG ACAGAGTGCA TCGAGCCGGC GGCCGGGTGG AGCGTTCCCA TGCGATCGCT AACGCTTTGA CGGCAGACCT TCCCGAAGAG GCGATCCGCT CGCTCAAAGA GGATCCAGAC GTCTCCTACG TGGAGGAGGA CCGGGAATTT TCGGCGGAAC CCGCCTTTCC CGAGCCCGAA CTCACGCCGG AATACCTCCT CTCCTGGGGG GTGACCCGCA TAGCCGCCAA TCGGGCCGCC TGGAACGGGA TCCGGGGAGC CGGCATCAAG GTCGCCATCC TCGATACGGG GATCGACTAC AACCACCCGG AACTCAAGGA AAGTTACCGA GGGGGCTACA ACTTCCTAAC CAACACCGCC GATCCCTACG ACGACTCCCG CCGGGGGCAT GGCACTCACA TAGCCGGGGT CATCGCGGCC AAGGACAACG GAACCGGCGT GGTGGGGGTG GCTCCGGACG CCTCGCTCTA CGCGGTGAAG ATCCTGGACC GGAACATGTT CGGGAGCACT TCCAGGGTCC TGGCGGGGTT GGAATGGGCC ATAGCCAACA CAGTCGACGT CATCAACATC AGCTTCAGCA TGCCCAACGA CCCCATGTTT TTCTCACAGG CGGTGAAGGA CGCCTGCGAC GAGGCGTACG CGGCCGGGAT CGTCGTCGTC GCCGCGGCCG GCAACTCGGG GCGCCCTGTC GTGGACTACC CGGCGGATTT CGCCTCGGTG ATAGCCGTGG CAGCCACAGC GGCGGACGAC ACCCGCGCCT TCTTCTCCAA CTACGGCGCC AAGATCGAAT TCTCCGCCCC CGGCGTGGGG ATCACCTCGA CCCTCCCGGG AGGAAGGTAC GGCTTATTGA GCGGGACCTC CCAGGCTGCC CCGCACGTAG CCGGGGCGGT GGCGTTGTTA CTGTCGACCG GCGTGGTTGA CGATTCCGGC CAGGACGCGG GGAAAGTGGA AGCCGTGCGC AACTGGCTTG CCGCCGGAGC GCTGGATCTG GGCGAGCCGG ATAGGGACGC CACCTTCGGC CACGGCCTGG TGCAGGCCCC CGCCTATTGG ACCGTGCGCC GGACCCCCGC CCCCCCCTGG GAGAATGCCC TGATCCTGCC GGTAACGGCC GGCAAGCACC GTGTCGACGT GGTGAATCAC GGGCTGAAAC GCCTGGTCAT CAAAACTCCC CAAGGGGTCC AGGACGTCAT GCATCTCCCG GAAGGGAACT GGGCTCACCA GGAGACCTAC TTCAGCTTCG AGTACGAATC CGAGACCGCG GACCGGATCA CCTTTTATCC CTATGGAAGC GTCGGCAGTT CCGCCGAGAT CGGCATTTCC GCGCATTAA
|
Protein sequence | MARTLMCYAF VALLLIPSLA QGAEKKVIIG FRKTSALTEL EKQDRVHRAG GRVERSHAIA NALTADLPEE AIRSLKEDPD VSYVEEDREF SAEPAFPEPE LTPEYLLSWG VTRIAANRAA WNGIRGAGIK VAILDTGIDY NHPELKESYR GGYNFLTNTA DPYDDSRRGH GTHIAGVIAA KDNGTGVVGV APDASLYAVK ILDRNMFGST SRVLAGLEWA IANTVDVINI SFSMPNDPMF FSQAVKDACD EAYAAGIVVV AAAGNSGRPV VDYPADFASV IAVAATAADD TRAFFSNYGA KIEFSAPGVG ITSTLPGGRY GLLSGTSQAA PHVAGAVALL LSTGVVDDSG QDAGKVEAVR NWLAAGALDL GEPDRDATFG HGLVQAPAYW TVRRTPAPPW ENALILPVTA GKHRVDVVNH GLKRLVIKTP QGVQDVMHLP EGNWAHQETY FSFEYESETA DRITFYPYGS VGSSAEIGIS AH
|
| |