Gene GM21_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1195 
Symbol 
ID8136520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1390508 
End bp1391956 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content64% 
IMG OID644868809 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003021014 
Protein GI253699825 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones127 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGGA CTTTGATGTG TTATGCGTTC GTGGCCCTCT TGTTGATCCC TTCCCTGGCC 
CAGGGGGCCG AGAAAAAGGT GATCATCGGC TTTCGTAAAA CCTCAGCACT GACGGAACTG
GAAAAGCAGG ACAGAGTGCA TCGAGCCGGC GGCCGGGTGG AGCGTTCCCA TGCGATCGCT
AACGCTTTGA CGGCAGACCT TCCCGAAGAG GCGATCCGCT CGCTCAAAGA GGATCCAGAC
GTCTCCTACG TGGAGGAGGA CCGGGAATTT TCGGCGGAAC CCGCCTTTCC CGAGCCCGAA
CTCACGCCGG AATACCTCCT CTCCTGGGGG GTGACCCGCA TAGCCGCCAA TCGGGCCGCC
TGGAACGGGA TCCGGGGAGC CGGCATCAAG GTCGCCATCC TCGATACGGG GATCGACTAC
AACCACCCGG AACTCAAGGA AAGTTACCGA GGGGGCTACA ACTTCCTAAC CAACACCGCC
GATCCCTACG ACGACTCCCG CCGGGGGCAT GGCACTCACA TAGCCGGGGT CATCGCGGCC
AAGGACAACG GAACCGGCGT GGTGGGGGTG GCTCCGGACG CCTCGCTCTA CGCGGTGAAG
ATCCTGGACC GGAACATGTT CGGGAGCACT TCCAGGGTCC TGGCGGGGTT GGAATGGGCC
ATAGCCAACA CAGTCGACGT CATCAACATC AGCTTCAGCA TGCCCAACGA CCCCATGTTT
TTCTCACAGG CGGTGAAGGA CGCCTGCGAC GAGGCGTACG CGGCCGGGAT CGTCGTCGTC
GCCGCGGCCG GCAACTCGGG GCGCCCTGTC GTGGACTACC CGGCGGATTT CGCCTCGGTG
ATAGCCGTGG CAGCCACAGC GGCGGACGAC ACCCGCGCCT TCTTCTCCAA CTACGGCGCC
AAGATCGAAT TCTCCGCCCC CGGCGTGGGG ATCACCTCGA CCCTCCCGGG AGGAAGGTAC
GGCTTATTGA GCGGGACCTC CCAGGCTGCC CCGCACGTAG CCGGGGCGGT GGCGTTGTTA
CTGTCGACCG GCGTGGTTGA CGATTCCGGC CAGGACGCGG GGAAAGTGGA AGCCGTGCGC
AACTGGCTTG CCGCCGGAGC GCTGGATCTG GGCGAGCCGG ATAGGGACGC CACCTTCGGC
CACGGCCTGG TGCAGGCCCC CGCCTATTGG ACCGTGCGCC GGACCCCCGC CCCCCCCTGG
GAGAATGCCC TGATCCTGCC GGTAACGGCC GGCAAGCACC GTGTCGACGT GGTGAATCAC
GGGCTGAAAC GCCTGGTCAT CAAAACTCCC CAAGGGGTCC AGGACGTCAT GCATCTCCCG
GAAGGGAACT GGGCTCACCA GGAGACCTAC TTCAGCTTCG AGTACGAATC CGAGACCGCG
GACCGGATCA CCTTTTATCC CTATGGAAGC GTCGGCAGTT CCGCCGAGAT CGGCATTTCC
GCGCATTAA
 
Protein sequence
MARTLMCYAF VALLLIPSLA QGAEKKVIIG FRKTSALTEL EKQDRVHRAG GRVERSHAIA 
NALTADLPEE AIRSLKEDPD VSYVEEDREF SAEPAFPEPE LTPEYLLSWG VTRIAANRAA
WNGIRGAGIK VAILDTGIDY NHPELKESYR GGYNFLTNTA DPYDDSRRGH GTHIAGVIAA
KDNGTGVVGV APDASLYAVK ILDRNMFGST SRVLAGLEWA IANTVDVINI SFSMPNDPMF
FSQAVKDACD EAYAAGIVVV AAAGNSGRPV VDYPADFASV IAVAATAADD TRAFFSNYGA
KIEFSAPGVG ITSTLPGGRY GLLSGTSQAA PHVAGAVALL LSTGVVDDSG QDAGKVEAVR
NWLAAGALDL GEPDRDATFG HGLVQAPAYW TVRRTPAPPW ENALILPVTA GKHRVDVVNH
GLKRLVIKTP QGVQDVMHLP EGNWAHQETY FSFEYESETA DRITFYPYGS VGSSAEIGIS
AH