Gene GM21_3954 details


Gene Information

Locus tag: GM21_3954
Symbol:
ID: 8139328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4536925
End bp: 4538115
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 65%
IMG OID: 644871570
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003023728
Protein GI: 253702539
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.000000174945
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACAACT ACATCATCCT TCGTGACCTA AGCGCAGTAG AAACCGCTGA ACCGTTCGGC 
ATCAGGGCGG GGCGTATGCG GGCCGGCGTC GCCGCGCCGA TGCAACCGCA GCTCTCCGTG
GAGCAGCTCG ACAAGCAAAC GGTCAAGGAC GTCGTGCGCG ATCCCAGCGT GCTCGCGATC
ACTCCGAAGA TGCCCATCAA GCTGATCCAC CCCACGGAGA CGTCGATCCC GTCCCGGGCC
ACCGAATGCT GGGGAGTCGA CGCCGTCGGC GCCAAGAACT GTCCCTTCAA CGGCGAGGGG
GTGACCATAG CCGTGCTCGA TACCGGGGTC GATGCGGCGC ACCGCGCCTT CCAGGGGGTG
ACCTTCGTGC AGAAGGATTT CTCGGGCTCG GGAGACGGAG ACCGCCAGGG ACACGGCACT
CATTGCATGG GGACCATCAT CGGTCGCGAC GTGGAAGGAA TCAGGATCGG CATCGCGCCG
GGCGTTCAAC GGGCCCTGAT CGGGAAGGTG CTGGACGACA CCGGAAGCGG CACCTCCGAA
ATGATCTTCC AAGGGATCCA ATGGGCCGTC TCCCAGGGGG CGGACGTTAT CTCCATGTCT
CTTGGCTTCG ACTTTACGGG GATGGTGGAC TCGCTGATTT CCCAGGGGTG GCCGAACGCA
CTCGCCACCT CCGCCGGGCT TGAGGCGTAC CGTGGCAACC TGCGCATGTT CGACGCGCTG
ATGAACGTGG TGCAGAGCCA GGCGGCGTTC ACCCAGGGGT GCATCATCGT GGCGGCCGCC
GGCAATGAAA GCAAGCGCGA CGTGCGGTCG GATTTCGAGA TCGCGGCATC CCTCCCCGCG
GCAGCCCAGG GGGTCATCTC GGTCGGCGCG CTGCAACAGG GGCAAAACGG CCTGCAGATA
GCCAGTTTCT CCAACACCTT CCCGCAGGTG GCCGCTCCCG GAGTCGCCAT CCTCTCCGCC
AAGGCAGGCG GTGGTCTGCG CGCCTTGAGC GGCACCAGCA TGGCCTGCCC TCACGTCGCA
GGGATTGCCG CCCTCTGGTG GGACGCCCTG CGCAAATCCG CCCTGAACCC CACTGCGACG
GCGGTACAGG CGAAGCTTTT GGCCTCTTCG CGCACCAACG CTTTCGCCCC GAACACCGAT
TTCGCCGACC GGGGGCTGGG GATAGTTACG GCTCCCACTG AAATATCCTG A
 
Protein sequence
MNNYIILRDL SAVETAEPFG IRAGRMRAGV AAPMQPQLSV EQLDKQTVKD VVRDPSVLAI 
TPKMPIKLIH PTETSIPSRA TECWGVDAVG AKNCPFNGEG VTIAVLDTGV DAAHRAFQGV
TFVQKDFSGS GDGDRQGHGT HCMGTIIGRD VEGIRIGIAP GVQRALIGKV LDDTGSGTSE
MIFQGIQWAV SQGADVISMS LGFDFTGMVD SLISQGWPNA LATSAGLEAY RGNLRMFDAL
MNVVQSQAAF TQGCIIVAAA GNESKRDVRS DFEIAASLPA AAQGVISVGA LQQGQNGLQI
ASFSNTFPQV AAPGVAILSA KAGGGLRALS GTSMACPHVA GIAALWWDAL RKSALNPTAT
AVQAKLLASS RTNAFAPNTD FADRGLGIVT APTEIS
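
The listed lengths and GC content can be cross-checked against the sequences themselves. The following is a minimal sketch, not part of the original record: the helper names (clean_sequence, gc_content, expected_protein_length) are hypothetical, and the length check assumes the CDS ends with a single stop codon that is not counted in the protein length.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First line of the gene sequence, used here only as a small demonstration.
first_line = "ATGAACAACT ACATCATCCT TCGTGACCTA AGCGCAGTAG AAACCGCTGA ACCGTTCGGC"
seq = clean_sequence(first_line)
print(len(seq))                       # 60 bases per full line
print(expected_protein_length(1191))  # 396, matching the listed protein length
```

Passing the full gene sequence through clean_sequence and gc_content gives a value to compare with the listed gene length and GC content.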