Gene GM21_3457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3457 
Symbol 
ID8138829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4000636 
End bp4002066 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content62% 
IMG OID644871077 
Productprotease Do 
Protein accessionYP_003023237 
Protein GI253702048 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones102 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGCA TCAGACTCCT CGCTGTCTGC CTTATCGTCT CCACGTTAAT GATAGCCTGC 
AAGAAAAAGG AGCAGGCATC CTTCTACGAA TCCGAGCGCA AGGAGAGCGT CCCTGCCCCG
GTCCAGGAGG TTCCCAAGGA CATCCTGGTG ACGCAGCAGG CCTTCACCGC GCTGGTCAGG
ACCGTGACCC CCTCCGTCGT CAACATCTCC ACCATCGGCA AGAAAAAGCT GGTCCGCCCC
TTCTTCGAGT CGTCCCCGTT CTTCGACGAG TTCTTCGGCG AAAGAACACG CCCGCAGTAC
CGCCGAGAAC ACAGCTTGGG CTCCGGCTTC ATTCTGAACA AGGAAGGGTA TATCGTCACC
AACGATCACG TGGTGCGGGA CGCCGAAACC ATACAGGTGA AGCTTTCCAA CGAAAGCGTC
TACAAGGGGA AGGTGATCGG CTCGGACCCG AAGACGGACA TCGCGGTGAT AAAGATCGAC
GCCAAAGAGC CGCTGCCGGC AGCAGTCCTC GGGGATTCCA ATAAGCTGCA GGTCGGGCAG
TGGGCGATTG CCATAGGGAA CCCCTTCGGC CTCGACCGGA CCGTCACCGT GGGCGTTGTC
TCCGCAACCG GCAGATCGAA CATGGGGATC GAGACCTACG AGGATTTCAT ACAGACCGAC
GCCTCCATCA ACCCGGGGAA CTCCGGCGGT CCGCTGCTTA ACATCTACGG CGAGGTGATC
GGAATCAACA CCGCCATCGT CGCCGCCGGA CAGGGAATCG GCTTCGCCAT CCCGGTAAAC
ATGGCGAAGC AGGTGGTGAC GCAGTTGATC AGCAAGGGGA ACGTGAGCCG CGGCTGGCTC
GGCGTCTCGA TCCAGTCGGT GACCGAGGAG ATGGCGAGCT CCTTCGGGCT TCCCAAAGCG
TACGGCGCGC TGGTGAACGA CGTCGTTGCA GGCGGCCCGG CTGCGAAGGC CGGCGTCATG
CAGGGGGACG TGATCACGAG CTTTGCCGGG ACAGCGGTGA AGGACGTGCG CCAGTTGCAG
CGCCTGGTCG GGGAAACGCC TATCGGGAAG AAGGTCCCGG TGGAGCTCTA TCGCGACGGC
AAGAAGATCA CCGTCCAGAT AACCACCGCA CCTGCCGACA GCGCCCAGGC CCAGATGCAA
AGACCTGCCG AGCGGGAAGC GGGGACACTG GGGCTCTCGG TCGAGGAACT GGGAGCGGAG
ATGCGTTCCC GCGGCGTCAC CGGCGTGGTG GTGAGCGACC TCGAACCGGG CGGGATCGCT
GAGGAGAGCG GCATCCAGCG CGGCGACATC ATCGTCTCCG TGAACCAGAG GAAGGTGAGG
AATCTGGCCG AGTACCAGAA GGCGATGGCG GACGCCGGCA AACGCGGCGC CGTGGCGCTC
CTGGTCCGGA GAGGCTCGGC CAGCATCTAC TTCGCACTAA AATTAAGATA G
 
Protein sequence
MKSIRLLAVC LIVSTLMIAC KKKEQASFYE SERKESVPAP VQEVPKDILV TQQAFTALVR 
TVTPSVVNIS TIGKKKLVRP FFESSPFFDE FFGERTRPQY RREHSLGSGF ILNKEGYIVT
NDHVVRDAET IQVKLSNESV YKGKVIGSDP KTDIAVIKID AKEPLPAAVL GDSNKLQVGQ
WAIAIGNPFG LDRTVTVGVV SATGRSNMGI ETYEDFIQTD ASINPGNSGG PLLNIYGEVI
GINTAIVAAG QGIGFAIPVN MAKQVVTQLI SKGNVSRGWL GVSIQSVTEE MASSFGLPKA
YGALVNDVVA GGPAAKAGVM QGDVITSFAG TAVKDVRQLQ RLVGETPIGK KVPVELYRDG
KKITVQITTA PADSAQAQMQ RPAEREAGTL GLSVEELGAE MRSRGVTGVV VSDLEPGGIA
EESGIQRGDI IVSVNQRKVR NLAEYQKAMA DAGKRGAVAL LVRRGSASIY FALKLR