Gene Gmet_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_1014 
Symbol 
ID3740357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp1126874 
End bp1128304 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content58% 
IMG OID637778293 
Productpeptidase S1C, Do 
Protein accessionYP_383981 
Protein GI78222234 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCATT GGACGCTGAA ATCTGCTGGC AAAATATCCC TTCTGACAGC TTTTCTCCTG 
ATTTCGCTAA TTTTCCTGGG GGGATGCGAC GGGAGGAGCA AGACCGAATT CGTGGGATTC
CCCCAATCAT TCGCCGATCT CGCCGAAAAA ATCAGACCCG CCGTGGTGAA CATCAGCACC
ACATCAACCG TCAAAGTACC CGGCAATCCC TTCAGGCACT TTTTCGGCCC CGAGGAAGAA
GGGCCGTTTG GTGATTTCTT CAAGCATTTT TTCGGCGACA TGCCCGACCG TGAGCTGAAA
CAGCAGAGTC TCGGCTCCGG GATCATCACC GACAAGGACG GGTACATCGT CACCAACAAC
CACGTGGTGG ATAATGCCGA GGAGATAAAG GTCAAGATCT CTGACGGCAG AGAATTCAAG
GCCAAGGTTA TCGGAAGGGA TCCCAAAACC GATCTTGCGC TGATCAAGAT ATCTTCCCCC
TTCAGAAATC TCCCCGTCCT CCCCCTCGGC GACTCCGACA AAATGAGAGT TGGTGATTGG
GTGCTTGCAG TGGGGAACCC GTTCGGTCTC GAACACACCG TGACCCAGGG GATCATCAGC
GCCACCGGGA GGGTGATCGG TTCCGGGCCC TATGACAATT TCCTCCAGAC CGACGCCCCC
ATCAACCCTG GCAACAGCGG CGGCCCCCTG GTCAACCTCA AAGGGGAGGT GATCGGGATC
AATACCGCCA TCGTCCCCGG CGGGCAGGGG CTCGGCTTTG CCATCCCGAG CAGCATGGCC
AAAATGGTGC TCAAGCAGTT GCAGGAGAAG GGGAAAGTGG TGCGGGGATG GCTCGGTGTT
ACGATCCAGA CCGTAACCCC CGACCTGGCC GCCTCCTTTG GCCTCAAGGA GGCGAAGGGG
GCCCTCGTCT CCGACATCGC GGAAGGAGGA CCGGCCGCCA AAGGGGGAAT CAGGCGGGGA
GATATCATCC TTTCCTTTGA CGGGAAAAAT GTGAAGGACT CCATGGAACT GCCCCGAATC
GTAGCGGAAA CCCCGGTCGG CAAAGAGGTG GATGTCACGG TGCTCAGGGA AGGGAAAGAG
GTGCATTGCA GGGTGAGGGT CGAGGAACTC ACGGAACAGA GGATTGCCGC CCAGACCGAG
GCGCCGACGG ACAGCTTCGG AATGACGTTT GTCGACATTA CCCCCAAGGT GCGGCAACAA
CTCGGGATCA AAGAGAAAAC GGGAGTTGTC GTTGCCGGAG TGGAGCCCGG GAGCATCGCC
GAAGATGCGG GTATCCGGGC GGGGGATGTG ATCAAGGAAG TTAATCGCAA ACCGGTCAGA
AACCTGGCGG ACTTGAGCAG TGCCTTGGAG AAGTCCGCAA AGGGGCAACC GGTCCTCTTG
CTGCTCAATC GGGGAAGTCA GACTTTCTAT GTGACGCTGG AAACTTCGTA G
 
Protein sequence
MKHWTLKSAG KISLLTAFLL ISLIFLGGCD GRSKTEFVGF PQSFADLAEK IRPAVVNIST 
TSTVKVPGNP FRHFFGPEEE GPFGDFFKHF FGDMPDRELK QQSLGSGIIT DKDGYIVTNN
HVVDNAEEIK VKISDGREFK AKVIGRDPKT DLALIKISSP FRNLPVLPLG DSDKMRVGDW
VLAVGNPFGL EHTVTQGIIS ATGRVIGSGP YDNFLQTDAP INPGNSGGPL VNLKGEVIGI
NTAIVPGGQG LGFAIPSSMA KMVLKQLQEK GKVVRGWLGV TIQTVTPDLA ASFGLKEAKG
ALVSDIAEGG PAAKGGIRRG DIILSFDGKN VKDSMELPRI VAETPVGKEV DVTVLREGKE
VHCRVRVEEL TEQRIAAQTE APTDSFGMTF VDITPKVRQQ LGIKEKTGVV VAGVEPGSIA
EDAGIRAGDV IKEVNRKPVR NLADLSSALE KSAKGQPVLL LLNRGSQTFY VTLETS