Gene GM21_0228 details

Gene Information

Locus tag: GM21_0228
Symbol:
ID: 8135534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 271231
End bp: 272604
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 63%
IMG OID: 644867849
Product: protease Do
Protein accession: YP_003020071
Protein GI: 253698882
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
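
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes inclusive, 1-based coordinates (the record does not state the convention) and an unspliced CDS whose last codon is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(271231, 272604)
print(length)                  # 1374
print(protein_length(length))  # 457
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields above.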


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCAG CAATCGTAGC AAGATTTCTG GTACTTAGCG TATTTCTCTC CTCCATTTTC 
TCGACCGTCG CGGGCGCCTC GGTGCTCACT CCGGACTTCG CCAAGCTGGC GAAGAAGCTG
AAGCCCGCTG TGGTCAACAT CAGCACCTCG AAGACCATAG CGGTGAAAAA GCGGCAGATC
GCCCCCGGAC GCGACCCCTT CCAGGAATAT TTCGAGAAGT TTTTCGAGGG GCCTCACCAG
CGTCCGCAAA AGCAGAGGAA CCTCGGCACC GGTTTCATCA TCAGCGACGA CGGCTACATC
ATCACCAACA ACCACGTGGT GAAGGATGCC GACGAGATCA AGGTGAAGCT CTCCGATGGC
AGGGAGTTCG CGGGGGATGT GAAGGGGCGC GACGAAAAGC TGGACCTTGC CCTGGTGAAG
ATCGACGCCA AGGGTCACCT TCCCGTGGCG CCTTTGGGGG ACAGCGACAA GATGGAAGTG
GGGGACTGGG TGATGGCGAT AGGCAACCCC TTCGGGCTCT CCCAGACCGT GACGGCGGGG
ATCATCAGCG CCCAGGGGCG CGTGATCGGT TCCGGCCCGT ATGACGACTT CATCCAGACC
GACGCCTCCA TCAACCCCGG CAACTCAGGG GGGCCGCTAT TCAACACCGA AGGGGAGGTG
ATCGGCATCA ACACTGCGAT CGTCGCCGGC GGCCAGGGAA TAGGGTTTGC CATACCGGTC
AACATGGCGA AGGAGATCCT CCCGCAGCTG AAGTCGGCCG GCAAGGTGAC ACGCGGTTGG
CTGGGCGTAT CGGTGCAGTT GGTGACCCCG GACCTCGCCA AATCCTTCGG GCTTGACAGC
GAGAAGGGGG CGCTGGTGGC CGACGTGGTG AAAGAGAGCC CCGCGGAGAA GGCCGGCCTC
AAGGGGGGCG ACATCATCCT CGAGTACGAC GGGCATCCCA TCAAGGAGAT GGGGGAGCTT
CCGCGCCGCG TGGCCGCCAC CCCGGTAGGA AAGAAGGTGA AACTGGTGGT GCAGCGCGAG
GGGCGTCAGG AGACGTTGCA GGTGACCGTC GAGCAGTTGA AGGACGACGA CCAGGATAGC
GCGGTCGCCA GCGACCGGCT CGGGGTGAAG GTGACGGAGC TTACCCCGGA GCGCGCGCAG
CAGTTGCGGG TGCAGGGGGA CAAAGGGGTC GTGGTGACCG ATGTGGAGCC GGACAGCCTT
GCCGACCGCG CCGGCATCCA GGAGGGAGAC CTGATCAGGG AGATCAACGG AGTGCGCGTA
AGCGGTGTGA GCGATTACAG CAAGTTGATC GCGGCGGCGA AGAAGGGGGG GTATCTGAAG
ATGCTGCTGA GGCGCGGCGA CGCCTCCCTG TTCGTGGCGC TCAGGCTCGA ATAG
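
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistics are computed. The following is a minimal sketch of how that could be done, together with a GC-content calculation. The helper names are hypothetical, and only the first printed line is used as an excerpt, so its GC value will not match the 63% reported for the full 1374 bp gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used here as a short excerpt.
first_line = "ATGCAGGCAG CAATCGTAGC AAGATTTCTG GTACTTAGCG TATTTCTCTC CTCCATTTTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of the excerpt only
```

To check the reported GC content, the same two functions can be applied to the complete sequence text instead of the excerpt.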
 
Protein sequence
MQAAIVARFL VLSVFLSSIF STVAGASVLT PDFAKLAKKL KPAVVNISTS KTIAVKKRQI 
APGRDPFQEY FEKFFEGPHQ RPQKQRNLGT GFIISDDGYI ITNNHVVKDA DEIKVKLSDG
REFAGDVKGR DEKLDLALVK IDAKGHLPVA PLGDSDKMEV GDWVMAIGNP FGLSQTVTAG
IISAQGRVIG SGPYDDFIQT DASINPGNSG GPLFNTEGEV IGINTAIVAG GQGIGFAIPV
NMAKEILPQL KSAGKVTRGW LGVSVQLVTP DLAKSFGLDS EKGALVADVV KESPAEKAGL
KGGDIILEYD GHPIKEMGEL PRRVAATPVG KKVKLVVQRE GRQETLQVTV EQLKDDDQDS
AVASDRLGVK VTELTPERAQ QLRVQGDKGV VVTDVEPDSL ADRAGIQEGD LIREINGVRV
SGVSDYSKLI AAAKKGGYLK MLLRRGDASL FVALRLE
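
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted. Because this gene starts with ATG, no special start-codon handling is included, which is a simplification. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with their one-letter amino acids.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence: the first ten residues.
print(translate("ATGCAGGCAG CAATCGTAGC AAGATTTCTG"))  # MQAAIVARFL
# Last 9 bases of the gene sequence: the final two residues and the stop.
print(translate("CTCGAATAG"))                         # LE*
```

Both excerpts agree with the start (MQAAIVARFL) and end (LE) of the protein sequence listed above.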