Gene GM21_2758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2758 
Symbol 
ID8138101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3203136 
End bp3206168 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content63% 
IMG OID644870363 
ProductPDZ/DHR/GLGF domain protein 
Protein accessionYP_003022552 
Protein GI253701363 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones157 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCAG TTCGGCGAGT AATTAAGCGG CAGATCGCAC TCCCGGCAAT CGCCGCACAG 
GTTTGTCTCC TTGTTCTGGT CATGACGGGA ACGGCCGCCG CCAAGAGTGC TTCCCGACCG
TCCGCCGTCG TCAAACCACC GGCCGCGGTC AAAGCGCCGC CCGCGGCATC TCCCGCGGCG
ACGGCGCCTG CTGCGACGGC CGAACGGGAA CCCGCGCCTG CGCTCTTCGG AGATGCCACC
TTCCTGCTCC CCCGAGGGGC GAAGGTCGCG GCGCGTTGGG TGCTGCCACC GCTGGAGAAG
GGGAGAGACG CCATCTTCAG CGTCGATGCC GCGGGTTCGC CCGTCATCGC GTTCAGTGGC
GACTCGAACT ATCACCTGCT CCATCCCGAC AGAAACTACG TGGTGGCCGT GAGAGCTACC
ATTTCGGGGA TGACCCATCT CGCCAACGGC GTGCTCCTGC TGTCGTCGGG CAACGACCTG
CTCCTCGTTG CCGAACCGAA GGAGAAGACT CTGGACAAAA AGGGGGTTCC CTATGCCGCG
CTGCAGCCCC TGACCAAGAT CCCCCTGCGC AAGATCGAGG TGCTGACCAG CGTCGGCACC
ACGGTCTACT GCGCCGGCAT CGATGCCCGC AGCGGCCGCC ATGCCCTTTA CCTGCTCCGC
TCCCTCAAGG GGGGAGGCAT TCTCGATATG GAACTGGCGT ACGAGTCACA AGAGCCCATT
ACCGCGGTTA CCGCAGATGC AGATGCCATC TACGTCGCCA AGGGGCGCAC GGTGGTTCGG
TATCTCGTCA AGGATGGGAC GCAGACCCCT TTCTATACCC ACCCCTCAAA GACCGTCACC
GGGCTGGCCA TGACACCGGC CGGACTCGTC GTCAGCACCG GCAGGGAGAT CGTCCTAGCC
GGCCGGGGCG GTGCGCTGGA GATCATGGCC TCGTCGGCGC ACCGCATCGC CATGGCCGGC
GAAACTCTCT ATGTCTTCTT CAACAGCTCC CTGGGGGTGC TGGCCCTCGA CAACCTCGCG
GACCTCGGGC GCTTCAACCT GGCAGTCAGG CCGGTGACTC CCGGCGAGGC GCAATCCCCT
CTAGCCGTCA GCAGCGTCAG CTTCTTCGAG AGCGATTCCC TTCAAAACAC CCACGGGTTC
TCGGAGAGCT TCGACCGCAA AGCGGTGCGG CGGATCGTGG CCCAGATCGA ATTGGACCCG
GCCTCACTGG CCGGAAGCCG GGGGGATCAT GTCGTCACGC TCTCCTGGCA CGAGCCAAAC
GGCGGCATGC TCAAAAGCAC CAGCCACCAG GTGACAAAGC AGTCCGGCGG CCGGATCTTC
GCCACTCTCG GGGGCAAGGC ACAGCGTGGA TATTCTCCTC CCCATTGGAC CAACAACGGA
GAGGCATTTC TTCGCTATAC CGATGAACTG GCGAACAACT ACCCCGGACG CTATCGCATG
CTGGTGCAGG TGGATGGGAT CGCTGCCGGG GAATGGTCCT TTATCCTCAC CGGTCAGGCC
ACTGCCGAAC AGGCCATTTC CCGTGACGAC CTGCAAGCGC TCAAAACCAT GCTCGATCAG
GGGCTCAGTG CCCGGAGCAA AAGCGAGGAT GGCGAGCCGC TCGTCACCAC GGCGGTTCGG
TTTGGAAGCG TTCGGGCAGT GCAGTTGCTT CTGGAGCGGG GCGCCGATGC CAATGCCACA
GACAAGGAAG GATATCCGTC GCTGGCGCGG GCGGAGTACG CCGGCGATTG GCGGACCAAG
GCTGAACTCC TGTTGCGTCA CGGCGCCAAC GTCAACGCAC CGAGGTACCC GGGCGGGCCG
CCGCTGGTGG AGACCTTTTC CGCCGACTTT ACCCTCTTCA TGCTGCAAAA CGGGGCCGAT
TTCCGGTACG AAACCAAATA CGGCAAGCGC AGCGTGTTGG GCGAGATGTA CGACTCGACT
TGCACCGAGG AGATTCTGTC GCTCCTGATC AAGCGCGGGG CCGATCTGAA CGAAACCACC
TCGTTTGTAC ACTTCTCCCC TCTGGGCAGA GCCATCTACA GCGGCAACGA ACGCTGCACC
CAGCTGCTTC TGGAAAAAGG GGCGTCGACC GCCGTCGTCC AGAAGGAGCC GAACAGGCGG
CCCCGTTCCG CGCTGTACGT GGCCTTCGAA AATCTCGACC GCAGTTCTGA CCCCAAGGAT
AAAGCCGCGC GACGGCGCAT CGTCCGCCTG CTGCTGCAAA AAGGCGCCAC CTTCAAACCG
GGGAAAAAGC TCTCCACATC CGCATACTTC GATCTTCCCA AGGACGATTA CATCAGGCAG
ATGGATGAAA CCGCCTCCAT CGTCGCGGGA GAGGGGAGCC TCATGTTCCT GGGCGAGGGG
CCGACCTTCT TCGAGCAGGT GGACATGATC GGGATGCTGG AACAGCAAGA TGCGGCACTG
GAGACGGCTA CCAAATCCAA AGACCCGGCG ATCCGGGAGT TGGCGCTTAG CACGCACCTT
GGTCGGGTCC GGGAGTTGAC GGCCAAGGCG AGGGATCAGT ACGACATGGG CTTTCAGGTT
CACAAGCACT GTGAGCAGGC ATTCCAGCTG TCCGAGGCGC AGTATCGCCC GGCCCAGGTG
GATGTCGTGC CCGAACTGCA GCAACCTCCG CCGGGTGGTC AGGGGAAATC CCAGCTCGGA
GTCAAACTGC TGAAACGTGC GGCGGGAGGC GCCTACGTAC AGGGGGTAAT GCCGGGAGGC
CCCGCCGAAC GGGCCGGGCT GAAGACGGGC GATATCATCC TCGCCCTGGA TACCCAGAAG
ATGAAGGATG CCGACGAAGT TGCGGCTACG GCCGCCCGCC TCGCACCCGG TATGCCGGTG
CGGGTGACCT TTCTGCGCGA CGAACCGCTG CGGATGCCGG ACCTTCAGCT CAGCTGCGGA
CTGGTGGAGA CGGAATACAA GGACCACTGG GGATACGCCG AGATGAACCT CACCCGCTGG
CTGGCCGCGC ATCCCGACGC TGCCGCCTCC GCAGAGGTGC GCGCGCGGCT CAAGCAAATT
ACTTCGGGGG TGCGGAAGCA GCTTCCCATT TAA
 
Protein sequence
MDPVRRVIKR QIALPAIAAQ VCLLVLVMTG TAAAKSASRP SAVVKPPAAV KAPPAASPAA 
TAPAATAERE PAPALFGDAT FLLPRGAKVA ARWVLPPLEK GRDAIFSVDA AGSPVIAFSG
DSNYHLLHPD RNYVVAVRAT ISGMTHLANG VLLLSSGNDL LLVAEPKEKT LDKKGVPYAA
LQPLTKIPLR KIEVLTSVGT TVYCAGIDAR SGRHALYLLR SLKGGGILDM ELAYESQEPI
TAVTADADAI YVAKGRTVVR YLVKDGTQTP FYTHPSKTVT GLAMTPAGLV VSTGREIVLA
GRGGALEIMA SSAHRIAMAG ETLYVFFNSS LGVLALDNLA DLGRFNLAVR PVTPGEAQSP
LAVSSVSFFE SDSLQNTHGF SESFDRKAVR RIVAQIELDP ASLAGSRGDH VVTLSWHEPN
GGMLKSTSHQ VTKQSGGRIF ATLGGKAQRG YSPPHWTNNG EAFLRYTDEL ANNYPGRYRM
LVQVDGIAAG EWSFILTGQA TAEQAISRDD LQALKTMLDQ GLSARSKSED GEPLVTTAVR
FGSVRAVQLL LERGADANAT DKEGYPSLAR AEYAGDWRTK AELLLRHGAN VNAPRYPGGP
PLVETFSADF TLFMLQNGAD FRYETKYGKR SVLGEMYDST CTEEILSLLI KRGADLNETT
SFVHFSPLGR AIYSGNERCT QLLLEKGAST AVVQKEPNRR PRSALYVAFE NLDRSSDPKD
KAARRRIVRL LLQKGATFKP GKKLSTSAYF DLPKDDYIRQ MDETASIVAG EGSLMFLGEG
PTFFEQVDMI GMLEQQDAAL ETATKSKDPA IRELALSTHL GRVRELTAKA RDQYDMGFQV
HKHCEQAFQL SEAQYRPAQV DVVPELQQPP PGGQGKSQLG VKLLKRAAGG AYVQGVMPGG
PAERAGLKTG DIILALDTQK MKDADEVAAT AARLAPGMPV RVTFLRDEPL RMPDLQLSCG
LVETEYKDHW GYAEMNLTRW LAAHPDAAAS AEVRARLKQI TSGVRKQLPI