Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2758 |
Symbol | |
ID | 8138101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3203136 |
End bp | 3206168 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870363 |
Product | PDZ/DHR/GLGF domain protein |
Protein accession | YP_003022552 |
Protein GI | 253701363 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 157 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCAG TTCGGCGAGT AATTAAGCGG CAGATCGCAC TCCCGGCAAT CGCCGCACAG GTTTGTCTCC TTGTTCTGGT CATGACGGGA ACGGCCGCCG CCAAGAGTGC TTCCCGACCG TCCGCCGTCG TCAAACCACC GGCCGCGGTC AAAGCGCCGC CCGCGGCATC TCCCGCGGCG ACGGCGCCTG CTGCGACGGC CGAACGGGAA CCCGCGCCTG CGCTCTTCGG AGATGCCACC TTCCTGCTCC CCCGAGGGGC GAAGGTCGCG GCGCGTTGGG TGCTGCCACC GCTGGAGAAG GGGAGAGACG CCATCTTCAG CGTCGATGCC GCGGGTTCGC CCGTCATCGC GTTCAGTGGC GACTCGAACT ATCACCTGCT CCATCCCGAC AGAAACTACG TGGTGGCCGT GAGAGCTACC ATTTCGGGGA TGACCCATCT CGCCAACGGC GTGCTCCTGC TGTCGTCGGG CAACGACCTG CTCCTCGTTG CCGAACCGAA GGAGAAGACT CTGGACAAAA AGGGGGTTCC CTATGCCGCG CTGCAGCCCC TGACCAAGAT CCCCCTGCGC AAGATCGAGG TGCTGACCAG CGTCGGCACC ACGGTCTACT GCGCCGGCAT CGATGCCCGC AGCGGCCGCC ATGCCCTTTA CCTGCTCCGC TCCCTCAAGG GGGGAGGCAT TCTCGATATG GAACTGGCGT ACGAGTCACA AGAGCCCATT ACCGCGGTTA CCGCAGATGC AGATGCCATC TACGTCGCCA AGGGGCGCAC GGTGGTTCGG TATCTCGTCA AGGATGGGAC GCAGACCCCT TTCTATACCC ACCCCTCAAA GACCGTCACC GGGCTGGCCA TGACACCGGC CGGACTCGTC GTCAGCACCG GCAGGGAGAT CGTCCTAGCC GGCCGGGGCG GTGCGCTGGA GATCATGGCC TCGTCGGCGC ACCGCATCGC CATGGCCGGC GAAACTCTCT ATGTCTTCTT CAACAGCTCC CTGGGGGTGC TGGCCCTCGA CAACCTCGCG GACCTCGGGC GCTTCAACCT GGCAGTCAGG CCGGTGACTC CCGGCGAGGC GCAATCCCCT CTAGCCGTCA GCAGCGTCAG CTTCTTCGAG AGCGATTCCC TTCAAAACAC CCACGGGTTC TCGGAGAGCT TCGACCGCAA AGCGGTGCGG CGGATCGTGG CCCAGATCGA ATTGGACCCG GCCTCACTGG CCGGAAGCCG GGGGGATCAT GTCGTCACGC TCTCCTGGCA CGAGCCAAAC GGCGGCATGC TCAAAAGCAC CAGCCACCAG GTGACAAAGC AGTCCGGCGG CCGGATCTTC GCCACTCTCG GGGGCAAGGC ACAGCGTGGA TATTCTCCTC CCCATTGGAC CAACAACGGA GAGGCATTTC TTCGCTATAC CGATGAACTG GCGAACAACT ACCCCGGACG CTATCGCATG CTGGTGCAGG TGGATGGGAT CGCTGCCGGG GAATGGTCCT TTATCCTCAC CGGTCAGGCC ACTGCCGAAC AGGCCATTTC CCGTGACGAC CTGCAAGCGC TCAAAACCAT GCTCGATCAG GGGCTCAGTG CCCGGAGCAA AAGCGAGGAT GGCGAGCCGC TCGTCACCAC GGCGGTTCGG TTTGGAAGCG TTCGGGCAGT GCAGTTGCTT CTGGAGCGGG GCGCCGATGC CAATGCCACA GACAAGGAAG GATATCCGTC GCTGGCGCGG GCGGAGTACG CCGGCGATTG GCGGACCAAG GCTGAACTCC TGTTGCGTCA CGGCGCCAAC GTCAACGCAC CGAGGTACCC GGGCGGGCCG CCGCTGGTGG AGACCTTTTC CGCCGACTTT ACCCTCTTCA TGCTGCAAAA CGGGGCCGAT TTCCGGTACG AAACCAAATA CGGCAAGCGC AGCGTGTTGG GCGAGATGTA CGACTCGACT TGCACCGAGG AGATTCTGTC GCTCCTGATC AAGCGCGGGG CCGATCTGAA CGAAACCACC TCGTTTGTAC ACTTCTCCCC TCTGGGCAGA GCCATCTACA GCGGCAACGA ACGCTGCACC CAGCTGCTTC TGGAAAAAGG GGCGTCGACC GCCGTCGTCC AGAAGGAGCC GAACAGGCGG CCCCGTTCCG CGCTGTACGT GGCCTTCGAA AATCTCGACC GCAGTTCTGA CCCCAAGGAT AAAGCCGCGC GACGGCGCAT CGTCCGCCTG CTGCTGCAAA AAGGCGCCAC CTTCAAACCG GGGAAAAAGC TCTCCACATC CGCATACTTC GATCTTCCCA AGGACGATTA CATCAGGCAG ATGGATGAAA CCGCCTCCAT CGTCGCGGGA GAGGGGAGCC TCATGTTCCT GGGCGAGGGG CCGACCTTCT TCGAGCAGGT GGACATGATC GGGATGCTGG AACAGCAAGA TGCGGCACTG GAGACGGCTA CCAAATCCAA AGACCCGGCG ATCCGGGAGT TGGCGCTTAG CACGCACCTT GGTCGGGTCC GGGAGTTGAC GGCCAAGGCG AGGGATCAGT ACGACATGGG CTTTCAGGTT CACAAGCACT GTGAGCAGGC ATTCCAGCTG TCCGAGGCGC AGTATCGCCC GGCCCAGGTG GATGTCGTGC CCGAACTGCA GCAACCTCCG CCGGGTGGTC AGGGGAAATC CCAGCTCGGA GTCAAACTGC TGAAACGTGC GGCGGGAGGC GCCTACGTAC AGGGGGTAAT GCCGGGAGGC CCCGCCGAAC GGGCCGGGCT GAAGACGGGC GATATCATCC TCGCCCTGGA TACCCAGAAG ATGAAGGATG CCGACGAAGT TGCGGCTACG GCCGCCCGCC TCGCACCCGG TATGCCGGTG CGGGTGACCT TTCTGCGCGA CGAACCGCTG CGGATGCCGG ACCTTCAGCT CAGCTGCGGA CTGGTGGAGA CGGAATACAA GGACCACTGG GGATACGCCG AGATGAACCT CACCCGCTGG CTGGCCGCGC ATCCCGACGC TGCCGCCTCC GCAGAGGTGC GCGCGCGGCT CAAGCAAATT ACTTCGGGGG TGCGGAAGCA GCTTCCCATT TAA
|
Protein sequence | MDPVRRVIKR QIALPAIAAQ VCLLVLVMTG TAAAKSASRP SAVVKPPAAV KAPPAASPAA TAPAATAERE PAPALFGDAT FLLPRGAKVA ARWVLPPLEK GRDAIFSVDA AGSPVIAFSG DSNYHLLHPD RNYVVAVRAT ISGMTHLANG VLLLSSGNDL LLVAEPKEKT LDKKGVPYAA LQPLTKIPLR KIEVLTSVGT TVYCAGIDAR SGRHALYLLR SLKGGGILDM ELAYESQEPI TAVTADADAI YVAKGRTVVR YLVKDGTQTP FYTHPSKTVT GLAMTPAGLV VSTGREIVLA GRGGALEIMA SSAHRIAMAG ETLYVFFNSS LGVLALDNLA DLGRFNLAVR PVTPGEAQSP LAVSSVSFFE SDSLQNTHGF SESFDRKAVR RIVAQIELDP ASLAGSRGDH VVTLSWHEPN GGMLKSTSHQ VTKQSGGRIF ATLGGKAQRG YSPPHWTNNG EAFLRYTDEL ANNYPGRYRM LVQVDGIAAG EWSFILTGQA TAEQAISRDD LQALKTMLDQ GLSARSKSED GEPLVTTAVR FGSVRAVQLL LERGADANAT DKEGYPSLAR AEYAGDWRTK AELLLRHGAN VNAPRYPGGP PLVETFSADF TLFMLQNGAD FRYETKYGKR SVLGEMYDST CTEEILSLLI KRGADLNETT SFVHFSPLGR AIYSGNERCT QLLLEKGAST AVVQKEPNRR PRSALYVAFE NLDRSSDPKD KAARRRIVRL LLQKGATFKP GKKLSTSAYF DLPKDDYIRQ MDETASIVAG EGSLMFLGEG PTFFEQVDMI GMLEQQDAAL ETATKSKDPA IRELALSTHL GRVRELTAKA RDQYDMGFQV HKHCEQAFQL SEAQYRPAQV DVVPELQQPP PGGQGKSQLG VKLLKRAAGG AYVQGVMPGG PAERAGLKTG DIILALDTQK MKDADEVAAT AARLAPGMPV RVTFLRDEPL RMPDLQLSCG LVETEYKDHW GYAEMNLTRW LAAHPDAAAS AEVRARLKQI TSGVRKQLPI
|
| |