Gene Information |
Locus tag | GM21_2128 |
Symbol | |
ID | 8137464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2483768 |
End bp | 2485312 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869743 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_003021938 |
Protein GI | 253700749 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
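The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a small sketch. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration; the sketch assumes 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for GM21_2128.
length = gene_length(2483768, 2485312)
print(length)                  # 1545
print(protein_length(length))  # 514
```

Both results match the "Gene Length" and "Protein Length" fields in the record.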
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.0000251865 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGATCAAAC CTTTGCTGAT GCTCCTCGCC TTGTTGGTAC CAACCATTTT GCCGGCAACG GCGCCTGGCG CCGAAACTGC CGCCGCGAAA ACGGCGGAAA GCGCCGCCAA GACGGAATCT CTGCCGGGCA CGGCGGCCAC TGGGCGGACG GTCACAACCG CACACAGCAC GCTCATCGCC GGCAAAGAGA TCAAATACCT TGCCACCGCA GGAGAACTCC CGCTGATAAA CGAAGCCGGA GAGACCGAGG CGCAGATCTT CTACGTCTCC TACAGCGTCG AGAAACCCGA TACCAGGCGG CCGTTGCTGT TCGTTTTCAA CGGCGGACCG GGAGCGGCCG CGGTATGGCT GCACCTGGGC GCCATGGGGC CCAGGCGGGT ACAGATGCTC CCCGACGGCA ACATGCCTTC GCCCCCCTTC CAACTGGTAG ACAACGAACA GGGGTGGCTC GATCTTGCCG ACCTCGTCTT CGTCGACCCC GTCGGCACCG GCTACAGCCG GGCGGCCAAA CCGGAACTAA CCAAGAAGTT CAGCGCCGTC CAAGCCGACA TCGACTCCCT GACGCGGTTC ATCATGCTTT ACCTGGGGAA GACCCAGCGT TGGGGAAGCC CGCTCTTCTT GGCCGGCGAG AGCTACGGGA GCTTCCGGTC GGCAGCGCTT TCCGAATCCC TTGTTGAGCA CGGCATCGCC TTGAACGGGG TGTTGCTGAT ATCCTCCATC CTCAACCTGC AGACCGTCGC CTTCGACTTC GGCAACGATC TCCCCTACCC CCTCTTCCTT CCCAGCTACA CGGCAACGGC CTGGTATCAC AAGAAGCTCG CGCCCGAGCT GCAAGAAGAC CTGGAACGAA CCCTGGCAGA CGCTGAAAAA TGGGCGGCGA GCGATTACCT GACGGCGCTG AACCAGGGGG ACCGTCTCGA TCCGGCGGCA CGCCGCGCCG TGGCCGAAAA GCTCGCCCGC TTCACGGGGC TCGGCGTCAG CTTCGTGGAG AACCGCAACC TGCGTATCGA AAGCCGGGAC TTCGCAACCG AACTGCTGCG GGGCGACGGC AGGATCACCG GCATCATGGA CACCCGCTTC AGCGCCCCGA ACACCGACCC CAACAAGGGG ATTCCCTTCG ACCCGACGGT GAGCACCATA CGCTCGCCGT TCACCTCCAC CGTCAACCTC TACCTGCGGA ATGAACTCAA GTACCACAGC GACCTGGAAT ACTTCGTCCT AGGGGGAGGC ATCGGGCGGT GGGATTGGGA GGCGAAGAAC AGTTATGCCG ACACGAGCGA GAACCTCCGA AACGCCATGG CGAAGAACCG CTACCTCGGC GTGTTCGTCG CCTCAGGGCT CTTCGACCTG GCGACCCCGC ATTCCGCGAC CGACTACACC GTGGCGCACC TCGGGGTCGC ACCTGAGTTG AAGAAGAACA TCACAGTGCG CCGTTACCGC TCGGGGCACA TGATGTATCT GGAAAAGGAG TCGTTGGCTC AGCTGAAAAA GGATGCGGCC GAGTTCATCG GGAACGCGTT GAGGAGAAGC GCTGCAGGGA GGTAG
|
Protein sequence | MIKPLLMLLA LLVPTILPAT APGAETAAAK TAESAAKTES LPGTAATGRT VTTAHSTLIA GKEIKYLATA GELPLINEAG ETEAQIFYVS YSVEKPDTRR PLLFVFNGGP GAAAVWLHLG AMGPRRVQML PDGNMPSPPF QLVDNEQGWL DLADLVFVDP VGTGYSRAAK PELTKKFSAV QADIDSLTRF IMLYLGKTQR WGSPLFLAGE SYGSFRSAAL SESLVEHGIA LNGVLLISSI LNLQTVAFDF GNDLPYPLFL PSYTATAWYH KKLAPELQED LERTLADAEK WAASDYLTAL NQGDRLDPAA RRAVAEKLAR FTGLGVSFVE NRNLRIESRD FATELLRGDG RITGIMDTRF SAPNTDPNKG IPFDPTVSTI RSPFTSTVNL YLRNELKYHS DLEYFVLGGG IGRWDWEAKN SYADTSENLR NAMAKNRYLG VFVASGLFDL ATPHSATDYT VAHLGVAPEL KKNITVRRYR SGHMMYLEKE SLAQLKKDAA EFIGNALRRS AAGR
|
| |
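The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that display formatting and run basic sanity checks follows. The helper names (`clean_sequence`, `gc_fraction`, `has_start_and_stop`) are hypothetical, and the start codon set is an assumed subset of the initiators allowed by translation table 11 (ATG, GTG, TTG), not the full list.

```python
# Assumed subset of table 11 start codons; the full table permits more.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons
    are in the start and stop sets defined above."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, as displayed in the record.
prefix = clean_sequence("TTGATCAAAC CTTTGCTGAT")
print(prefix)       # TTGATCAAACCTTTGCTGAT
print(prefix[:3])   # TTG
```

Applied to the full gene sequence, `gc_fraction` could be compared against the "GC content" field, and `has_start_and_stop` reflects that the sequence as listed begins with TTG and ends with TAG.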