Gene Information |
Locus tag | Gbem_3867 |
Symbol | |
ID | 6780731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter bemidjiensis Bem |
Kingdom | Bacteria |
Replicon accession | NC_011146 |
Strand | - |
Start bp | 4404147 |
End bp | 4406069 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642769862 |
Product | peptidase C1A papain |
Protein accession | YP_002140655 |
Protein GI | 197120228 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
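The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4404147
end_bp = 4406069
gene_length_bp = 1923
protein_length_aa = 640

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS is a whole number of codons.
# Assumption: the final codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(remainder == 0)                                # length is a multiple of 3
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks agree with the values listed in the record.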
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC CGAAACTGGT GACGAAGGAT GGACGGAAGT TGTACGTCAG GCCGGACACG CTGGATTTCA GGGACAGGAT GTTCGTGCCG ACGCTGTTGG AGGTCCCGAT GCGGATGGAG CTGGAAAGCT ACCTCGAATA CGAGGTTCCC ATCCTGGACC AGGGAACCGA AGGGGCCTGC ACTGGCTTCG GCCTGGCCAC GAACGTCAAC TACCTTTTGC GCAAACGCAG GGTTATCCCG GACACGGTCA GCGTCAGCCC CTGGATGTTG TACCAGCTGG CCAAGCGCTA CGACGAGTGG CCTGGGGAGA ACTACGAGGG GTCGAGCGCG AGGGGCGCCA TGAAGGGGTG GCACAAGCAC GGGGTCTGTG CCACCGGCCT GTGCGAGAAG GGGGGACGCC TGAGCGAGGA GGCGCTGAAG GACGCTCCCA AGCGCTCGCT GGGTGCCTAT TTCCGGGTGA ACCACAAGGA CCTGGTGGCG ATGCACAGCG CCCTGGCTGA GGTGGGGATC CTCTTCGCCA CCTCGAGCGT TCACAGCGGC TGGGAAAACG TCGGCTCCGA CGGGGTGATC AAGAGATCGG ACACCGTCAT AGGTGGGCAC GCGTTCGCCA TCGTGGGCTA CGACGAGGAC GGCTTCTGGA TGCAGAACTC CTGGGGAAAG GACTGGGGCA AGGGGGGCTT CGCCCGCATC GGCTATGACG ACTGGCTTGC CAACGGCGAC GACGCCTGGG TGGCGCGGCT GGGGGTGCCG ATGAACCTCC ATGCGCCGGA GTCGACGGCC ATCGGGAGCT CCGCGGCGAT CAGCCATTCC GGCGCCTACT CCAGCACGGA ACTGCGTCGC CATATCGTCA GCATCGGCAA CAACGGGGTG CTGCGGCCGG GGGGAAGCTA CGGCACCACC AGGCACGACG TGGAGGACAT CTTCGCCAGG GACTTCCCTG CGCTCACGGC GGGGTGGAAC AAGAAAAGGC TGCTCCTTTA CGCGCACGGG GGCCTTGTGG ACGAGGCGTC GGCGGTGCAG CGGGTGGCCG AATACTGCAC CCAGCTCCTG AAGGCGGAGA TATATCCACT GGCCTTCATC TGGCACAGCG ACATGTTCAC CACCATCTGC AACATCCTCA CCGATGCCAT GCACAAGCGG AGGTCGGAAG GGTTCCTGGA CGACAGCAAG GACTTCATGC TGGACCGCCT GGACGACGCA CTGGAACCGG TGGCGCGGCT GGCAGGAAAG CCGCTTTGGA GCGAGATGAA GCAAAACGCG CTCGCTGCCG GGACCGCCGA GGAGGGAGGC GCACGCTTGG CTCTCGAGCA GATCAAGCGG CTCCCCGCCG ATGTGGAGAT CCACCTGGTC GGGCACAGCG CCGGATCGGT CTTCCACGCC CCGGTGGTCG AGGGGCTGGC GAAGATGGGG CGCCCGATCA AGAGCTGCAC CCTCTGGGCG CCGGCATGCA CCACGGCGCT CTTCAAGCAG AGCTACCTCC CTTCCATCGA CTGCGGCCAC ATCGGGTGCT TCACCCTTTT CACCTTGAAC GACAAGGCGG AGCAGTGCGA CAACTGCGCG CGCATCTACA ACAAGTCGCT CTTGTACCTG GTGTCGAACG CCTTCGAGGC TGAGCCGCAC ATCCCGCTCT TCAAGGACGG CGTGCCGCTA TTGGGGCTGG AGCGCTGCAT CGAGAGCGAC TCAAGGCTTA GCGATCTCTT CTCCGCAAAG AAAGCGGACT GGGTGAGAGC CCCGAACGAC CTGAAGGACT CCCCCTGCGA CTATTCAACG GCCCGCCATC ATGGGGATTT CGATGACGAC 
CAGGCGACGG TCAAGGCGAC CCTTGCGCGC ATGCTGGGGG GGAAAGCGGA ACTCAAAGGG GAGTTCCGCT TCGAGGTCAC CAAGTCGTCC TCGCGCCAGA GGCGGGCGAA CATTTCGCGG TGA
|
Protein sequence | MSKPKLVTKD GRKLYVRPDT LDFRDRMFVP TLLEVPMRME LESYLEYEVP ILDQGTEGAC TGFGLATNVN YLLRKRRVIP DTVSVSPWML YQLAKRYDEW PGENYEGSSA RGAMKGWHKH GVCATGLCEK GGRLSEEALK DAPKRSLGAY FRVNHKDLVA MHSALAEVGI LFATSSVHSG WENVGSDGVI KRSDTVIGGH AFAIVGYDED GFWMQNSWGK DWGKGGFARI GYDDWLANGD DAWVARLGVP MNLHAPESTA IGSSAAISHS GAYSSTELRR HIVSIGNNGV LRPGGSYGTT RHDVEDIFAR DFPALTAGWN KKRLLLYAHG GLVDEASAVQ RVAEYCTQLL KAEIYPLAFI WHSDMFTTIC NILTDAMHKR RSEGFLDDSK DFMLDRLDDA LEPVARLAGK PLWSEMKQNA LAAGTAEEGG ARLALEQIKR LPADVEIHLV GHSAGSVFHA PVVEGLAKMG RPIKSCTLWA PACTTALFKQ SYLPSIDCGH IGCFTLFTLN DKAEQCDNCA RIYNKSLLYL VSNAFEAEPH IPLFKDGVPL LGLERCIESD SRLSDLFSAK KADWVRAPND LKDSPCDYST ARHHGDFDDD QATVKATLAR MLGGKAELKG EFRFEVTKSS SRQRRANISR
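The gene and protein sequences above can be spot-checked against each other by translating the first few codons. This is a minimal sketch, not a full translator: the `CODONS` dictionary is a hand-written partial table covering only the codons in this prefix (these assignments are the same in translation table 11 and the standard code), and `translate` is a hypothetical helper name.

```python
# Partial codon table: only the codons that occur in the prefix below.
CODONS = {
    "ATG": "M", "AGC": "S", "AAG": "K", "CCG": "P", "AAA": "K",
    "CTG": "L", "GTG": "V", "ACG": "T", "GAT": "D",
}
STOP_CODONS = {"TAA", "TAG", "TGA"}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon using the partial table above."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# First three 10-base blocks of the gene sequence, with display spaces removed.
gene_prefix = "ATGAGCAAGC CGAAACTGGT GACGAAGGAT".replace(" ", "")
# First 10-residue block of the protein sequence.
protein_prefix = "MSKPKLVTKD"
# Last three bases of the gene sequence.
gene_last_codon = "TGA"

translated = translate(gene_prefix)
print(translated == protein_prefix)    # prefix translation matches the protein
print(gene_last_codon in STOP_CODONS)  # gene ends with a stop codon
```

Checking the whole record would need a complete codon table for translation table 11, for example from a sequence-analysis library.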