Gene Gbem_3867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbem_3867 
Symbol 
ID6780731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter bemidjiensis Bem 
KingdomBacteria 
Replicon accessionNC_011146 
Strand
Start bp4404147 
End bp4406069 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content64% 
IMG OID642769862 
Productpeptidase C1A papain 
Protein accessionYP_002140655 
Protein GI197120228 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC CGAAACTGGT GACGAAGGAT GGACGGAAGT TGTACGTCAG GCCGGACACG 
CTGGATTTCA GGGACAGGAT GTTCGTGCCG ACGCTGTTGG AGGTCCCGAT GCGGATGGAG
CTGGAAAGCT ACCTCGAATA CGAGGTTCCC ATCCTGGACC AGGGAACCGA AGGGGCCTGC
ACTGGCTTCG GCCTGGCCAC GAACGTCAAC TACCTTTTGC GCAAACGCAG GGTTATCCCG
GACACGGTCA GCGTCAGCCC CTGGATGTTG TACCAGCTGG CCAAGCGCTA CGACGAGTGG
CCTGGGGAGA ACTACGAGGG GTCGAGCGCG AGGGGCGCCA TGAAGGGGTG GCACAAGCAC
GGGGTCTGTG CCACCGGCCT GTGCGAGAAG GGGGGACGCC TGAGCGAGGA GGCGCTGAAG
GACGCTCCCA AGCGCTCGCT GGGTGCCTAT TTCCGGGTGA ACCACAAGGA CCTGGTGGCG
ATGCACAGCG CCCTGGCTGA GGTGGGGATC CTCTTCGCCA CCTCGAGCGT TCACAGCGGC
TGGGAAAACG TCGGCTCCGA CGGGGTGATC AAGAGATCGG ACACCGTCAT AGGTGGGCAC
GCGTTCGCCA TCGTGGGCTA CGACGAGGAC GGCTTCTGGA TGCAGAACTC CTGGGGAAAG
GACTGGGGCA AGGGGGGCTT CGCCCGCATC GGCTATGACG ACTGGCTTGC CAACGGCGAC
GACGCCTGGG TGGCGCGGCT GGGGGTGCCG ATGAACCTCC ATGCGCCGGA GTCGACGGCC
ATCGGGAGCT CCGCGGCGAT CAGCCATTCC GGCGCCTACT CCAGCACGGA ACTGCGTCGC
CATATCGTCA GCATCGGCAA CAACGGGGTG CTGCGGCCGG GGGGAAGCTA CGGCACCACC
AGGCACGACG TGGAGGACAT CTTCGCCAGG GACTTCCCTG CGCTCACGGC GGGGTGGAAC
AAGAAAAGGC TGCTCCTTTA CGCGCACGGG GGCCTTGTGG ACGAGGCGTC GGCGGTGCAG
CGGGTGGCCG AATACTGCAC CCAGCTCCTG AAGGCGGAGA TATATCCACT GGCCTTCATC
TGGCACAGCG ACATGTTCAC CACCATCTGC AACATCCTCA CCGATGCCAT GCACAAGCGG
AGGTCGGAAG GGTTCCTGGA CGACAGCAAG GACTTCATGC TGGACCGCCT GGACGACGCA
CTGGAACCGG TGGCGCGGCT GGCAGGAAAG CCGCTTTGGA GCGAGATGAA GCAAAACGCG
CTCGCTGCCG GGACCGCCGA GGAGGGAGGC GCACGCTTGG CTCTCGAGCA GATCAAGCGG
CTCCCCGCCG ATGTGGAGAT CCACCTGGTC GGGCACAGCG CCGGATCGGT CTTCCACGCC
CCGGTGGTCG AGGGGCTGGC GAAGATGGGG CGCCCGATCA AGAGCTGCAC CCTCTGGGCG
CCGGCATGCA CCACGGCGCT CTTCAAGCAG AGCTACCTCC CTTCCATCGA CTGCGGCCAC
ATCGGGTGCT TCACCCTTTT CACCTTGAAC GACAAGGCGG AGCAGTGCGA CAACTGCGCG
CGCATCTACA ACAAGTCGCT CTTGTACCTG GTGTCGAACG CCTTCGAGGC TGAGCCGCAC
ATCCCGCTCT TCAAGGACGG CGTGCCGCTA TTGGGGCTGG AGCGCTGCAT CGAGAGCGAC
TCAAGGCTTA GCGATCTCTT CTCCGCAAAG AAAGCGGACT GGGTGAGAGC CCCGAACGAC
CTGAAGGACT CCCCCTGCGA CTATTCAACG GCCCGCCATC ATGGGGATTT CGATGACGAC
CAGGCGACGG TCAAGGCGAC CCTTGCGCGC ATGCTGGGGG GGAAAGCGGA ACTCAAAGGG
GAGTTCCGCT TCGAGGTCAC CAAGTCGTCC TCGCGCCAGA GGCGGGCGAA CATTTCGCGG
TGA
 
Protein sequence
MSKPKLVTKD GRKLYVRPDT LDFRDRMFVP TLLEVPMRME LESYLEYEVP ILDQGTEGAC 
TGFGLATNVN YLLRKRRVIP DTVSVSPWML YQLAKRYDEW PGENYEGSSA RGAMKGWHKH
GVCATGLCEK GGRLSEEALK DAPKRSLGAY FRVNHKDLVA MHSALAEVGI LFATSSVHSG
WENVGSDGVI KRSDTVIGGH AFAIVGYDED GFWMQNSWGK DWGKGGFARI GYDDWLANGD
DAWVARLGVP MNLHAPESTA IGSSAAISHS GAYSSTELRR HIVSIGNNGV LRPGGSYGTT
RHDVEDIFAR DFPALTAGWN KKRLLLYAHG GLVDEASAVQ RVAEYCTQLL KAEIYPLAFI
WHSDMFTTIC NILTDAMHKR RSEGFLDDSK DFMLDRLDDA LEPVARLAGK PLWSEMKQNA
LAAGTAEEGG ARLALEQIKR LPADVEIHLV GHSAGSVFHA PVVEGLAKMG RPIKSCTLWA
PACTTALFKQ SYLPSIDCGH IGCFTLFTLN DKAEQCDNCA RIYNKSLLYL VSNAFEAEPH
IPLFKDGVPL LGLERCIESD SRLSDLFSAK KADWVRAPND LKDSPCDYST ARHHGDFDDD
QATVKATLAR MLGGKAELKG EFRFEVTKSS SRQRRANISR