Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3953 |
Symbol | |
ID | 8139327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4535781 |
End bp | 4536926 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871569 |
Product | peptidase C1A papain |
Protein accession | YP_003023727 |
Protein GI | 253702538 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.00000017791 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTACCC CCTGTTTGAT TTGGCGCTTT GGAAGGCACC CCTTCTCCAG CACGGAACTG CGCCGGCACA TCGTGAGCAT CGGCAACAAC GGCGTGCTGC GGCCGGGGGG AAGCTACGGC ACCACCAGGC ACGACGTGGA GGACATCTTC GCGAGGGACT TCCCTGCGCT CACGGCGGGG TGGAAAAGGA AAAGGCTGCT CCTATACGCG CACGGGGGGC TTGTGGACGA GGCGTCGGCG GTGCAGCGGG TGGCCGAGTA CCGCACCGAG CTCTTGAAGG CGGAGATCTA CCCCCTGGCC TTCATCTGGC ACAGCGACAT GTTCACCACC ATCACCAACA TCCTCACCGA TGCCATGAGA AAGCGGAGGT CGGAAGGGTT CCTCGACGAC AGCTTGGATT TCATGCTGGA CCGCCTGGAC GACGCACTGG GGCCGGTGGC GCGGCTGGCA GGAAAGCCGC TTTGGAGCGA GATGAAGCAA AACGCGCTCG CGGCCGGGAC CGGTGAGGAG GGAGGCGCGC GCGTGGTCCT GGAGCAGATC AATGGGCTAC CCGCCGATGT GGAGATCCAC ATAGTGGGGC ACAGCGCCGG ATCGATCTTC CACGCGCCGG TGGTCGAGGG GCTGGCGAAG ACGGGGCGCC CGATCAAGAG CTGCATCCTC TGGGCGCCGG CCTGCACCAC GGCGCTCTTC AAGCAGAGCT ATCTCCCCTC CATAGACAGC GGCCACATCG GGCGCTTCAC CCTCTTCACC TTGAACGACA AGGCGGAGCA GTGCGACAAC TGCGCGCGCA TCTACAACAA GTCGCTCCTG TACCAGGTGT CGAACGCGTT CGAGGCCTGG CCGCACATCC CGCTCTTCAA GGACGGGGTG CCGCTATTGG GGCTGGAGCG CTGCATCGAG AGCGACTCAA GGCTCAGGGA TCTCTTCTCC GGCAAGAACG CGGACTGGGT CAGGGCTCCG AACGACCTGA AGGACTCCCC CTGCGACTAT TCCACGGCCC GCCACCATGG GGATTTCGAT GACGATCAGG CCACGGTCAG GGCGACCCAG GCGCGCATGC TGGGTAAAAC GGAACTGAAG GGGGAATTCA GCTTCGAGGT CACCAAGTCG TCCTCGCGCC GGAGACGGGC GAACCTCTCG CGGTGA
|
Protein sequence | MSTPCLIWRF GRHPFSSTEL RRHIVSIGNN GVLRPGGSYG TTRHDVEDIF ARDFPALTAG WKRKRLLLYA HGGLVDEASA VQRVAEYRTE LLKAEIYPLA FIWHSDMFTT ITNILTDAMR KRRSEGFLDD SLDFMLDRLD DALGPVARLA GKPLWSEMKQ NALAAGTGEE GGARVVLEQI NGLPADVEIH IVGHSAGSIF HAPVVEGLAK TGRPIKSCIL WAPACTTALF KQSYLPSIDS GHIGRFTLFT LNDKAEQCDN CARIYNKSLL YQVSNAFEAW PHIPLFKDGV PLLGLERCIE SDSRLRDLFS GKNADWVRAP NDLKDSPCDY STARHHGDFD DDQATVRATQ ARMLGKTELK GEFSFEVTKS SSRRRRANLS R
|
| |