Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2391 |
Symbol | |
ID | 5323252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2467002 |
End bp | 2468108 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640791329 |
Product | peptidase A1 pepsin |
Protein accession | YP_001328058 |
Protein GI | 150397591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.116822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGG GTGGCGTTAG ATCTACCGTC GAAGCAGAGA ATACACTGGT GTTCGAACTT CGCCGCGGCG CGATTACGAA TAACGGCGCC ACACCATGGT GGGTTTCCGC CACTGTTGGG ACGAGCACCG TACCGCAGAC TTTCAAGTTC ATGATGGACA CCGGAACAAC CAACACATGG GTTACGGCGA AGAGCTGCTC CACCAGTGAA TGCACACAGC ACGATTCCTT CGACGAAACC GCGTCCTCGA CCTTCGTCTG GCAAGACCAG TCACTGACAC CCATCGACTT CAGCGGTTGG GGGAACATGG ACGTCAACCT CGGAAATGAC GTATGGGTTC TGTCTGACAA GAACGCCATC GACCAGGACG TCACCGTCGA CTTCTGGCTC AGCCAGTCCT ATTCCGGGGC CCGTTTTGGC GAACTTATCC AGGATGGGTT CATCGCCTGC GGTCCCTACG GTGACAATTC CCGGTCGAAT TTGATTTTCG ACGCGCTGTG GTACTCCGGT GCACTGGCTT CGCCGTGGAT GTCCTATTGG GTCGATTACA CGATCGACGG TGCAGCAGTC GACAGCGGCG AGATGATTTT TGGCGGCAGC AATGCCGACA AATACGACCC GGACACTGTT ATCTCCCTCG ATATCGTACC GGGAATCTGG ACTCACATGG GCGATTCGAT CGTCGTTGAC GGTGCCGAGA TATTTACGTC TCCCGCACTG CTGATCGACA CGGGTGCCTC TGAAATCAAA GGAGAGCAGG CGGAGATCGA TAAATTGATT GCCGCAGTAA CGCTGGACGG GCAGCTGCCG CTTTCTCCCG ACAACCCGGA TCACTATGCC TATCCCGACC TGGTCTTCAA TCTTGGAAAG ACCAGCGACG GCAGCACCGG GCAGTTGGTC TTCCCGCCCC GCGCCTATTT CAACTATATC GAGGCAGGCA CAGACCAAGG AAAATGGCAG ATTGCGATGA CCGTTCTTGA AGACCTTGGA GAAAACTCCC TGATCTTCGG CCGCAATCTT CTCGATCGGC TGTATTCGGA GTGGGAATAC GACACCTCGG GCACGTCGGT CGCCGGGAAA ACCATCCGGC TGGCACAACG GGTCTAG
|
Protein sequence | MRKGGVRSTV EAENTLVFEL RRGAITNNGA TPWWVSATVG TSTVPQTFKF MMDTGTTNTW VTAKSCSTSE CTQHDSFDET ASSTFVWQDQ SLTPIDFSGW GNMDVNLGND VWVLSDKNAI DQDVTVDFWL SQSYSGARFG ELIQDGFIAC GPYGDNSRSN LIFDALWYSG ALASPWMSYW VDYTIDGAAV DSGEMIFGGS NADKYDPDTV ISLDIVPGIW THMGDSIVVD GAEIFTSPAL LIDTGASEIK GEQAEIDKLI AAVTLDGQLP LSPDNPDHYA YPDLVFNLGK TSDGSTGQLV FPPRAYFNYI EAGTDQGKWQ IAMTVLEDLG ENSLIFGRNL LDRLYSEWEY DTSGTSVAGK TIRLAQRV
|
| |