Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5000 |
Symbol | |
ID | 5318649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1515366 |
End bp | 1517090 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640776782 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_001313714 |
Protein GI | 150377118 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00410599 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGCTT ATAGTTGCTC GAAAACGCAT GCATTTTTTA TTCATTTTCA CCATTTTTTT TGGCCGGAAA CGTCCCGGCT CCCGGAGGAA ATCATGGCTA CAATTACGTA TCATTTCCCA GCAGGACTTT ACCCAAATCA ATACAACCCA CCGTTGAGTG GCGTCAGCGT TAACCCGTCC TTTGGCGAGC TCGTGGACAT GAGCACGGGG AGTCGCTCAA CCACTACCAG CACTTCGGTG CTTTATCGGC TCGACAACGG TCTCAAGATG AAGCTCGTGG GCACGGGGTT CAGTTTTGAC GCGAGTGGCG ATGCCGTCGG AGGAACAATC ACATCGATTG AGGTTCTTCT GAACAACGGC ACGGGATTGA TACAAACAAT TAGCGGGCTG AATATTTCGC TTGAACTTTT CCAAGACGCA TCGGCCGCGT TCGATAGTTC TGGACTCGAG AGTTGGCTCG CGAGCGGCAA CGATACGATC AACGGCTCCC TCGGTAATGA TGAAATCTGG GGGCGCCTTG GAAACGATGT CCTCAACGGC AATGCGGGTA ATGACCTTGT CACAGGCGGC GGCGGTGCAG ACACCTACGA TGGAGGAGCC GGTTTTGATA TTCTGAACTT CCAGGACGCT TATGACTCCC CGACAGCGAT AAGAGGCATC AACCTCAACG CCACGGCCGG GACCGTCGTT GATCAGTTCG GTTTTTCGGA AACCTTCCAG AATTTCGAGG AATTCAGAGG AACCCAGTTC TCGGACTCGA TGATGGGGTC CTCGGTCGAT GAGACGTTTT TTGGGTTTGG CGGACGCGAC AATATCAATG GTGGTGCCGG CATCGACACC GTTCGATACG ACCGCGATTT CCAGCGAGGC GCCACAAAGG GTGTGAGCAT CGATCTCAGT ACCGGTATTG CGACCGACGG CTTTGGATCT CGAGATACTC TCACCAGCAT CGAGAATATC CGAGCAACTG AATTCAAGGA TACAATCGTC GGCAGTTCGG CAGCCAACTT CCTCCGCACG TTCGCGGGCA ATGATTCAAT CAATGGCGGT GGCGGCTCAG ACAACATGCG CGGCGGCATA GGAAACGATA CGTATTTCGT AGACAACACC GGCGATATCG TTGACGAAGC CGCCGACTCG GGCGCGGGTA CGGACACCGT TCAATCCACA GTCTCGTTCA ATCTCGGCAA CACCGCGGTC GCCAAAGGTG GCGTGGAGAA CCTTGTGCTC CTTGGCACCG GGAACATAAA CGGCACCGGC AATGCCTTGA ACAACACCCT GAGTGGCAAT ACCGGCAACA ACACTTTCAG CGGCTTTGCC GGCAACGATA CAATTGACGG GGGCCTCGGC AACGACCTGA TCAATGGGGG CCTCGGCAAT GACAGCCTGA CCGGCGGAGC CGGCTTAGAC ACATTTAATT TCAGCAACGC TCTGGATGCC ACGAACAACG TCGACACGAT CAACGGGTTC GTTGTCGCGG ATGATACGAT CCGATTGGAA AATGCGGTCT TTACGGGGAT CGTCGGTACC GGCACGTTGA CTGCGGCGCA GTTCGTAACG AATACCACTG GCCTAGCTGC TGACGCCGAC GACCGAATTA TTTATGATAG CGACACCGGA AGGCTGCTCT ACGACAGCGA TGGCGACGGA GCGGGCGGTT CTGTTCATTT CGCCACGGTC GGCACGAACC TCGGCATGAC TGCGAGCGAT TTCTTTGTTG TGTAG
|
Protein sequence | MAAYSCSKTH AFFIHFHHFF WPETSRLPEE IMATITYHFP AGLYPNQYNP PLSGVSVNPS FGELVDMSTG SRSTTTSTSV LYRLDNGLKM KLVGTGFSFD ASGDAVGGTI TSIEVLLNNG TGLIQTISGL NISLELFQDA SAAFDSSGLE SWLASGNDTI NGSLGNDEIW GRLGNDVLNG NAGNDLVTGG GGADTYDGGA GFDILNFQDA YDSPTAIRGI NLNATAGTVV DQFGFSETFQ NFEEFRGTQF SDSMMGSSVD ETFFGFGGRD NINGGAGIDT VRYDRDFQRG ATKGVSIDLS TGIATDGFGS RDTLTSIENI RATEFKDTIV GSSAANFLRT FAGNDSINGG GGSDNMRGGI GNDTYFVDNT GDIVDEAADS GAGTDTVQST VSFNLGNTAV AKGGVENLVL LGTGNINGTG NALNNTLSGN TGNNTFSGFA GNDTIDGGLG NDLINGGLGN DSLTGGAGLD TFNFSNALDA TNNVDTINGF VVADDTIRLE NAVFTGIVGT GTLTAAQFVT NTTGLAADAD DRIIYDSDTG RLLYDSDGDG AGGSVHFATV GTNLGMTASD FFVV
|
| |