Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1336 |
Symbol | |
ID | 5322184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1422413 |
End bp | 1423666 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640790278 |
Product | HK97 family phage portal protein |
Protein accession | YP_001327021 |
Protein GI | 150396554 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.352989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTCA TTGATAGGCT GCTCGGCCGC AAGGCTGAAG AAAAAGCCGT GTCGTTCGAT CCCGTGTGGC TCGACTGGTT CGGCTCTAGG CAATCGAAGG CAGGTGTTCC CGTAAATTGG GAGCGCGCGC TGGACGTTTC CACGGTATTC GCGTGTATCC GAGTTATTGC TAACGGCGTT GCGCAGGTTC CCTTGCGCGT CATGAAGGAA CTGCCTGACG GGAAGGGCGG CACTCCCGCC ACCGAACACC CGCTGTACGC CGTCCTGAAC CGGCGACCAA ACAAGTGGAT GACGTCTTTC GAACTCCGAG AGACGCTCAT ATTTCACGCA GCGCTGACCG GCAACGCGTT CTTCTACAAA AACATGGTAC GTGGCCAGGT CAGGGAACTT ATCCCGATTG ACCCTGGCTG CGTCACGATT ACGCGGAGCA ACGATTATTC GCTGACCTAC ACGGTAAGCG GCATTGGTGG GCGGTCTATG GACTTCCCGC AGTCGCTGAT CTGGCACATT CGCGGGCCGT CTTGGGACAC GTGGCGCGGC TTGGACGCCG TGCAGCAGGC TCGCGAGGCA ATCGGCCTCA CGATCGCGAC GGAGAACACG CAGGCCGAGC TCCACGCCAA CGGCGCGATG CCCTCCGGTG TGTATTCGAC CGATCAGAAG ATTGATCCGG AGAAATACAA GCAGATTCAG GCTTGGATTG CGGCTCAGGT AAGCGGCGCC AACCGGCACA AGCCGTTTGT CATCGACTCG AACTTTAAGT GGACGCCGCA GTCAATGAGC GGCGTCGACG CACAGCATCT TGAGACGCGA AAATTCCAGA CCGAGCAGCT TTGCCAGTCT CTTGGCGTGT TTCCGCAGAT GATTGGCCAC GCCGGCCAGG CAATGACGTT TGCGAGCGCA GAGCAGGTGT TTTTGGCTCA CGTTGTGCAT ACGCTCGGGC CGTGGTGGGA GAGAATCCAG CAGTCCATAG ACGTCAATTT GCTCGACGGG CCGGAGGATG CCGGTTACTA CGCGAAATTT AACGCCAACG GACTGCTTAA GGGTGCCCAC AAGGACCGAG CCGAGTTCTA TTCCAAGGCT CTCGGTACTG GCGGGTCGCC TGCATACATG ACGCCGAACG AGATTCGCGC ACTTGAAGAC CTGAATCCGA TCGAGGGCGG CGACGAACTG CCGAAGCCGA CGAACGTTGG TGGCGCTCCA GCGCCAGACA AGCCGAAAGA CGGCGCACAG GATCCTAAAA ATGACGAAAA ATAA
|
Protein sequence | MGLIDRLLGR KAEEKAVSFD PVWLDWFGSR QSKAGVPVNW ERALDVSTVF ACIRVIANGV AQVPLRVMKE LPDGKGGTPA TEHPLYAVLN RRPNKWMTSF ELRETLIFHA ALTGNAFFYK NMVRGQVREL IPIDPGCVTI TRSNDYSLTY TVSGIGGRSM DFPQSLIWHI RGPSWDTWRG LDAVQQAREA IGLTIATENT QAELHANGAM PSGVYSTDQK IDPEKYKQIQ AWIAAQVSGA NRHKPFVIDS NFKWTPQSMS GVDAQHLETR KFQTEQLCQS LGVFPQMIGH AGQAMTFASA EQVFLAHVVH TLGPWWERIQ QSIDVNLLDG PEDAGYYAKF NANGLLKGAH KDRAEFYSKA LGTGGSPAYM TPNEIRALED LNPIEGGDEL PKPTNVGGAP APDKPKDGAQ DPKNDEK
|
| |