Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1334 |
Symbol | |
ID | 5322182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1420375 |
End bp | 1421649 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640790276 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001327019 |
Protein GI | 150396552 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.280266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATT CTGCCATCGA AAAAGTTATG ACTGCGTTTG AGGAATTTAA GTCCACGAAC GACGCGCGCC TCACGCAAAT TGAGAAGAAG GGCGCTGCTG ATCCAGTGAC TGCCGAGAAG CTCGGCCGCA TTGAGACTGA CCTTTCCAAA CTGGAGGATA TCAACCAGAA GCTGACCGCC GCTGCGCTTG AAGCCAAGAA GGAGCGGGAT CACGTCGACG AGCTCGAGGC AAAGCTCAAT CGACTGTCGC TCGCGGCTGC GAACGACAAC GTCCGACAGG ATGAAGTCAA ATCACGATCA AACACGTGGG CTCGCGCCGT CGTCGGCGCC ATCACCCGCG GAGAGATGAA TATCTCGGCC GACCAGCAGA AGGCTCTGGC GGACGTCACC GCCGAATACA AGGCTATGTC GGTCGGCAAC GATACCACTG GTGGGTACCT TGCGCCGGCA GAATACGTCC GAGAGATCAT CAAGGGCGTC ACAGAGCTCT CGCCAGTTCG TTCGATGGTC CGCGTCCGCC AGACCTCGTC GAAGGCGATC ATGATCCCGA AGCGCACTGG TCAATTCTCT GCCAGGTGGA CGGCTGAACA GGCCACTCGC ACGGAGACTG ACGGTCTCCG CTACGGCATG TGGGAAATTC CGACCCACGA GCTTTACGCT CTGGTGGACA TCAGCGAGCA GAACCTCGAG GATTCCGCTT TCGATATGGA AGCCGAAATC CGCCTCGAAG CTGGCGAACA GTTCGCAGTC GCTGAAGGTG CGGCCGTGGT TTCCGGCGAC GGCAACGGCA AGCCTGAAGG CTGGATGACT GCAAGTGGCG TTGGCGAAAA CAACTCCGGA TCAGCAACCA CGATTGCTGA TGCCGACGGC CAGGCAAATG GATTGCTGAC GCTGAAGCAT GCGCTGAAGA CGGCTTATAC CCGCAACGCC GTGTGGGCGC TGAACCGCAC CACACTTGGC TCTGTGCGAC GCTTGAAGGA CGCTGACAAG GGTTACGTTT GGCAACCTGG CCTTGCGCTC GGCAAGCCGA ACACAATCGA TGGCGATCCC TACGTCGAGG TTCCTGACAT GCCCAACGAG GGCGCCGGCG CTTTTCCCAT TGCTTACGGC GACTTCCAGC GTGGCTACAC GCTCGTTGAC CGCATTCAGA TGTCCATGCT TCGCGATCCT TACACGCAGG CAACCGTCGG CAACATTCGC TTCATGTTCC GTCGCCGTCT CGGCGGCCAG GTGACGCTTG CCGAAGCGTT CCGCAAGCTG AAGTGCGCGG CCTAA
|
Protein sequence | MADSAIEKVM TAFEEFKSTN DARLTQIEKK GAADPVTAEK LGRIETDLSK LEDINQKLTA AALEAKKERD HVDELEAKLN RLSLAAANDN VRQDEVKSRS NTWARAVVGA ITRGEMNISA DQQKALADVT AEYKAMSVGN DTTGGYLAPA EYVREIIKGV TELSPVRSMV RVRQTSSKAI MIPKRTGQFS ARWTAEQATR TETDGLRYGM WEIPTHELYA LVDISEQNLE DSAFDMEAEI RLEAGEQFAV AEGAAVVSGD GNGKPEGWMT ASGVGENNSG SATTIADADG QANGLLTLKH ALKTAYTRNA VWALNRTTLG SVRRLKDADK GYVWQPGLAL GKPNTIDGDP YVEVPDMPNE GAGAFPIAYG DFQRGYTLVD RIQMSMLRDP YTQATVGNIR FMFRRRLGGQ VTLAEAFRKL KCAA
|
| |