Gene Smed_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1334 
Symbol 
ID5322182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1420375 
End bp1421649 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content59% 
IMG OID640790276 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001327019 
Protein GI150396552 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.280266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATT CTGCCATCGA AAAAGTTATG ACTGCGTTTG AGGAATTTAA GTCCACGAAC 
GACGCGCGCC TCACGCAAAT TGAGAAGAAG GGCGCTGCTG ATCCAGTGAC TGCCGAGAAG
CTCGGCCGCA TTGAGACTGA CCTTTCCAAA CTGGAGGATA TCAACCAGAA GCTGACCGCC
GCTGCGCTTG AAGCCAAGAA GGAGCGGGAT CACGTCGACG AGCTCGAGGC AAAGCTCAAT
CGACTGTCGC TCGCGGCTGC GAACGACAAC GTCCGACAGG ATGAAGTCAA ATCACGATCA
AACACGTGGG CTCGCGCCGT CGTCGGCGCC ATCACCCGCG GAGAGATGAA TATCTCGGCC
GACCAGCAGA AGGCTCTGGC GGACGTCACC GCCGAATACA AGGCTATGTC GGTCGGCAAC
GATACCACTG GTGGGTACCT TGCGCCGGCA GAATACGTCC GAGAGATCAT CAAGGGCGTC
ACAGAGCTCT CGCCAGTTCG TTCGATGGTC CGCGTCCGCC AGACCTCGTC GAAGGCGATC
ATGATCCCGA AGCGCACTGG TCAATTCTCT GCCAGGTGGA CGGCTGAACA GGCCACTCGC
ACGGAGACTG ACGGTCTCCG CTACGGCATG TGGGAAATTC CGACCCACGA GCTTTACGCT
CTGGTGGACA TCAGCGAGCA GAACCTCGAG GATTCCGCTT TCGATATGGA AGCCGAAATC
CGCCTCGAAG CTGGCGAACA GTTCGCAGTC GCTGAAGGTG CGGCCGTGGT TTCCGGCGAC
GGCAACGGCA AGCCTGAAGG CTGGATGACT GCAAGTGGCG TTGGCGAAAA CAACTCCGGA
TCAGCAACCA CGATTGCTGA TGCCGACGGC CAGGCAAATG GATTGCTGAC GCTGAAGCAT
GCGCTGAAGA CGGCTTATAC CCGCAACGCC GTGTGGGCGC TGAACCGCAC CACACTTGGC
TCTGTGCGAC GCTTGAAGGA CGCTGACAAG GGTTACGTTT GGCAACCTGG CCTTGCGCTC
GGCAAGCCGA ACACAATCGA TGGCGATCCC TACGTCGAGG TTCCTGACAT GCCCAACGAG
GGCGCCGGCG CTTTTCCCAT TGCTTACGGC GACTTCCAGC GTGGCTACAC GCTCGTTGAC
CGCATTCAGA TGTCCATGCT TCGCGATCCT TACACGCAGG CAACCGTCGG CAACATTCGC
TTCATGTTCC GTCGCCGTCT CGGCGGCCAG GTGACGCTTG CCGAAGCGTT CCGCAAGCTG
AAGTGCGCGG CCTAA
 
Protein sequence
MADSAIEKVM TAFEEFKSTN DARLTQIEKK GAADPVTAEK LGRIETDLSK LEDINQKLTA 
AALEAKKERD HVDELEAKLN RLSLAAANDN VRQDEVKSRS NTWARAVVGA ITRGEMNISA
DQQKALADVT AEYKAMSVGN DTTGGYLAPA EYVREIIKGV TELSPVRSMV RVRQTSSKAI
MIPKRTGQFS ARWTAEQATR TETDGLRYGM WEIPTHELYA LVDISEQNLE DSAFDMEAEI
RLEAGEQFAV AEGAAVVSGD GNGKPEGWMT ASGVGENNSG SATTIADADG QANGLLTLKH
ALKTAYTRNA VWALNRTTLG SVRRLKDADK GYVWQPGLAL GKPNTIDGDP YVEVPDMPNE
GAGAFPIAYG DFQRGYTLVD RIQMSMLRDP YTQATVGNIR FMFRRRLGGQ VTLAEAFRKL
KCAA