Gene Smed_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1336 
Symbol 
ID5322184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1422413 
End bp1423666 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content58% 
IMG OID640790278 
ProductHK97 family phage portal protein 
Protein accessionYP_001327021 
Protein GI150396554 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.352989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTCA TTGATAGGCT GCTCGGCCGC AAGGCTGAAG AAAAAGCCGT GTCGTTCGAT 
CCCGTGTGGC TCGACTGGTT CGGCTCTAGG CAATCGAAGG CAGGTGTTCC CGTAAATTGG
GAGCGCGCGC TGGACGTTTC CACGGTATTC GCGTGTATCC GAGTTATTGC TAACGGCGTT
GCGCAGGTTC CCTTGCGCGT CATGAAGGAA CTGCCTGACG GGAAGGGCGG CACTCCCGCC
ACCGAACACC CGCTGTACGC CGTCCTGAAC CGGCGACCAA ACAAGTGGAT GACGTCTTTC
GAACTCCGAG AGACGCTCAT ATTTCACGCA GCGCTGACCG GCAACGCGTT CTTCTACAAA
AACATGGTAC GTGGCCAGGT CAGGGAACTT ATCCCGATTG ACCCTGGCTG CGTCACGATT
ACGCGGAGCA ACGATTATTC GCTGACCTAC ACGGTAAGCG GCATTGGTGG GCGGTCTATG
GACTTCCCGC AGTCGCTGAT CTGGCACATT CGCGGGCCGT CTTGGGACAC GTGGCGCGGC
TTGGACGCCG TGCAGCAGGC TCGCGAGGCA ATCGGCCTCA CGATCGCGAC GGAGAACACG
CAGGCCGAGC TCCACGCCAA CGGCGCGATG CCCTCCGGTG TGTATTCGAC CGATCAGAAG
ATTGATCCGG AGAAATACAA GCAGATTCAG GCTTGGATTG CGGCTCAGGT AAGCGGCGCC
AACCGGCACA AGCCGTTTGT CATCGACTCG AACTTTAAGT GGACGCCGCA GTCAATGAGC
GGCGTCGACG CACAGCATCT TGAGACGCGA AAATTCCAGA CCGAGCAGCT TTGCCAGTCT
CTTGGCGTGT TTCCGCAGAT GATTGGCCAC GCCGGCCAGG CAATGACGTT TGCGAGCGCA
GAGCAGGTGT TTTTGGCTCA CGTTGTGCAT ACGCTCGGGC CGTGGTGGGA GAGAATCCAG
CAGTCCATAG ACGTCAATTT GCTCGACGGG CCGGAGGATG CCGGTTACTA CGCGAAATTT
AACGCCAACG GACTGCTTAA GGGTGCCCAC AAGGACCGAG CCGAGTTCTA TTCCAAGGCT
CTCGGTACTG GCGGGTCGCC TGCATACATG ACGCCGAACG AGATTCGCGC ACTTGAAGAC
CTGAATCCGA TCGAGGGCGG CGACGAACTG CCGAAGCCGA CGAACGTTGG TGGCGCTCCA
GCGCCAGACA AGCCGAAAGA CGGCGCACAG GATCCTAAAA ATGACGAAAA ATAA
 
Protein sequence
MGLIDRLLGR KAEEKAVSFD PVWLDWFGSR QSKAGVPVNW ERALDVSTVF ACIRVIANGV 
AQVPLRVMKE LPDGKGGTPA TEHPLYAVLN RRPNKWMTSF ELRETLIFHA ALTGNAFFYK
NMVRGQVREL IPIDPGCVTI TRSNDYSLTY TVSGIGGRSM DFPQSLIWHI RGPSWDTWRG
LDAVQQAREA IGLTIATENT QAELHANGAM PSGVYSTDQK IDPEKYKQIQ AWIAAQVSGA
NRHKPFVIDS NFKWTPQSMS GVDAQHLETR KFQTEQLCQS LGVFPQMIGH AGQAMTFASA
EQVFLAHVVH TLGPWWERIQ QSIDVNLLDG PEDAGYYAKF NANGLLKGAH KDRAEFYSKA
LGTGGSPAYM TPNEIRALED LNPIEGGDEL PKPTNVGGAP APDKPKDGAQ DPKNDEK