Gene Smed_1658 details


Gene Information

Locus tag: Smed_1658
Symbol:
ID: 5322516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1747412
End bp: 1749136
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 66%
IMG OID: 640790598
Product: lambda family phage portal protein
Protein accession: YP_001327330
Protein GI: 150396863
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
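
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 1747412
END_BP = 1749136


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1725 574
```

Both results agree with the Gene Length (1725 bp) and Protein Length (574 aa) fields.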


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0805551
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.257873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG ACGTCACGAT CCTCGGCCCC GATGCGAAGC CGCTTTCGCC GGCAGTTCGT 
GCCGCTGCCC GCGTGCAGGT TGCGAGAAAC CGGCTGATGG CGTCTTCGGC CTACCAAGGT
GCATCCTACG ATCACCCGTC CTTCGCCAAA TGGCGGCCGG GCACCTGGTC CGGGCAGTCG
GCGCTGACCT GGTCGCGCTC CGAGCTGGTC GACCGGCTGA ACGACGTGGC GCGCAATGAC
GGCTGGGGTG CCGCCGGCAC CTCGCGCCTC GTCGACAACA TCATCGGCTC GGGCTGGACG
CTTGCGGCGC GGCCGAACCA CGTCTCGCTC AACATGACCT TTGAGCAGGC GGAGGAGATC
GCCGACAAGA TCGAGGCCCT GTGGCGCGAT TACACGCAGG ACGTCGACAA ATGGTGCGAC
GCCGAGCGGA CGAAAACCAT GGCCGGCGTT CTCGGCCTTG CTGCCCGTCA GCGGTTTGGT
CCCGAGGGCG AGGCCTTCGG CGTCATCGTC TGGCAGGACA ATGCACCGTT GTTCCAGACG
GCAATCCATG TCGTCGATCC GGCCCGGTGT TCAAACCCGA ACGGGCGCAT GGACGAGGAG
TTCCTGCGCG ACGGCGTTGC CATCGACGGA TACGGCGCAC CGGTCGGCTA CCACTTCCGC
AAGTCGCATC CCGGCGAGTT CTTCGCCGGC AATACCGGCC TGTGGCATTG GGAGTATGTC
GATCGGGAGA CCGAATGGGG GCGCCCGATC GTCGTTCACG CCTACGAGCA GAAGCGCGCC
GGCATGACGC GCGGCGTTTC CGACTGGGCT CCGGTCATGC GGTCGATCAA GCAGTCGACC
GACTACGAGG ACTATGAGAG CCAGGCGGCA ATGCTGAACG CTGTCATGGC TGCCTTCATC
GAAACCCCCT TCGATCCGGA AGAGATGCTC GAGGCGATCG GCGCGGATTA CGGCAATGAC
GGCATCGCCA AGCTCTTCGG CGAAATGTCG GCCGCGCAGA AGGCCTATTA CGGCGCCGCA
CCGATCGATT TGCCCGGCGT TCGTATCAAC ACGCTGCAGC CCGGCGAAAA GGCGACGCTG
ACCAAGCCGG AGCACCCGAA CGCCAATTTC GAGGCCTTCG TCAATGCGGC GCTGCGCAAG
GTCGCGAGTG CGATCGGCGT CACCTACGAG CAGCTCACCA TGGACTGGAG CCAGGTGAAC
TATTCGTCGG CGCGCGCGGC ACTCCTCGAG ATCTGGCGCG GCTTCACCGC CAAGAAGGGC
GGCTTCGCCT CGCAGTTCAT GGCACCGATC TATCGGGCAT GGCTCGAGGA GGTGTTCGAC
AAGGGCCTGA TCGAGCTCCC GGCGGGAGCC GTTCCTTTCG AGCTGAACCC GGCAGCATGG
TGCCATGCGG ACTGGATCGG CCCCGGCCGC GGCTGGATCG ATCCGCTGCG CGAGGCGCAG
GCTGCCAGCG AGCGGCTCGC CGGCAATCTG ACCACGCTCC AGCAGGAAGC GGCCGAGCAG
GGGCGGGACT GGAAGATGGA TGCGCAGCAG CGCGCCCGGG AACGGGCCTT CTACGAACGG
CTCGGGCTCG ATCCAGATCC TGGCAAGCCC GAAGCCAGAT CGCAGGCGAG TGCCGCTCCG
CCAGCCGAGC CCGGCGACGA GGCCGAGGAA GAGGTCAACG GACGGACCTC GGCGCGGCGC
CATCCTGCCG GCATCCCGAG GATTGCCAGA AGGAAAACGG CATGA
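
The gene sequence is printed in space-separated groups of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that step plus a GC-content calculation. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    seq = "".join(text.split()).upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as shown in the record.
first_line = "ATGAGCGGCG ACGTCACGAT CCTCGGCCCC GATGCGAAGC CGCTTTCGCC GGCAGTTCGT"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions over the full 1725-base sequence gives a value that can be compared with the GC content field in the Gene Information section.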
 
Protein sequence
MSGDVTILGP DAKPLSPAVR AAARVQVARN RLMASSAYQG ASYDHPSFAK WRPGTWSGQS 
ALTWSRSELV DRLNDVARND GWGAAGTSRL VDNIIGSGWT LAARPNHVSL NMTFEQAEEI
ADKIEALWRD YTQDVDKWCD AERTKTMAGV LGLAARQRFG PEGEAFGVIV WQDNAPLFQT
AIHVVDPARC SNPNGRMDEE FLRDGVAIDG YGAPVGYHFR KSHPGEFFAG NTGLWHWEYV
DRETEWGRPI VVHAYEQKRA GMTRGVSDWA PVMRSIKQST DYEDYESQAA MLNAVMAAFI
ETPFDPEEML EAIGADYGND GIAKLFGEMS AAQKAYYGAA PIDLPGVRIN TLQPGEKATL
TKPEHPNANF EAFVNAALRK VASAIGVTYE QLTMDWSQVN YSSARAALLE IWRGFTAKKG
GFASQFMAPI YRAWLEEVFD KGLIELPAGA VPFELNPAAW CHADWIGPGR GWIDPLREAQ
AASERLAGNL TTLQQEAAEQ GRDWKMDAQQ RARERAFYER LGLDPDPGKP EARSQASAAP
PAEPGDEAEE EVNGRTSARR HPAGIPRIAR RKTA
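
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model). The function and variable names are hypothetical.

```python
from itertools import product

# Amino-acid assignments for translation table 11, with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence as shown in the record.
first_line = "ATGAGCGGCG ACGTCACGAT CCTCGGCCCC GATGCGAAGC CGCTTTCGCC GGCAGTTCGT"
last_line = "CATCCTGCCG GCATCCCGAG GATTGCCAGA AGGAAAACGG CATGA"

print(translate(first_line))  # MSGDVTILGPDAKPLSPAVR
print(translate(last_line))   # HPAGIPRIARRKTA
```

The two outputs match the first 20 and last 14 residues of the protein sequence above. The last line can be translated on its own because the preceding 1680 bases are a whole number of codons.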