Gene Smed_1660 details

Gene Information

Locus tag: Smed_1660
Symbol:
ID: 5322518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1749380
End bp: 1751422
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 65%
IMG OID: 640790600
Product: phage terminase GpA
Protein accession: YP_001327332
Protein GI: 150396865
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
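
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1749380
end_bp = 1751422

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; a complete CDS has no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (2043 bp) and Protein Length (680 aa).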


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.284129
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTGC TGTTCAATCC CGAGCGGCTC GCTCTCAGCG TGCTTGCCGA GATCTGCGAA 
CCGCCGCCGG CAGTCGATTA TCTCGACTGG GCGAAGCGGA ACATCGTGTT CTCGGAACGC
ATCACGGACC ATCCGGGGCC GTACAACGAA GACCTGGTGC CGTTCTTCTC GGAGATCCTG
CGAGCGTTGT CGCCGGAAGA TCCGTGCAAC ATCGTCAGCC TGGCGAAGTC GGCGCAGATC
GGCGGTACCA TCTGCGCCAA CATCTTCACG CTCGGCTCGC TCGACATGGC GCCCGGCGAT
TTCCTCTATG TCCACCCGAC CGAGGAGAAC GCCGCCCGCT GGTCGAAGAC GAAGCTGATG
CCGCTGGTGC GCGAGATGCC GGCGGTCGCC AAGCTGTTCT CGCAAAACAG CCGCGATGCG
AGCAACTCGG TGCTCTACAA GGAACGCGTC GACGGGCGCG GCGCCATCCA GGCGGCCGGC
GCCAACTCGC CGGCAGGCCT GTCGATGATC TCGCCGCGAA AGCAGGTCCA GGACGATCTT
GCCAAGTGGC AAATGAACGA GGCTGGTGAT CCGGAGGTGC AGGCGGACAG CCGCAGCAAG
GCGTTCTTCA ACGGCAAGAT CTTCAAGATC TCGACGCCGA TGGTATCGCC GGGCTGCAAG
ATCACGTCGA ACTATCAGGA AGGGACGCAG GAGACCTACC ATGTCCCCTG TCCGCACTGC
CAAGAGCTGC AGGAGCTGCG CTGGGAGAAC ATGCGGGATC ACATCGATCC CGAGCATCCC
GAGCAGGCGC ATTTCGTCTG CATCCATTGC GGCTGCGAGA TCCACGAGCA CCATCGCGAA
TGGATGGTGA AGCCGGAAAA CGGCGCGAAG TGGGTTGCCA AATATCCGGA GCGCGGCCGC
CGCCATCGAT CGTTCCGCAT CTGGATGGCC TATTCGCCTT TCGAGCGCTG GGAGAACCTG
GCGCGCGAGT GGCTGACGGT CCAGGCCGGT GGCCCGGAGA ACCGGGAAAA GGGTTCTGGC
GCCGAGCAGA CGTTCTGGAA TGACTGGCTC GGGCTCGCCT TCGAGGCGGA CAACAAGGCG
ATCGACTGGG AAGTGCTCCG CGATCGCGCC GAGGACCACG GTTTCCAGCG CGGTGTCATC
CCGGCCGAGG CGCTGGCGCT GGTGCTCGGC ATGGACGTGC AGGGCGACCG CGTCGAGTGG
CTGCTGGTCG GCTACGGCAG GAATCGGTAC CGGGCCGTCA TCGACCACGG CGTCGTCGAC
CATCGCGCCG GCAGCCACCT GGCGGACGCC AAGGAACATT CCGGCCATAT CTCCGAGCCG
GAGGTTCGCA CCGCCCTCGA TCGGCTGCTG CAGCGCGAAT GGCTCGACGA TGCCGGCCGC
AAGCGAACCG CCGACCGCGT CGCCATCGAC GGCAACGCCT ATACCGACGA CGTCTGGAAC
TGGGTTCGCA AGCATCCGAA GTCGCGCGTC ATCATGGTGC GCGGCGGCAA TACGGAAGCC
GCGCCGCCGA TCGTGCAGAC GAAAGAGTAT GACCGGAAGG GCAAGCCGAA GAAGCAGAAG
TGGTCCTCCC GCTTCTTCAC CTTCAACGCC TCGGCCTTCA AGATCCGGCT CTACCGGGAC
TACAAGAAAG ACGATCCGGA GCAGGCGGGC TATATCCGTT TCGCCCGCGG CTTCGGCGAC
GATTTCTACC AGCAGGCGAC ATCGGAAGCC CGGGTACCGG AGAAGACCCG GAGCGGTCAC
ACCCGCTACG TCTGGAAACT CGGCGAGGGC AAGCGCAACG AAATCATCGA CATGCTCAAC
CAGAGCCTGG CCGGTGCCTA TCGCTGGGGC GTGCCCTACT GGACCGACGA GGAATGGGAC
GCGATCGCCG ATCGCCTCGG CCGCCTCGAA GCGCCGCAAC AGGGCGATCT CGAGGATCAT
CTGAACCAGA TCGCCGTCAA GACCGAACCT GCCGCAGGCC AAAGCGCCGC GGCAGAACAG
CAATCGCCGC TCGTCGCCGC CGCCCTCGCG CGCGCCGCCC GGGCAGCGCA GCGAAACCGC
TAG
 
Protein sequence
MTVLFNPERL ALSVLAEICE PPPAVDYLDW AKRNIVFSER ITDHPGPYNE DLVPFFSEIL 
RALSPEDPCN IVSLAKSAQI GGTICANIFT LGSLDMAPGD FLYVHPTEEN AARWSKTKLM
PLVREMPAVA KLFSQNSRDA SNSVLYKERV DGRGAIQAAG ANSPAGLSMI SPRKQVQDDL
AKWQMNEAGD PEVQADSRSK AFFNGKIFKI STPMVSPGCK ITSNYQEGTQ ETYHVPCPHC
QELQELRWEN MRDHIDPEHP EQAHFVCIHC GCEIHEHHRE WMVKPENGAK WVAKYPERGR
RHRSFRIWMA YSPFERWENL AREWLTVQAG GPENREKGSG AEQTFWNDWL GLAFEADNKA
IDWEVLRDRA EDHGFQRGVI PAEALALVLG MDVQGDRVEW LLVGYGRNRY RAVIDHGVVD
HRAGSHLADA KEHSGHISEP EVRTALDRLL QREWLDDAGR KRTADRVAID GNAYTDDVWN
WVRKHPKSRV IMVRGGNTEA APPIVQTKEY DRKGKPKKQK WSSRFFTFNA SAFKIRLYRD
YKKDDPEQAG YIRFARGFGD DFYQQATSEA RVPEKTRSGH TRYVWKLGEG KRNEIIDMLN
QSLAGAYRWG VPYWTDEEWD AIADRLGRLE APQQGDLEDH LNQIAVKTEP AAGQSAAAEQ
QSPLVAAALA RAARAAQRNR
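
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence. The following is a minimal sketch, not the method used to generate this record. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 60 bases are used to keep the example short.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: translation table 11 uses the same amino acid
# assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGACCGTGC TGTTCAATCC CGAGCGGCTC GCTCTCAGCG TGCTTGCCGA GATCTGCGAA"

peptide = translate(first_row)
gc = gc_fraction(first_row)
print(peptide, gc)
```

The translated peptide corresponds to the first two blocks of the protein sequence, MTVLFNPERL ALSVLAEICE. The GC fraction computed here covers only this first row, so it need not equal the whole-gene GC content listed in the record.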