Gene Smed_4567 details


Gene Information

Locus tag: Smed_4567
Symbol:
ID: 5319235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1055119
End bp: 1056345
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 62%
IMG OID: 640776368
Product: glycosyl transferase group 1
Protein accession: YP_001313300
Protein GI: 150376704
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
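
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive (an assumption, although the listed numbers agree with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
gene_len = feature_length(1055119, 1056345)
print(gene_len)                           # 1227
print(expected_protein_length(gene_len))  # 408
```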


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.392838
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACACTTG TTGAAAAGGT TCCGCGCGCC GAAGCCGAGC ACATTGCCGA AGGGTCGGCA 
GGACGCGGCA GCGCGGCACG TTTTCCGCAG CAGAAGCGCG TGATAGCCGT CGTCGCCAGC
TTGACAGCCT CTCTGGTGAT CTTCCGGCTC GAATTGCTGA AGCGGCTGGT CGCTGCCGGG
CACGACGTCA TCGCCTTCGC TCCTGAACAT GACGCCCGGG TCGAGCAGGA GCTTGCGCAG
ATCGGCGTGC GCTTTATCCG GATCCCGATG GCGCGCACCG GCCTCAACCC GCTGGAGGAC
CTGCGGACCT TTTGGGCGTT GCGGCGACAC TTCGCCCGGC TGAAGCCGGA CATTGTCCTT
CCCTACACGA TGAAACCGAT CATCTATGCC GGCATTGCGG CGAGGACGCT GGGTATTAGA
GAACGCTGCT TTCTCGTCAC GGGCCTCGGC CACATATTCT CAGAGGCTGC CGGCGCTTCG
CTCAAGGCGA AGGCCATACG CCACCTCTGT GTCCGGCTAT ATCGTACAGC GCTGCGGGGC
GCCCGCGTCG TCTTCGTCTA CAACGACGCG GATGAGAACG ACATTCGTCG TTATCGGATG
CTGAGTGGCC ATCTCTCGCC GACGATGATC TCCGGATCGG GAGTCGATCT CGACCATTTC
GCGTTCTCGA CGCCGCCTCG CGGCGGACCG ACTTTCCTGA TGGTCGCCCG GCTGCTGCGC
GACAAGGGCG TCGTCGAATA TGTAGAAGCC GCGCGTATTG TCCGTCGCTC TTTCCCGAAT
GCCAGATTTC AGCTGCTTGG CCATTTCGAC AGCAATCCGA CGGCAATTTC CCGAGAGGAA
ATCGACGCAT GGGGGCGCGA AGGGATACTC GACTATCTTG GCACCACTGT CGATGTGCGG
CCATATCTTG CGGCATGCAA CGCCTTCGTT CTGCCCTCTT ACTATCGCGA AGGAATTCCG
CGGAGCATTC TCGAGGCGCT GGCGACAGGA CGGCCCGTGA TCACGACGGA TCTGCCGGGC
TGCCGCGACA CCGTGCAGCC GGGGAAAAAC GGCTTGGTGG TCAAGGCGCG TGACGTCGCC
GCCCTCGCGG AAGCGATGAC CACTGTCGCA AAGAACCCGG ACCTGGCGGA GGAGATGGGA
AGGCGGTCGC GGGAACTCGC GGAAACCAGG TTCGACGTTC ACATGATCAA CAGAATGCTG
TTTGACGGCA TGCATCTGAC CGATTGA
 
Protein sequence
MTLVEKVPRA EAEHIAEGSA GRGSAARFPQ QKRVIAVVAS LTASLVIFRL ELLKRLVAAG 
HDVIAFAPEH DARVEQELAQ IGVRFIRIPM ARTGLNPLED LRTFWALRRH FARLKPDIVL
PYTMKPIIYA GIAARTLGIR ERCFLVTGLG HIFSEAAGAS LKAKAIRHLC VRLYRTALRG
ARVVFVYNDA DENDIRRYRM LSGHLSPTMI SGSGVDLDHF AFSTPPRGGP TFLMVARLLR
DKGVVEYVEA ARIVRRSFPN ARFQLLGHFD SNPTAISREE IDAWGREGIL DYLGTTVDVR
PYLAACNAFV LPSYYREGIP RSILEALATG RPVITTDLPG CRDTVQPGKN GLVVKARDVA
ALAEAMTTVA KNPDLAEEMG RRSRELAETR FDVHMINRML FDGMHLTD
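
As a sanity check, the protein sequence can be re-derived from the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table whose amino-acid assignments are shared by the standard code and translation table 11, and it assumes that an initiating ATG, GTG, or TTG codon is read as methionine (this gene starts with TTG). The names translate_cds and START_CODONS are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only these common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating start codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "TTGACACTTG TTGAAAAGGT TCCGCGCGCC GAAGCCGAGC ACATTGCCGA AGGGTCGGCA"
print(translate_cds(first_line))  # MTLVEKVPRAEAEHIAEGSA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence in the same way should reproduce the complete protein.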