Gene Smed_0655 details


Gene Information

Locus tag: Smed_0655
Symbol:
ID: 5321491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 707521
End bp: 709002
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 62%
IMG OID: 640789591
Product: glycosyl transferase family protein
Protein accession: YP_001326346
Protein GI: 150395879
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
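
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 707521
end_bp = 709002
gene_length_bp = 1482
protein_length_aa = 493


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the values listed in the record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions both comparisons agree with the record: 709002 - 707521 + 1 = 1482 bp, and 1482 / 3 - 1 = 493 aa.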


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.835096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGATG CGGTCGATCG ACGGCCGGGG GCGGTCGTTC TGGCGGTGGC TGGTTATTTC 
CTGCTGTGCA TCACCCTTCG GATATCCGTC TCGAGCTCTC TCGAGATCGA CGAGGCGGAA
CAGGCCTTTC TGTCGCAATT CCTCGAACTC GGATACGGAC CGCAGCCGCC CTTCTACAAT
TGGCTGCAAT ACGGCCTGGC GGAGCTTTTC GGAGCGTCCG TGGCGACGAT GACCATTCTC
AAGAATGGGC TCCTGTTGCT CTGTTGCCTG TTCTACGGCC TTGCGGCACG GCAGGTTCTG
GCCGATCGGC GCCTGGCGGC CATTGCGATG CTCGGCGTCC TTGCGCTGCC TCCGGTTTTT
CTGCTGGCGC AGCGCGACCT CTCGCACACG GTTGCCGCAC TTTTCGCGGT ATCGTTGTTT
CTCTACGGCT TTCTGAGAAC GATGAAACGG CCCAGCCTTT TCTGGTATCT CCTCACAGGC
CTTGCGGTGG GCATCGGCCT AATGGCGAAG TATAACTTCG CTTTGGTGCC GGTCGCGGCA
ATCGTCGCTG TACTGCCGGA ACGGGAGATG CGGCCGCGGA TTCTCGACTG GCGCATTCTG
CCGGCAATTG CGGTCGCAGC TCTCATCGTT TTACCCCATG CTTACTGGAT GCTGCAGAAC
CTTGGTTTCG CCTCGGGTGG CACCTTGAAC GAGATGAGGG AGCGGGAGGC GGAAGGACGC
CTGCTTCAGG CCTTCTACGG TGCCTACTCG CTCGCCTCTG CGATCATCGG CGGCAGCCTG
GTCCCGCTGC TCGTCTTTGG CCTCGCCTTC CGCGGCAAGC TCCGTGCCAT CTGGAAGGCG
GAAAGCCAAT GGAGCCGAAT CGTGGGGCGT ATGCTTGCCC TTTGCTTGAT TGCGGTCCTG
CTCGTGGTGC TCGGCGTCGC CGCAACACAT GTGCGCGAAA AATGGCTGGT TCTCTTCCTC
GTTCTCATTC CGCTTTACCT CTGCCTGAAG ATCGAAGCGG CGAATATCGA CCTTAGTGAC
AGCCTCCGGC GCTTTTTCCT GCTGGTCTGC GTCATTGCGC TGGGCGCGCT CGTGCTGGTG
TCGGCGCGCG CCGTCGTGCG ACCGTGGTTC GGCGACTACT CGCGGCTGAA TATACCCTAC
TCCGCGTTCG CCCAAGCGGT GGCGCAGGCG AAAGGCGGGC AGCCCGCACT TATCCTCGCC
AATGACAAGC AGATTGCGGG AAATCTCCGA ACGCAGTTCG GCGGGGCGCA GGTCACGATG
CCAAGGCCCT CGAATGCGCT TCCGGTCGAT ATGTCGCGAA GGCCCCTTCT CGTGGTCTGG
CATGATGACG TCCGACCGGA AGCACCCGTT CCCGAGCTGC TCCGGAACAC TCTATCCGCC
CTTGGCGCCC CGGCAGCGGC GCCAAGCCAC CTTGACCTGC CCTATCTCTA CGGCACCGGC
CCGGATCGCT ACAGTTTCAG CTATATCTGG ATTGCGGAAT AG
 
Protein sequence
MLDAVDRRPG AVVLAVAGYF LLCITLRISV SSSLEIDEAE QAFLSQFLEL GYGPQPPFYN 
WLQYGLAELF GASVATMTIL KNGLLLLCCL FYGLAARQVL ADRRLAAIAM LGVLALPPVF
LLAQRDLSHT VAALFAVSLF LYGFLRTMKR PSLFWYLLTG LAVGIGLMAK YNFALVPVAA
IVAVLPEREM RPRILDWRIL PAIAVAALIV LPHAYWMLQN LGFASGGTLN EMREREAEGR
LLQAFYGAYS LASAIIGGSL VPLLVFGLAF RGKLRAIWKA ESQWSRIVGR MLALCLIAVL
LVVLGVAATH VREKWLVLFL VLIPLYLCLK IEAANIDLSD SLRRFFLLVC VIALGALVLV
SARAVVRPWF GDYSRLNIPY SAFAQAVAQA KGGQPALILA NDKQIAGNLR TQFGGAQVTM
PRPSNALPVD MSRRPLLVVW HDDVRPEAPV PELLRNTLSA LGAPAAAPSH LDLPYLYGTG
PDRYSFSYIW IAE
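
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCTTGATG CGGTCGATCG ACGGCCGGGG GCGGTCGTTC TGGCGGTGGC TGGTTATTTC"
dna = clean(first_line)
print(translate(dna))  # MLDAVDRRPGAVVLAVAGYF
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `clean` and then to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein fields of the record.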