Gene Smed_5017 details

Gene Information

Locus tag: Smed_5017
Symbol:
ID: 5318756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1539276
End bp: 1540769
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 58%
IMG OID: 640776799
Product: glycosyl transferase family protein
Protein accession: YP_001313731
Protein GI: 150377135
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
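
The coordinates and lengths in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1539276, 1540769)
print(length, protein_length(length))  # prints "1494 497"
```

Under these assumptions the result agrees with the Gene Length (1494 bp) and Protein Length (497 aa) fields.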


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.506364
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGTC TGCTGGTTCG AAGACCGGAT CTCTTATACG TTCTGATCGC CGGCTATTGC 
CTGCTGAGCG TCATCCTGAA AGTTCTTAGA CCGGACTCGC TTGAGATAGA CGAATCCGAG
CAAGCACTAC TCTCACAATA TCTGCTGCTC GGATACGGAG GCCAGCCGCC CTTCTACAAT
TGGCTTCAAT ATGGAGTCGT GGCTCTATTT GGTATCTCCG TCGCGTCGCT CGCCATTTTG
AAAAACGGTC TTCTGTTCCT CTTCCTCCTA CTCTACGGCC TTACGGCCCG CCTTCTCTCC
AGCAGTCCGC TCGCTCCGGC GGTGGCGGTG CTGGGCGTCC TCACGCTCCC ACCGGTTTTC
CTGCTGTCAC AGCGCGATTT ATCCCACACC GTCGCCGCGC TATTTGCCGT GTCGCTGTTC
CTTTACGGCT TTTTCCGTGC GCTAAAGAAC CCACCAAAGG TTGGCCACTA CTTGCTTGTC
GGTGTTGCTG TGGGGCTCGG GGCGATTTCG AAATACAATT TCGTTATCTT GCCGCTGGCG
GCACTGCTCG CCATCCTGCC AGAGGCGAAG CTGCGCAAAT ATCTGTTCGA CTGGCGGGTG
CTCGCCTCAG TTGCGGTTTG CGCAGTCATC GTGGCACCCC ATGCCTACTG GGTCGTCAAT
AATCTCGGAC ATGCCACCGG TGTCACTGTC GCTGAGATGA AGGAGGGGGC AGACAGCGCG
CTGCTGCCGC ACGCGATCCA AGGCCTTGTT TCGCTCGCGG TTGCGGCCCT CAAGGGCGTC
GCACTGACCT TTGCCGTCTT CGGATTGATC TTCTACGCGG ATGTTGGGAA AATCCTGTGC
GCCGAGAGCC TATGGACGAG GGTCGTCGGC AGGATGATTG TCGCCTGCTT CCTCATGATA
GCGTTTATCG TCGTAGCAAT GGATGCGACC CATATCCGTG CGAAGTGGTT GGCGCTTTTT
ACTGCGCTTC TCCCACTTTA TCTGACGCTG AAAATCGATG CCGCCGGCCT GGATCCTGCC
CGGCGCCTGC CGGCATTTTT TTCGATTTCG GGCATCCTCT CCGTCGGCGT CATCGTGATG
CTTTGGGCAA GGGTCTTTGT CGGACCAATG ATTGGCGATT ATTCCTTCGC GCACACGCCT
TACAGCGGTT TTGCCCGCAT GGTGCGCGCT GATCCGGGTC CGCCCCGGGT CGCCATCGTG
GTCGACGACA GGATCGTGGC GGGTAATCTC AGGATTCAGT TTCCCGATAC CCCGATCATC
CTGACCGGAT TTTCTCAGGA GGCGGAGAGA CACCTTCCCC CCGGTCGAAT TTTGGCGGCC
TGGTCGGCAG AAGGTAAGAG GCGAGCGGAG ATTCCTCCGC GCATTACGGG CCTACTGCAA
CTCATGCCCG TCAGGATCGC CGACGCCAAG CCGACGATCG TCTCAGTTCC TTACAACGCC
GGGCGCCCGG GCGACACATA CACTTTTGCC TATATTTGGG CGGACTTCCA CTGA
 
Protein sequence
MRSLLVRRPD LLYVLIAGYC LLSVILKVLR PDSLEIDESE QALLSQYLLL GYGGQPPFYN 
WLQYGVVALF GISVASLAIL KNGLLFLFLL LYGLTARLLS SSPLAPAVAV LGVLTLPPVF
LLSQRDLSHT VAALFAVSLF LYGFFRALKN PPKVGHYLLV GVAVGLGAIS KYNFVILPLA
ALLAILPEAK LRKYLFDWRV LASVAVCAVI VAPHAYWVVN NLGHATGVTV AEMKEGADSA
LLPHAIQGLV SLAVAALKGV ALTFAVFGLI FYADVGKILC AESLWTRVVG RMIVACFLMI
AFIVVAMDAT HIRAKWLALF TALLPLYLTL KIDAAGLDPA RRLPAFFSIS GILSVGVIVM
LWARVFVGPM IGDYSFAHTP YSGFARMVRA DPGPPRVAIV VDDRIVAGNL RIQFPDTPII
LTGFSQEAER HLPPGRILAA WSAEGKRRAE IPPRITGLLQ LMPVRIADAK PTIVSVPYNA
GRPGDTYTFA YIWADFH
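
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The sketch below shows one way to recompute both from the space-grouped sequence text. It is an illustration under stated assumptions, not the method used to build this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which table 11 shares with table 1 (the two tables differ only in permitted start codons). Only the first 30 bases of the gene are used as a demonstration input.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; table 11 uses the same
# assignments as table 1 and differs only in alternative start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied with their original spacing.
prefix = "ATGCGCAGTC TGCTGGTTCG AAGACCGGAT"
print(translate(prefix))  # prints "MRSLLVRRPD"
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared against the protein sequence and the GC content field listed in this record.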