Gene Smed_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4020 
Symbol 
ID5318829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp476548 
End bp477567 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content63% 
IMG OID640775828 
Productglycosyl transferase group 1 
Protein accessionYP_001312761 
Protein GI150376165 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGGAAG AGCTCGTCCG GCAGGGTCAC GAGGTCACCC TCTTCGCGAG CGGCGACTCG 
ATGACATCGG CAAGGCTCGT CCCCTGCTCG CAAATGGCGC TGAGGTTGAA CCCCGCCATT
CAGGATCCGA TTCCCTATCA CATGATGCTG CTGGAAGAAG TGCGCCGGCA GGCGCCCGCC
TTCGACATCC TGCACTTCCA CATCGATCTT CTTCATTTTC CCCTTGTCCG CGACCTCGCC
GGAAAGACGG TGACGACCCT GCACGGCCGG CTCGATCTGC CCGACCTCCA GCCGTTCTAC
GCCGCGTTTC CGGATGTCCC CCTGGTTTCT ATCTCGCACG ATCAGCGGCG GCCGATGCCG
CCGGTCAACT GGATCGCCAC TGTCCATCAC GGGCTGGCTC CGGACGTCCT TCCCTTCACC
AGCCCTCCCA AGGGCGACTA CCTGGCCTTT CTTGGCCGTA TCTCGCCGGA AAAGCGGCCC
GACCGTGCGA TCGAAATCGC CGCCCGGGCC GGCATGCCGT TGAAAATGGC CGCGAAGGTG
GACCGGGTGG ACGAGCCCTA TTGGCTGAGT GAGATCGAGC CGCTCATCCG GCGCTATCCG
AATGTCGAGT TCATCGGCGA AATCAACGAA CATCAGAAGG CCGAATTCCT TGGAAACGCG
CGCGCCCTCC TCTTCCCGAT CGACTGGCCC GAACCCTTCG GTCTGGTGAT GATCGAGGCG
ATGGCCTGCG GCACGCCGGT GATCGCCTTT CGCTGCGGCT CGGTGCCTGA GATCGTCGAC
CACGGCGTTT CGGGTTTCAT CGTCGACAGC ACCGAGGAAG CGTTGAAGGC GGTCCACGGA
CTCGACCGGG TCGATCGCCG TATGGTCCGT GCCACATTCG ACAGGCGTTT TACGGCCCAG
CGCATGGCAA ACGATTATCT CGATATCTAT CGTGCTCTGG CAAGCGGCAG CCAAAGGGTC
ATGCCGATCC ATGCTGCGAA CGAAAGCAAT GCCGGCCCTG GCTCTGCCCG GGTCGCCTGA
 
Protein sequence
MTEELVRQGH EVTLFASGDS MTSARLVPCS QMALRLNPAI QDPIPYHMML LEEVRRQAPA 
FDILHFHIDL LHFPLVRDLA GKTVTTLHGR LDLPDLQPFY AAFPDVPLVS ISHDQRRPMP
PVNWIATVHH GLAPDVLPFT SPPKGDYLAF LGRISPEKRP DRAIEIAARA GMPLKMAAKV
DRVDEPYWLS EIEPLIRRYP NVEFIGEINE HQKAEFLGNA RALLFPIDWP EPFGLVMIEA
MACGTPVIAF RCGSVPEIVD HGVSGFIVDS TEEALKAVHG LDRVDRRMVR ATFDRRFTAQ
RMANDYLDIY RALASGSQRV MPIHAANESN AGPGSARVA