Gene Strop_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0939 
Symbol 
ID5057383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1054456 
End bp1055601 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content71% 
IMG OID640473209 
Productglycosyl transferase, group 1 
Protein accessionYP_001157794 
Protein GI145593497 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.872922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.705324 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGCCG GCAGCCCGCC CCGTGTGCTC ATTGACGCCA CGAGTGTTCC CGCCGATCGT 
GGCGGCGTCG GTAGATATGT TGACGGCCTG CTCGGCGCGC TCGGGAAGGT CTGCGGCACC
AGCGTTGACC TGGTCGTGGT CAGCCTTCGG ACGGATCTTG AACGCTATAC CCGGATGCTG
CCCGGGGCGG AGATCATCCC CGCCCCGGCC GCTGTGGCGC ACCGGCCCGC GCGGCTCGCC
TGGGAACAGA CCGGCCTGCC GCTGCTCGCC CAGCAGGTCG GCGCCCAGGT GCTGCATTCG
CCCTTCTACA CCTGCCCGCT GCGGGCGGGC TGTCCGGTCA CCGTGACCGT GCACGACGCC
ACCTTCTTCA CCGAGCCAGA GCACTACGAC AAGTCCCGTC GCACCTTCTT CCGCAGCGCG
ATCCGGACGT CGTTGCGCCG CGCCGACCGG GTGATCGTGC CCAGTAAAGC CACCCGGGAC
GAGCTGATTC GGCTGTTGGA CGCTGACCCG ACCCGGATTG ATGTCGCGTA CCACGGGGTT
GATCATGTCG CGTTCCACGC CCCGAGCGCC GAGGAGAAGG CCCGGGTCCG GGCCCGGCTG
GGGCTCGGCA GCCAGAGCTA CGTCGCGTTC CTCGGTGCCA AGGAGCCCCG CAAGAACGTT
CCCAACCTCA TTCGGGGCTG GGCGCGGGCC GTGGCGGACC GGCACCAGCC GCCAGCCCTG
GTGGTCGCCG GGGGGCAGGG GCACGACGAC GAGATCGATC GCGCGGTCGC CGAGGTGCCG
TCGCACCTGC GCCTGCTCCG CCCCGGTTAC CTGCGCTACG CCGACCTGCC GGGTTTCCTC
GGTGGGGCCT TGGTCTCCGC CTACCCGTCG TACGGCGAGG GGTTCGGCCT GCCGATCCTG
GAGGCGATGG CCTGTGCGGC GCCGGTGCTG ACGACGCCCC GGCTCTCTCT GCCCGAGGTG
GGCGGCGAGG CGGTCGCGTA CACCAGCGAG GCACCGGATC AGATCGCCGC CGACCTGGCC
GCGTTGCTCG ACGACGAACA CCGCCGGCTG GCGCTGGCCC AGGCCGGGTT CGACCGGGCC
AAGGAGTTCA CCTGGCAATC CAGCGCCGAC GTGCACCTCG CCGCCTGGTC GCGGGCCCGG
TCGTGA
 
Protein sequence
MTAGSPPRVL IDATSVPADR GGVGRYVDGL LGALGKVCGT SVDLVVVSLR TDLERYTRML 
PGAEIIPAPA AVAHRPARLA WEQTGLPLLA QQVGAQVLHS PFYTCPLRAG CPVTVTVHDA
TFFTEPEHYD KSRRTFFRSA IRTSLRRADR VIVPSKATRD ELIRLLDADP TRIDVAYHGV
DHVAFHAPSA EEKARVRARL GLGSQSYVAF LGAKEPRKNV PNLIRGWARA VADRHQPPAL
VVAGGQGHDD EIDRAVAEVP SHLRLLRPGY LRYADLPGFL GGALVSAYPS YGEGFGLPIL
EAMACAAPVL TTPRLSLPEV GGEAVAYTSE APDQIAADLA ALLDDEHRRL ALAQAGFDRA
KEFTWQSSAD VHLAAWSRAR S