Gene Strop_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2080 
Symbol 
ID5058543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2354037 
End bp2355380 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content72% 
IMG OID640474343 
Productglycosyl transferase, group 1 
Protein accessionYP_001158909 
Protein GI145594612 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.371281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.082698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATTGCG ACGGTGTCGC CGCGGCTGGC CGTGGTCGGG TGGTCATGCT CGTGGACAAC 
AGCATCGACG GCGACTCCCG TGTGCAGAAG GCTGCCCGCT CGGCAGCTGA CGCGGGCTGG
GACGTCACCC TGCTGGGCTG TGGCGACACC GCGCGGGAGG GACAACTTGG TACCGCTACC
GTCCGCGTGC TGGCGATGCC CCGCCCCGCC GCGCGGTCCG GGGTCCGGCA GCGCCTCGTC
GCCCGCGGTG GAGCGCTACT GCGGGTGGCG CGACTGCTTC GTCGGCCGTC GGAGTACGTC
CAGGTGCTGC TCTGGCACGC GATCTCCGGC GACCGGGCGT GGCGGCGGCT CGAGCCGGGG
CTGTGGACGT ACGAACGGGT GTTCGGTCCG GTGGTTGACG CGGTGGCGCC GGACCTGATC
CACGCCCACG ACTTCCGGAT GCTCGGCGTC GCCGCCCGGG CCGTGGAGCG GGCGGAGGCG
GTGGGCCGGC AGGTGAGGCT GGTGTGGGAC GCGCACGAGT GGTTGGCCGG TGCTCGACCA
CGCCGCGACA ACGTGCGGTG GCTGCCGGCC CATCTCGCGT ACCTGCGGGA GTACGTGCCA
CGCGTTGACG GCGTGGTCAC CGTCTCGGCT GCCCTGGCCG ACCTGCTCCG CAGCGAGTAC
CGGCTGGCCG AGGAACCGAC CGTACTGCTC AACGCCCCGG CGGTGGCGGA CCCGCCACCG
CCGGACGTGC CGGACCTGCG GGCCCGGTGC GGAGTCGGGC CGGAGACCCC CCTGCTGGTG
TACAGCGGGG CACTTGCCGA GCAACGGGGA GTGGGAACGG TGATCGAGGC GTTGCCGCGT
CTTCCCGATG TCCACCTTGC GCTGGTGGTC GGTGACGTGA GCGCGCCGTA CCTGCGGCAG
CTGTTGGATG TCGCTGCGCG GCTGGGCGTG GCCGACCGGG TGCACCCCCA GCCGTACGTG
CCGCACCAGC AGGTGAGCGC CTTCCTCAGC GCCGCCGACG TGGGGCTGAT CCCGCTGCAC
CACTGGCCGA ACCACGAGAT CGCCTTGATC ACAAAGTTCT TCGAGTACGC CCACGCCCGG
CTGCCCATTG TGGTCAGCGA CGTGCAGACG ATGGCCGACA CCGTCCGCGC TACCGGCCAG
GGCGAGGTGT TCCGGGCGCG GGACGTCGGT GACCTCGTCC GCGCCGTGCA GGCGGTGCTC
GCCGAACCGC AACGGTATCG CCGAAGCTAC GACGGGCCGG ACTCGCCCCT GGTTGACTGG
ACCTGGGAGG CGCAAGTAGA CCGCCTCGAC GCCCTCTACC GCCGGTTGCT GAGCCCGACC
GTCGAGTCCG TGGAGACGTC GTGA
 
Protein sequence
MHCDGVAAAG RGRVVMLVDN SIDGDSRVQK AARSAADAGW DVTLLGCGDT AREGQLGTAT 
VRVLAMPRPA ARSGVRQRLV ARGGALLRVA RLLRRPSEYV QVLLWHAISG DRAWRRLEPG
LWTYERVFGP VVDAVAPDLI HAHDFRMLGV AARAVERAEA VGRQVRLVWD AHEWLAGARP
RRDNVRWLPA HLAYLREYVP RVDGVVTVSA ALADLLRSEY RLAEEPTVLL NAPAVADPPP
PDVPDLRARC GVGPETPLLV YSGALAEQRG VGTVIEALPR LPDVHLALVV GDVSAPYLRQ
LLDVAARLGV ADRVHPQPYV PHQQVSAFLS AADVGLIPLH HWPNHEIALI TKFFEYAHAR
LPIVVSDVQT MADTVRATGQ GEVFRARDVG DLVRAVQAVL AEPQRYRRSY DGPDSPLVDW
TWEAQVDRLD ALYRRLLSPT VESVETS