Gene Strop_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0331 
Symbol 
ID5056769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp382614 
End bp384062 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content71% 
IMG OID640472603 
Productglycosyl transferase, group 1 
Protein accessionYP_001157194 
Protein GI145592897 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGAATT ACCGATCCCC GATCGGGAAG ATGACACCTG GCGTGGAGGT TGACCGAGAA 
GCGCCGGTCG TGCGGGCGGC AACCGTGGCA GAAGGAGCGG ACGTGGCGGA ACAGCACACC
GGTGTCGGTC ATCAGCGAGG TGCCCGTCCG TGGCCCCTGC CCCGCCGTAT CGCGACCCTC
TCCGTGCACA CCTCGCCGCT GCACCAGCCT GGCACCGGTG ACGCCGGCGG GATGAATGTC
TACATTCTGG AGGTCGCCCG GCGATTGGCC GAGGCGAACG TCGAGGTCGA GATCTTCACC
CGGGCGACCG CGGCCGACCT ACCGCCGGTG GTCGAGATGG TGCCGGGTGT GCACGTCCGG
CACATCATGT CCGGCCCGTT GGGTGGGCTG ACCAAGGAGG AACTGCCCGG CCAGCTCTGC
GCGTTCACCG CGGGGGTGCT TCGGGCCGAG GCCGTCCGGG CCGCGGGGCA CTACGACCTC
ATCCACTCGC ACTACTGGCT CTCCGGGCAG GTCGGCTGGC TGGCCAAGGA GCGTTGGGGG
GTTCCGCTGG TGCACACCGC GCACACCCTC GCCAAGGTCA AGAATGCGCA ACTCGCCGCC
GGGGACCGGC CGGAGCCCAA GGCTCGGGTG ATCGGCGAGG AGCAGGTGGT GGCGGAGGCC
GACCGCCTGG TCGCCAACAC CAAGACCGAG GCCGGTGACC TGATCGACCG GTACGATGCC
GACCCGACCC GGGTTGAGGT GGTCGAACCG GGGGTGGATC TGGCCCGGTT CTGCCCTGCC
TCCGGTGATC GCGCGCGGGC GCAGGTCCTC GCCCGTCGTC GGCTGGACCT GCCCGAGCGC
GGCTACGTGG TGGCGTTCGT CGGCCGGATC CAGCCGCTCA AGGCACCCGA CGTGCTGATC
CGTGCGGCGG CGGCGTTGCG CCAACGGGAT CCGGCCCTCG CCGATGACAT GACGGTGGTG
GTCTGCGGTG GCCCCAGCGG TAGCGGGCTC GAGCGGCCGA CCCACCTGAT CGAGCTGGCC
GCCGCGTTGG GCATCACCGA TCGGGTCCGG TTCCTGCCGC CGCAGACCGG CGACGACCTG
CCCGCCCTGT ATCGGGCGGC CGACCTGGTG GCGGTCCCGT CCTACAACGA GAGCTTCGGG
CTGGTGGCGT TGGAGGCGCA GGCCTGCGGT ACGCCGGTGG TGGCGGCCGC GGTCGGCGGC
TTGAACACCG CGGTACGCGA CGAGGTCAGC GGGGTCCTCG TGGATGGCCA CGACCCGGTC
GCATGGGCCC GTTCGCTGGG CCGCCTGCTG CCGGACGCCG GCCGGCGCGC GATGTTGGCC
CGGGGCGCGC AACGCCACGC CCGCAACTTC TCCTGGGATC GGACGGTGAA AGACCTGTTG
GATGTCTACG GCGAGGCGGT CGCCGAGCAC CGAACCCGAT TGTCTGACTT CGCCACCTGC
TCTCGGTGA
 
Protein sequence
MRNYRSPIGK MTPGVEVDRE APVVRAATVA EGADVAEQHT GVGHQRGARP WPLPRRIATL 
SVHTSPLHQP GTGDAGGMNV YILEVARRLA EANVEVEIFT RATAADLPPV VEMVPGVHVR
HIMSGPLGGL TKEELPGQLC AFTAGVLRAE AVRAAGHYDL IHSHYWLSGQ VGWLAKERWG
VPLVHTAHTL AKVKNAQLAA GDRPEPKARV IGEEQVVAEA DRLVANTKTE AGDLIDRYDA
DPTRVEVVEP GVDLARFCPA SGDRARAQVL ARRRLDLPER GYVVAFVGRI QPLKAPDVLI
RAAAALRQRD PALADDMTVV VCGGPSGSGL ERPTHLIELA AALGITDRVR FLPPQTGDDL
PALYRAADLV AVPSYNESFG LVALEAQACG TPVVAAAVGG LNTAVRDEVS GVLVDGHDPV
AWARSLGRLL PDAGRRAMLA RGAQRHARNF SWDRTVKDLL DVYGEAVAEH RTRLSDFATC
SR