Gene Strop_3247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3247 
Symbol 
ID5059712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3722387 
End bp3723514 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content74% 
IMG OID640475495 
Productglycosyl transferase family protein 
Protein accessionYP_001160059 
Protein GI145595762 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.14939 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.122558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTGT TGCTGGCGGT GCTGGCCGGC GTGGCCGCGC TGACCGCGCA CACCCTGGTC 
AACGCCGGCC GTTGGCTGCG CCGCCCGGCC GGGACGCCGG CAACGGTGAC CGAACCGGTG
GCGGTGCTGC TGCCGCTGCG CGACGAGGCT GCCCGAGTCA CCCCATGCCT GCGCGCGCTG
CTGGCCCAGC GCGACGTACC AGAGCTACAG ATCGTGGTGC TCGACGACGG GTCAACCGAC
GGCACCCGCG AGGTCGTCCG CACGGTCGCC GGCGACGACT CCCGGGTCAC CCTGCTCGAC
GGCGGCGCTC CACCGCCCGG TTGGCTGGGC AAGCCGCACG CCTGCTGGCA GCTCGCCACC
CGGGCCGATC CGGCCGCCAC CGTGCTGGTC TTCGTCGACG CCGACGTGGT GCTCGCCCCG
CACGCCGTGG CCGCGGCGGT CGGCGAGCTA CGCGCCGCGC GGGTGACGCT GCTGTCGCCG
TACCCCCGAA TCCTGGTCAC GACGGTGGCC GACCGGCTGG TTCAGCCGCT GTTGCAGTGG
TTGTGGCTGA CGTTCCTGCC ACTGCCCGCG ATGGAACGGT CGGCCCGGCC GTCCCTGGCC
GCGGCCGGTG GGCAGTTCCT GGTCGTGGAC CGGGTCGGGT ACAACGCCGC CGGTGGACAC
GCAGCGGTGT CCGACCGGGT TCTGGAGGAT GTCGAGTTGG CCCGGGCGGT CAAACGGTCC
GGCGGCCAGG TCGCCCTCGC AGACGGCTCG CAGCTGGCCA CCTGCCGGAT GTACGACGAC
TGGCCGCAGC TACGCGACGG CTACTCGAAG TCGCTGTGGG CCTCGTTCGG TCATCCCTCG
GCGGCAGCCA CGGTGGTCGC GCTGCTGCTG CTGCTCTACA CCGTCCCCGC GCTGGTCGCC
GTGGCCGCGC TGGTCGGCGG CGCGCCAGGG GCAGCCGCCG TCGCCGCTGC GGCATACCTG
CTCGGGGTCG CCGGGCGAGT GGTCAGCGCC CGGGCGACCA GCGGCCGGTG GTGGCCAGAC
GCGTTGGGGC ATCCCGCGTC GGTAGCGGTC CTCGGTTGGC TGACCCTACG GTCGTACCAT
CTGCGGAAGC GACGGCGCCT GAGTTGGCGG GGCCGTCCGG TCGTCTAG
 
Protein sequence
MILLLAVLAG VAALTAHTLV NAGRWLRRPA GTPATVTEPV AVLLPLRDEA ARVTPCLRAL 
LAQRDVPELQ IVVLDDGSTD GTREVVRTVA GDDSRVTLLD GGAPPPGWLG KPHACWQLAT
RADPAATVLV FVDADVVLAP HAVAAAVGEL RAARVTLLSP YPRILVTTVA DRLVQPLLQW
LWLTFLPLPA MERSARPSLA AAGGQFLVVD RVGYNAAGGH AAVSDRVLED VELARAVKRS
GGQVALADGS QLATCRMYDD WPQLRDGYSK SLWASFGHPS AAATVVALLL LLYTVPALVA
VAALVGGAPG AAAVAAAAYL LGVAGRVVSA RATSGRWWPD ALGHPASVAV LGWLTLRSYH
LRKRRRLSWR GRPVV