Gene Sare_2328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2328 
Symbol 
ID5704252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2676954 
End bp2678255 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content67% 
IMG OID641271806 
Productglycosyl transferase family protein 
Protein accessionYP_001537177 
Protein GI159037924 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0735299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00124939 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTTGATCG CTACCACACC GGCACCCGGA CACGTCGTCA GCATGGTGGA TGTGGCGGGC 
GAACTGACTC GGCGTGGACA TGAGGTGCGG TGGTACACCG GCCGTGCCTT TCAGGAGCAG
GTCGAGCAGG CCGGGGCGCG CTTCGAGCCG ATGAGTGAGG CACTCGACTT CGGCGGCAGG
AGTCGGGAGG AGGCGTTTCC CAGCCACGCC GGGCTGACCG GCCTCGCCAG TTTCAAGATC
GGCGTGCGTG ACATCTTCTA CCACACGGCG CCCGGCCAGC TCGACGACCT ACTTCGCGTG
CTCGAACGCT TTCCCGCCGA CTGCATCCTC GCCGATGACA TGTGTTATGG CGCGTGTTTC
GCGAGCGAGC GAACCGGCCT GCCGATGGCC TGGCTGAGCA ATTCGATCTA CATCCTGGGC
AGTAGGGATA CCGCACCGCT CGGGTTGGGG CTGCAACCGA GTTCGTCGCC GCTGGGACGG
GCCCGTAACG CTCTGCTGCG ATTCCTCGGT GACCATGTGT CCATGCGAGA CCTACGCCGG
GAGGCCGACC GCACGCGAGC CTCGGTGAAC CTGCCGCGGC TGAGGACACG GGCCATGGAG
AACATCACGC GCCCCCCAGA TTTGTACCTG GTGAGCACCG TGCCGTCCTT CGAGTTCTCC
CGCAGCGATC TCCTACCAGG CACACACTTC ATCGGCGGTC TCTTCGGACT TCCCCCGGAG
CGGTTCGAGC CGCCCAGTTG GTGGCAGGAG TTGGATGGAG ACAAGCCGGT GGTGCTCATC
ACCCAGGGCA CCACCGCCAA CGACGTCGAC CGGCTACTCG TTCCCGCAGT CCGGGCGCTG
GCCCGCGAGG ACCTGCTCGT CGTGGTGACC ACGGGAAGCG ACCTGGATGT CGACCTGCTA
CGGCCGCTAC CCGGCAACGT CCGGTTGGAG CGGTTCGTTT CCTATCACCA TCTGTTGCCC
CGCGTGGACG TGATGCTGAC CAACGGCGGC TACAACGGTG TCAACGCCGC CCTCGCCCAT
GGCGTACCCC TGGTCGCCGC TCCGGCGACC GAGGAGAATC CCGACGTCGC GGCCCGGATC
GCGTGGTCCG GAGCGGGCGT CGTTCTCGCC CGGCGCGCGG TGTCAGAGGC CACCCTGCGT
AACGCCGTAG TCACCGTTCT GCACGACGAG CGCTACCGGC AACGGGCACA CGTGCTGTCC
CGCGAGCACC AGCGCTACGA CGCCCCGCGA CGAGCCGCCG AGCTCATCGA GGCCATGGCC
GAATCCCAGG GCCGAGTCCC TACCGGAGGT CCCACCCAAT GA
 
Protein sequence
MLIATTPAPG HVVSMVDVAG ELTRRGHEVR WYTGRAFQEQ VEQAGARFEP MSEALDFGGR 
SREEAFPSHA GLTGLASFKI GVRDIFYHTA PGQLDDLLRV LERFPADCIL ADDMCYGACF
ASERTGLPMA WLSNSIYILG SRDTAPLGLG LQPSSSPLGR ARNALLRFLG DHVSMRDLRR
EADRTRASVN LPRLRTRAME NITRPPDLYL VSTVPSFEFS RSDLLPGTHF IGGLFGLPPE
RFEPPSWWQE LDGDKPVVLI TQGTTANDVD RLLVPAVRAL AREDLLVVVT TGSDLDVDLL
RPLPGNVRLE RFVSYHHLLP RVDVMLTNGG YNGVNAALAH GVPLVAAPAT EENPDVAARI
AWSGAGVVLA RRAVSEATLR NAVVTVLHDE RYRQRAHVLS REHQRYDAPR RAAELIEAMA
ESQGRVPTGG PTQ