Gene Sare_2047 details

Gene Information

Locus tag: Sare_2047
Symbol:
ID: 5705492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2341224
End bp: 2342411
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 71%
IMG OID: 641271534
Product: glycosyl transferase family protein
Protein accession: YP_001536905
Protein GI: 159037652
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
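
The coordinates and lengths listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2341224
end_bp = 2342411

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp")      # matches the "Gene Length" field
print(protein_length, "aa")   # matches the "Protein Length" field
```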


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.555181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0026549
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGATCGACG TGCGTGTGCT GTTCGCCAGT CTCGGAACAC ACGGCCACAC CTACCCACTC 
CTGCCGCTGG CCGCCGCCGC TCGGGACGCC GGCCACGAGG TAACCTTCGC CACCGGTGAG
GGCTTCGCCG AGGTGTTGCG TGCGCAGGGC TTCGACCCGA TCGCCACCGG GATGCCGGTC
TTCGACGGCT TCCTGGCGGC GCTACGGATC CGCTTCGATA CCGACAGCCC CGATGGGCTG
ACACCCGAGC AGCTCAGCGA GCTTCCCCAG ATCGTGTTCG GGCAGGTGAT GCCGCAGCGC
ATCTTCGACA GGCTCCAACC GGTGCTCGAC CGGGTGCGAC CCGACCTCGT GGTGCAGGAG
ATCAGCAACT ACGGCGCAGG ACTTGCCGCC ACCAAGGCCG GCATCCCGAC CATCTGCCAC
GGAGTCGGCC GTGACACCCC GGACGAGCTC ACCCGCTCCA TCGAGGACGA GGTGGGCAGG
CTCGCCGCTC AGCTCGGCAT CGACCTGCCG CCCGGGCGTA TCGACGCCTT CGGCAACCCG
TTCCTCGACA TCTTTCCGCC GTCGTTGCAG GAGCCGGCGT TTCGTTCCCG CCCCGAGCGG
TACGAGTTGC GCCCGGTGCC GTTCACCGAA CGGCCGAAAG TGCCGGACTG GGTACTCGCG
CGGACCAGGT CCCGGCCCCT GGTGTATCTG ACCCTGGGCA CCTCCAGCGG CGGCACCGTC
GAGGTGCTGC GGGCCGCGAT CGACGGCCTG GCCACCCTGG ACGTCGACGT CCTCGTCGCG
GGCGGCCCGT CGCTCGATCT CGCCCAGCTC GGCGAGGTGC CGACCAGCGT GCGGCTGGAG
TCGTGGGTCT CGCAGGCGGC GCTGCTTCCC CACGTCGACC TCGTGGTCCA TCACGGTGGC
AGCGGGACCA CCATCGGCGC GTTCGACGCT GGCGTGCCGC AGCTCTCCTT TCCGTGGGCG
GGTGACTCGT TCGCGAACGC CCAAGCCGTG ACCCAGGCGG AGGCCGGTGA CCACCTGCCG
CCCGGCGGTG TCAACGCCGA GGCGGTGGCG GACGCCGCGA AGCGGCTGAT CGCCGACGAG
AGCTACCGGA CGGCGGCGAA GGCGGTCGCC GTCGAGATCG CCGCGATGCC GACCCCCGAC
GAGGTCGCCC GCCGGCTGCC CGAGTTCGCC GGACGGCGGG CCGCCTGA
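
The GC content reported for this gene can be recomputed from the sequence above. The sketch below shows one way to do it, assuming the sequence is pasted as plain text with the space-separated 10-base blocks used in this record. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks used in
    this record) is ignored, and case is normalized.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the first 10-base block of the gene sequence only.
first_block = "GTGATCGACG"
print(round(gc_percent(first_block), 1))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.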
 
Protein sequence
MIDVRVLFAS LGTHGHTYPL LPLAAAARDA GHEVTFATGE GFAEVLRAQG FDPIATGMPV 
FDGFLAALRI RFDTDSPDGL TPEQLSELPQ IVFGQVMPQR IFDRLQPVLD RVRPDLVVQE
ISNYGAGLAA TKAGIPTICH GVGRDTPDEL TRSIEDEVGR LAAQLGIDLP PGRIDAFGNP
FLDIFPPSLQ EPAFRSRPER YELRPVPFTE RPKVPDWVLA RTRSRPLVYL TLGTSSGGTV
EVLRAAIDGL ATLDVDVLVA GGPSLDLAQL GEVPTSVRLE SWVSQAALLP HVDLVVHHGG
SGTTIGAFDA GVPQLSFPWA GDSFANAQAV TQAEAGDHLP PGGVNAEAVA DAAKRLIADE
SYRTAAKAVA VEIAAMPTPD EVARRLPEFA GRRAA