Gene Sare_5004 details

Gene Information

Locus tag: Sare_5004
Symbol:
ID: 5705744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5672190
End bp: 5673497
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 72%
IMG OID: 641274397
Product: glycosyl transferase group 1
Protein accession: YP_001539738
Protein GI: 159040485
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
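
The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5672190
end_bp = 5673497

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values are 1308 bp and 435 aa, matching the Gene Length and Protein Length fields.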


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000372843
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATCG TGGTGGCACA CAACCGGTAC CGGGAGGCTC AGCCGTCCGG CGAGAACACC 
ATCGTCGACG GGGAGATCAC CCAGCTGACC GCGGCCGGGG TCGAAGTGCT GCCGTTCCTG
CGTAGCTCCG ACGAGATCGG GTCAATGTCC ACGCCCGCGA AGGCGCTGTT GCCGATCTCC
CCGCTCTACG CTCCCCGGGC CCAGCGGGAG CTGGGTCGCC TGCTCGACGA GCACCGGCCG
GACGTGCTGC ACCTGCACAA CCCGTACCCG CTGCTCTCCC CCTGGGTGGT GCGGACCGCG
CACCGTCATG GCGTGCCGGT GGTCCAGACG GTGCACAACT ACCGGCAGGT CTGCTCGTCC
GGGCTGTACT TTCGGGATGG GATGATCTGC CAGGACTGCC GGGGGCGGGT ACTGGGCGTA
CCAGCGATCG TGCACCGCTG CTATCGGGGG TCGCGGGCGC AGAGCGCACT GATGGCGACG
ACGCTTGCCG CACACCGAGG CACCTGGCGC TCGGTGGACC GATTCATCGC GCTCACCACG
GCGATCGCCG AGCACCTCGG GGACTACGGC ATCCCGCAGC AGCGGGTCGT GGTCAAGCCG
AACGCGGTCC CCGACCCAGG TGCTCCGGCG CCGCTGGGCA CCGGCTTTCT CTTCCTCGGC
CGGCTCACCC CGGAGAAGGG GCTGGACCTG CTGCTCGACG CCTGGCGCCG GCATCCGGAC
GGTGCGCTCG GCCCGCTCCG CATCGCCGGT GACGGCGAGT TACGACCACT GGTGCAGCAG
GCCGCGGAGC AGCGGGCTGA TGTGACCTTC CTCGGCCCGC TGGACCGGGA CGGGGTCCAC
GCCGCACTGG TGGCCAGCGC GGTGGTGCTG GCCACCTCCA CCTGGCACGA CGTGCTGCCG
ACCGTGATCA TTGAGGCGTT GGCCGCCGGC CGGCCGGTGC TCGGCACCGC CCTCGGGGGT
ATCCCGTACC TGGTGGGCGC CGATACCCCC CGTGAACCCG CCGGTACCGG ACCGGCCGAT
GTTGCATCGG CTGCCGCCGC GGCGACCGGC CCAGGGTCAC CGCCCCCGGT GGCGGCCACC
GCCGTGCCGG TCGGGATCCT GGTCGGGGAG GCGGGCTGGG TGGTGCCGCC GGATCCGGCC
GCACTGGCCG CCGCGCTGCC GGTGGCCGCC GCTGGTGCCG CCACCCGGGC ACCGGCCGCG
CGGGCCCGCT ACGAGCGCAC CTTCCATCCG GACGTGGTCA CGCGGCGCCT GATCGAGATC
TACACCGACA CTGCCCGCAC GGCGGGACCG GGCCGGTCAA CGCAGTGA
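
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the function name `gc_percent` is hypothetical, and the whitespace handling reflects the 10-base grouping used in this listing.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Remove the spaces and line breaks used to group bases in the listing.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# Small illustration on a toy string, not on the gene itself.
print(gc_percent("ATGC"))
```

Passing the complete gene sequence to this function gives a value that can be compared with the GC content reported in the Gene Information section.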
 
Protein sequence
MKIVVAHNRY REAQPSGENT IVDGEITQLT AAGVEVLPFL RSSDEIGSMS TPAKALLPIS 
PLYAPRAQRE LGRLLDEHRP DVLHLHNPYP LLSPWVVRTA HRHGVPVVQT VHNYRQVCSS
GLYFRDGMIC QDCRGRVLGV PAIVHRCYRG SRAQSALMAT TLAAHRGTWR SVDRFIALTT
AIAEHLGDYG IPQQRVVVKP NAVPDPGAPA PLGTGFLFLG RLTPEKGLDL LLDAWRRHPD
GALGPLRIAG DGELRPLVQQ AAEQRADVTF LGPLDRDGVH AALVASAVVL ATSTWHDVLP
TVIIEALAAG RPVLGTALGG IPYLVGADTP REPAGTGPAD VASAAAAATG PGSPPPVAAT
AVPVGILVGE AGWVVPPDPA ALAAALPVAA AGAATRAPAA RARYERTFHP DVVTRRLIEI
YTDTARTAGP GRSTQ
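
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, applied only to the first and last rows of the gene sequence. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional base ordering T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Remove the spaces used to group bases in the listing.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First and last rows of the gene sequence as listed in this record.
# Every full row holds 60 bases, so the last row begins in frame.
first_row = "ATGAAAATCG TGGTGGCACA CAACCGGTAC CGGGAGGCTC AGCCGTCCGG CGAGAACACC"
last_row = "TACACCGACA CTGCCCGCAC GGCGGGACCG GGCCGGTCAA CGCAGTGA"

print(translate(first_row))  # MKIVVAHNRYREAQPSGENT
print(translate(last_row))   # YTDTARTAGPGRSTQ
```

The two results match the first 20 and last 15 residues of the protein sequence above, and the final codon TGA is the stop codon.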