Gene Sare_2263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2263 
Symbol 
ID5706715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2601429 
End bp2602649 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content73% 
IMG OID641271742 
Productglycosyl transferase group 1 
Protein accessionYP_001537113 
Protein GI159037860 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0495388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0132313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATCG GCGTCCTGAC CTATCACTTC CCTCCAGAAC CAGCCTTCAT CCCGGGCAGC 
CTCGCCGAGG AGTTGGCCCG CCGCGGCCAC GAGGTCCGGG TGCTGACCGG GTTTCCGGAC
TACCCCGGCG GGCACTCCTA CCCTGGCTGG CACCAGCGTT GGCGCCACGA GACCCACAGC
GAACGGCTGA CCGTACGGCG AGTGCCGCGC TACACCGGCC GGAACGGCTC CGTCCGCGGT
CGGGCAGCGG GCCCCCTGTC GTTCGCCGGC AGTGTGTCGC TGGTCGGCCG CCGGTTCCTC
GCCGGCATCG ATGCGCTCTA CGTCCACCAG CCGCCCCCGG CCGCCTTCGC CACCGCCAGC
CTGCTTCGGA TGCTCGGCCG AGTGCCGACC GTCGTGCACG TGCCGGACGT GTGGGCCGGC
GGTTCGGAAC CAACGGCGGG TGAGCCCAGC CGGTGGGCTG GCCGGATCGC TGGTGCGATG
GCGTGGACCT ACCGCCGAGC CGACCGGATC GTGGTCGCCG CACCCTCCCT GCGGGACCTC
GTCGTGACAG CGGGCGCCGA CCCGGATCGC GTCGAGGTGG TGCTCAACTG GACCGACGAG
CGGATCTTCC GGCCAGCCCG GCCCAGCCCG GCAGCCGGTA AACTGGTCCG TCGCGATGGC
CGCTGCGTGG TCATGTACGC CGGCACCATC GGTGCCCGGC AGGGGCTGGA GACAGCGGTA
CGGGCGGCAG CGGCCGTCGA CCGTGGGATG GATCTCGTGT TGGTCGGGTC GGGCGAGCAG
GAGCGGCGGG TGCGGGGGCT CGCCGCCGAA CTGCGTACCG ACAACGTGCG GTTCGTCGAG
CGACGCTCGC CGTTGGACAT GCCGGAGCTG TACGCGGCCG CCGACTACCA GTTGGTCATG
CTCCGGGACC TGCCCGAACT GCGCGGCACC CTCCCCGGCA AGTTGCCGAC GGCCCTGTCG
TGCGCGGCGC CGGTCATCGC CTCGGCCGGT GGCGACACCG CCGAGGTGGT GGAGCGGGCC
AGGGCCGGGC TGTCCTGCCC ACCGGAGGAA TGGGACGCGC TCGCCGACCG GTTCTGGCTG
GCCGCCACCA TTCCGCCGGC CGCCCGTGCG GAGATGGGCC GGCGGGGCCG GGAGGCGTAC
CTGCGGCAGA TGTCGATGCC CGCTGGTGTG GATCGGATCG AGCGCCTGCT GAGCGAGGCC
GTCGGCGGAC GTCGCCGGTG A
 
Protein sequence
MRIGVLTYHF PPEPAFIPGS LAEELARRGH EVRVLTGFPD YPGGHSYPGW HQRWRHETHS 
ERLTVRRVPR YTGRNGSVRG RAAGPLSFAG SVSLVGRRFL AGIDALYVHQ PPPAAFATAS
LLRMLGRVPT VVHVPDVWAG GSEPTAGEPS RWAGRIAGAM AWTYRRADRI VVAAPSLRDL
VVTAGADPDR VEVVLNWTDE RIFRPARPSP AAGKLVRRDG RCVVMYAGTI GARQGLETAV
RAAAAVDRGM DLVLVGSGEQ ERRVRGLAAE LRTDNVRFVE RRSPLDMPEL YAAADYQLVM
LRDLPELRGT LPGKLPTALS CAAPVIASAG GDTAEVVERA RAGLSCPPEE WDALADRFWL
AATIPPAARA EMGRRGREAY LRQMSMPAGV DRIERLLSEA VGGRRR