Gene Sare_2749 details


Gene Information

Locus tag: Sare_2749
Symbol:
ID: 5708351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3130789
End bp: 3131823
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 76%
IMG OID: 641272205
Product: glycosyl transferase group 1
Protein accession: YP_001537575
Protein GI: 159038322
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
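
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and only serve this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a complete CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_length = gene_length_bp(3130789, 3131823)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)  # 1035 344
```

Both results match the Gene Length (1035 bp) and Protein Length (344 aa) fields listed in the record.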


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.611394
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00344637
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCCGTA CCCTGCACGC GGTGCTGCCG GGAGACGTCG ACGATCCGGA CAGTCCAAGC 
GGCGGCAACC GGTATGACCG ACGGGTGCTC GACGGCCTGT CCGCCGCCGG CTGGTCGGTC
CACGAGCACC CCGTTGCCGG GGACTGGCCG CACCCGGCGC CCGCGAACCG TGCCGCGCTC
GCCGGCATGC TCGGCGCGCT GCCCTCCGGG TCGCTTGTCC TGCTCGACGG GCTCGTCGCC
GGCGCGGTAC CGGAGATCCT GGCCCCGAAC GCCGGGCGGC TACGCCTGGT GGTGCTCGTG
CACCTGCCGC TGGGCGGCGG TGCGGAACGT GCGGCCCTGC GCCACGCGAC AGTGGTCGTG
GCCACCAGTA CGTGGACCCG TCGGTGGCTG CTCGACCGGT ACCGGCTGCC CGCCGACCGG
GTGCGGGTCG CGGTACCCGG GGTCGACCGT GCCGCCGCCG TGCCCGGTTC TCCGGCGGGT
GGGCGACTGC TCTGCGTGGC CGCGGTCACC CCACACAAGG GACACGACAC ACTCGTCGCC
GCCCTCGCGG CGGTCGGTGA GCTGGACTGG CGCTGCGACT GCCTCGGGCC GCTCGGCCGG
GATCCCGGCT TCGTCGAGCG GCTGCGGCGG CACATCGCCG CACTCGGACT CGCCCAGCGG
GTGCGCCTCG TCGGACCGCG CACCGGTCCT GCCCTGGCCG CCGGATACGC CACCGCCGAC
CTGCTGGTGC TGGCCTCACG CGTCGAGACG TACGGGATGG TGGTGACGGA GGCTCTCGCC
CGGGGCGTCC CGGTGCTGAC CACCACCGCA GGTGGGTTAC CGGCCACCCT CGGTCGCGCC
CCGGACGGCG CCGCGCCGGG ACTGCTGGTG CCCCCGGATG ACCCGGCGGC CCTCGCCGGA
GCGCTACGCC GCTGGCTCAC CGACCCGACC CTGCGGAACC GGCTGCGTCG CGCCGCGCGC
GACCGCCGGG AGACCCTCAC CGACTGGACC ACCACCACCA TGTCGCTCGC GGCGGCGCTG
GAAGGAACGG AGTGA
 
Protein sequence
MTRTLHAVLP GDVDDPDSPS GGNRYDRRVL DGLSAAGWSV HEHPVAGDWP HPAPANRAAL 
AGMLGALPSG SLVLLDGLVA GAVPEILAPN AGRLRLVVLV HLPLGGGAER AALRHATVVV
ATSTWTRRWL LDRYRLPADR VRVAVPGVDR AAAVPGSPAG GRLLCVAAVT PHKGHDTLVA
ALAAVGELDW RCDCLGPLGR DPGFVERLRR HIAALGLAQR VRLVGPRTGP ALAAGYATAD
LLVLASRVET YGMVVTEALA RGVPVLTTTA GGLPATLGRA PDGAAPGLLV PPDDPAALAG
ALRRWLTDPT LRNRLRRAAR DRRETLTDWT TTTMSLAAAL EGTE
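
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record. It assumes that translation table 11 assigns the same amino acids as the standard code, and that an alternative initiation codon such as GTG is read as methionine when it is the first codon. The start codon set below is only a subset of the initiators table 11 permits, and the names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks of the record can be
    pasted in directly. If first_is_start is True, a recognised start
    codon at position 0 is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCCGTA CCCTGCACGC GGTGCTGCCG"))  # MTRTLHAVLP

# Last 15 bases of the gene sequence above (not a start, ends in TGA).
print(translate_cds("GAAGGAACGG AGTGA", first_is_start=False))  # EGTE
```

The two outputs agree with the first ten and last four residues of the protein sequence shown above.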