Gene Sare_2036 details

Gene Information

Locus tag: Sare_2036
Symbol:
ID: 5705690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2330656
End bp: 2331849
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 71%
IMG OID: 641271526
Product: glycosyl transferase family protein
Protein accession: YP_001536897
Protein GI: 159037644
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
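
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one trailing stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2330656
END_BP = 2331849
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2331849 - 2330656 + 1 = 1194 bp, and 1194 / 3 - 1 = 397 aa, which agrees with the listed gene and protein lengths.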


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000746123
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
GTGGCCCATC TGCTGTTCGT CAACGTGGCC AGCCACGGTC TGGTCCTGCC CACCCTGGCG 
GTGGTCACCG AGCTGGTCCG GCGCGGACAC CGGGTCAGCT ACGTCACCGC GGGCGGGTTC
GCCGTGCCGG TCGCGGCGGC CGGCGCGACC GTGGTGCCGT ACGCCTCGGA GATCATCGAC
GCTGACGCCG CCGAGGTGTT CGGTGCCAAC GACCTCGGTG TCCGACCGCA CCTGATGTAT
CTGCGAGAGA ACATGTCGGT GCTGCGGGCC ACCGCTGCCG CGCTCGACGA CGACGTCCCG
GACCTGGTTC TCTACGATGA CTTCCCGTTC ATCGCCGGGC AGCTGCTGGC CGCCCGCTGG
GATCGACCGG CCGGCCGGCT CAGCGCCGCC TTCGCCTCCA ACGAGCACTA CTCCTTTTCC
CAGGACATGA TCGGGTTGGC CGGGACGATC GACCCGCTGG ACCTCCCGGC GTTCCGGGAC
AACCTGGCGG CGTTGCTCGC CGAGCACGGT CTGACCCGGT CGGTGGTCGC GTGCTGGCAG
CACGTCGAGC AGTTCAACCT GGTGTTCGTG CCGAAGGCGT TCCAGATCGC CGGGGAGTGC
TTCGACGAGC GGTTCGAGTT CGTGGGGCCC TGTTTCGGGC AACGACGCTA TCTCGGGCGG
TGGACACCCC CCACCGACGA CCGACCGGTC GTGCTGGTGT CGCTGGGCAC CACCTTCAAC
GACCGGCCGG GGTTCTTCCG TGACTGCGCC CGCGCGTTCG CCGATCAGCC CTGGCACGTG
GTGATGACCC TCGGCGACCA GGTCGATCCG GCGCAGCTCG GTGAGTTGCC ACCGAATGTG
GAGGCGCACC CGTGGGTGCC GCACGTGGAG GTGCTGGAGC GGGCGAGGGT CTGCGTGACG
CACGGTGGCA TGGGCACCCT GATGGAGGCG CTGCACTGGG GGCGTCCACT GGTTGTCGTG
CCGCAGTCCT TCGACGTGCA GCCGATGGCC CGCCGGATCG ACCAGCTCGG TCTCGGTGTG
CTTCTGCCCG GGGCGAAGGC CGACGGGCAG GAGCTGCTCG CCGCTGTCGA GCGGGTGGCC
GGCGACCCGG CGCTGGCGCA GCGGGTGGCG GCGATGCGGG AGCAGGTGCG GCGGGCCGGC
GGCGCGTACC GCGCCGCCGG TGCGATCGAG GCGTATCTGT CCCGGCGCCG GTGA
 
Protein sequence
MAHLLFVNVA SHGLVLPTLA VVTELVRRGH RVSYVTAGGF AVPVAAAGAT VVPYASEIID 
ADAAEVFGAN DLGVRPHLMY LRENMSVLRA TAAALDDDVP DLVLYDDFPF IAGQLLAARW
DRPAGRLSAA FASNEHYSFS QDMIGLAGTI DPLDLPAFRD NLAALLAEHG LTRSVVACWQ
HVEQFNLVFV PKAFQIAGEC FDERFEFVGP CFGQRRYLGR WTPPTDDRPV VLVSLGTTFN
DRPGFFRDCA RAFADQPWHV VMTLGDQVDP AQLGELPPNV EAHPWVPHVE VLERARVCVT
HGGMGTLMEA LHWGRPLVVV PQSFDVQPMA RRIDQLGLGV LLPGAKADGQ ELLAAVERVA
GDPALAQRVA AMREQVRRAG GAYRAAGAIE AYLSRRR
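
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used by the database: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is written out by hand in NCBI's T, C, A, G ordering, and the sketch assumes that an initiator codon permitted by translation table 11 (such as the `GTG` that opens this gene) is read as methionine.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 assigns the same
# amino acids as the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"  # assumption: initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 18 bases of the gene sequence above.
    print(translate("GTGGCCCATC TGCTGTTC"))  # MAHLLF
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.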