Gene Sare_2933 details

Gene Information

Locus tag: Sare_2933
Symbol:
ID: 5705238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3321892
End bp: 3323310
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 65%
IMG OID: 641272382
Product: hypothetical protein
Protein accession: YP_001537750
Protein GI: 159038497
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
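
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record itself: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminating stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3321892, 3323310)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.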


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.697027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.639426
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCC TGTTCACAGT TTTTCCCTCG TACGTGCACC TGTATCCGCT GGCACCAGTG 
GCGTGGGCGC TGCAGAGCGT CGGACACGAG GTCCGGGTCG CCTCCTCCGG AAACTTCGCG
AGGGCAATCT CCAGCGTCGG CCTCACGCCG CTGTCGCTGG GTGATCCGGA TGCTGTCGAA
GCACGCCTGC GGCCGGGGGC CAAGCAGCCA CCGAATCCAC AGGAGGTCCT CGCGTACGCC
GACCTCATGG GGCTGGATTC CGCGGAACGC GAGAACTGGA TCGTCTTCTA CCAGTGGCTG
TTGAACCCAG TTTCGGACTA TGTGCGCGTG GATCAGCCCG AGGCCGCACA CCTTGTTCGG
TTCGCGCAAC GGTGGCAGCC AGACCTCGTG TTGTGGGATC CTATCTTTCC CGCCGGTGCG
GTGGCGGCTC GCGCGTGCGG TGCGGCCCAC GGTCGCTTCC TCGGGGCGGC CCTCGACTAC
TTCATGTACG GCACGGAGCG ACTCGAAGCG GCCCGTGACA AAGTACGCAA TGCCGGTCTA
TCGGACAATC CGCTCGCGGA CCTCATCCGC CCCTTGGCTG ATCACCACGA CGTTGACGTT
GACGATGAGC TCCTCAGGGG GCAGTGGACC GTGGATCCCA TGCCGGAAGG CGTCAGCCTC
TCCACGGGCG GTCACAAGGT TCCGGTTCGT TGGGTGCCCT ACGTGGGTGG TGAACCGTGT
CAGGAGTGGG TGCTCGATGG GCCCACGAGC CGACCGCGGG TCGTGCTGTC CCTTGGTGAG
TCGGCCCGAC GGTATGTTGC CGGGGACTGG GGGCGCACGC CCAAACTGCT GGACGCTCTG
GCGGGGATGG ATGTCGACGT CATCGCAACC CTGAATGAGC GCCAGCTACA GGGCATCTCG
ACTGTTCCGG ACAACGTCCG CGTCATCGAA TGGGTGCCGC TGACGCAGCT TATGCCTACC
AGCTCGTTGC TGATACATCA TGGCGGTACC GGCACGACGA TGTCGGCCCT GGCCAACCGC
GTGCCTCAGC TGGTCTGCGA CACAGATGAG TCGTTCCTGA TGGGTCCGGC CGACGTGGTG
CCGCGACTCG GTGATGCCGG GGTCTACCGC GCTGGTCGTG AGTTCGGAGT GACCGATGAC
GACGCCGACG GCGACGCCGA GCAGGAGGGC TGGGTGATCC CTGGTCGTCA CCTCATGGCG
CCACCGTGGT CGGGCGTGAT GACTCAGTAC GGTGCTGGTG AGCGACTCAA TCATCAGGTG
ATGTCCGGCA CCGAGATCCG TGACCGGATC ACGCACGTGC TCTCCGAGCC ATCGTTCGCA
GCTGGTGCGC GCGAGCTGTA TGAGGCGTGG ATGGCCAGGC CAAGCCCGAG CGACATCGTT
TCGACTCTGG AGTCCTTGAC CGCCGAGCAC CGCCGTTGA
 
Protein sequence
MRILFTVFPS YVHLYPLAPV AWALQSVGHE VRVASSGNFA RAISSVGLTP LSLGDPDAVE 
ARLRPGAKQP PNPQEVLAYA DLMGLDSAER ENWIVFYQWL LNPVSDYVRV DQPEAAHLVR
FAQRWQPDLV LWDPIFPAGA VAARACGAAH GRFLGAALDY FMYGTERLEA ARDKVRNAGL
SDNPLADLIR PLADHHDVDV DDELLRGQWT VDPMPEGVSL STGGHKVPVR WVPYVGGEPC
QEWVLDGPTS RPRVVLSLGE SARRYVAGDW GRTPKLLDAL AGMDVDVIAT LNERQLQGIS
TVPDNVRVIE WVPLTQLMPT SSLLIHHGGT GTTMSALANR VPQLVCDTDE SFLMGPADVV
PRLGDAGVYR AGREFGVTDD DADGDAEQEG WVIPGRHLMA PPWSGVMTQY GAGERLNHQV
MSGTEIRDRI THVLSEPSFA AGARELYEAW MARPSPSDIV STLESLTAEH RR
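
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch rather than the database's own method: it builds the codon assignments shared by the standard code and translation table 11, strips the spaces used in the 10-base blocks, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino acid
# assignments as the standard code and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record
print(translate("ATGCGCATCC TGTTCACAGT TTTTCCCTCG"))  # MRILFTVFPS
```

Passing the full gene sequence to `translate` in the same way should reproduce the protein sequence listed above, provided the sequence is copied without errors.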