Gene Sare_1116 details

Gene Information

Locus tag: Sare_1116
Symbol:
ID: 5706059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1260481
End bp: 1261590
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 71%
IMG OID: 641270631
Product: hypothetical protein
Protein accession: YP_001536015
Protein GI: 159036762
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
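
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1260481
END_BP = 1261590


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```

Both results agree with the Gene Length (1110 bp) and Protein Length (369 aa) fields.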


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0131047
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCGTGCG CGGTGCTGCT GGTCGCCTCC GGTTGCAGCT TCGGCGAGCC AGGACCCGAC 
CCAGCCGGTG AGCCGCCCAT CCTCCCCACC CCGTCAACCA GCCGGGACGG CACCGGCTCT
GAGGTGGTGG CCACCGTGCT GGCGAAGGGG CTGCGCGTGC CGTGGGGCAT CGCCTTTCTG
CCGGACGGCG GTGCGCTGGT GACCGAGCGG GACTCGGGCC GGATTCTCCA GGTAGGGCCC
GAGTCTGAAC CCGACGGGCT GCGGGTGACC GAGGTGCAGA CCCTCGCCGA GGTGACCGCG
GGTGGCGAGG GCGGTCTGAT GGGAATCGCC GTCTCACCGG ACTACCAGCA GGACCGCACG
GTCTTCGTCT ACTACACGGC CGAGCGGGAC AACCGCATCG CCCGACTCAC CCTCGGCGAG
CCGCCGCGCC CGATCCTGAC CGGCATTCCG AAGGCGCGCA CCCACAACGG CGGTGGCCTC
GCCTTCGGAC CGGACGGGCA GCTCTACGCC AGCACCGGCG ACGCCGGCGA CCGAAACCAG
GCGCAGGACG ACAAGCGGCT CGGCGGAAAG ATCCTCCGGA TCACCACCGA CGGCGAGCCG
GCACCGGGCA ATCCGTTCCC CGACTCGCCC GTGTGGTCGC TGGGGCACCG CAACGTGCAG
GGCTTCACCT GGACAGATGG CCGAATGTAC GCCGTCGAAT TCGGCCAGAG CACCTGGGAC
GAGATCAACG TGGTCGAAAA GGGACGTAAC TACGGTTGGC CGGCCGTCGA GGGCCGCTCC
GACGACCGGC GATACGTCAA CCCGATCGTC CAGTGGCCGA CCTCGGACGC CTCCTGCTCC
GGGCTGGCCC ACGCGGAAAG TGTCCTCGCC ACGGCCTGCC TCCGCGGTCG GCGACTCTGG
CTGGTCGAGC TGACCGGCAC CGGAACCGTC CTCGGCCAGC CGCGCGACCT GCTGACCAAC
CAGTACGGCC GGTTACGGGC GATCGCCGCG GCACCGGATG GCTCGTTCTG GGTGAGCACC
TCGAACCACG ACGGGCGCGG AGATCCGGTA GCGGAGGACG ACCGGCTCCT GCGGCTGGTG
TTCGCCGACG GCGGAGCCGG GCGAAGCTGA
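
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any computation. As a minimal sketch, the helpers below strip the whitespace and compute the G+C fraction, which could be compared against the GC content field of the record. The function names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGTCGTGCG CGGTGCTGCT GGTCGCCTCC GGTTGCAGCT TCGGCGAGCC AGGACCCGAC"
print(len(clean_sequence(first_line)))   # 60
print(f"{gc_fraction(first_line):.0%}")  # GC percentage of this line only
```

Passing the full gene sequence to `gc_fraction` gives the value to compare with the reported GC content.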
 
Protein sequence
MSCAVLLVAS GCSFGEPGPD PAGEPPILPT PSTSRDGTGS EVVATVLAKG LRVPWGIAFL 
PDGGALVTER DSGRILQVGP ESEPDGLRVT EVQTLAEVTA GGEGGLMGIA VSPDYQQDRT
VFVYYTAERD NRIARLTLGE PPRPILTGIP KARTHNGGGL AFGPDGQLYA STGDAGDRNQ
AQDDKRLGGK ILRITTDGEP APGNPFPDSP VWSLGHRNVQ GFTWTDGRMY AVEFGQSTWD
EINVVEKGRN YGWPAVEGRS DDRRYVNPIV QWPTSDASCS GLAHAESVLA TACLRGRRLW
LVELTGTGTV LGQPRDLLTN QYGRLRAIAA APDGSFWVST SNHDGRGDPV AEDDRLLRLV
FADGGAGRS
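
The protein sequence corresponds to the gene sequence translated with translation table 11. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The sketch below builds that codon table and translates up to the first stop codon. It is a simplified illustration: it does not handle alternative start codons (not needed here, since the gene begins with ATG) or ambiguous bases, and the names are hypothetical.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGTCGTGCG CGGTGCTGCT GGTCGCCTCC"))  # MSCAVLLVAS
print(translate("GGGCGAAGCTGA"))                      # GRS
```

The two outputs match the start (MSCAVLLVAS) and end (GRS) of the protein sequence listed in the record.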