Gene Sare_3317 details

Gene Information

Locus tag: Sare_3317
Symbol:
ID: 5707184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3828469
End bp: 3829647
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 641272744
Product: transaldolase
Protein accession: YP_001538111
Protein GI: 159038858
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0176] Transaldolase
TIGRFAM ID: [TIGR00876] transaldolase, mycobacterial type
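
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3828469, 3829647)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Both values agree with the Gene Length and Protein Length fields listed in the record.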


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.706771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00145949
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGACA GGCTGGGTGA GCTCACCGCC GCGGGCGTGG CGGTCTGGCT CGATGATCTT 
TCACGGATAC GACTCAGCTC CGGCGAGCTG GACCGGTTGC GCCGGGAGAA GCACCTGGTC
GGCGTGACCA CCAACCCGAC GATCTTCGCG AAGGCCCTGG GCGACGCCGA GGAGTACGAC
TGGCAGTTGC ACGACCTCGC TATGCGCGGG ATAGCCGTCG AGGAGGCGGT GCGCAACCTC
ACCGCGTACG ACGTGCGCTG GGCCTGTGAT GTGATGCGAC CGGCGTACGA GGCGTCGGCG
GGCGTGGACG GACGGGTCTC ACTGGAGGTG GACCCCCGGC TGGCGTACGA GACGGACAAG
ACCGTCGCCG AGGCGCGGGC GCTCTGGTGG CTGGTCGACC GACCGAACCT GTTCATCAAG
ATCCCGGCCA CCGAGGCCGG GCTCCCGGCG ATCACCGCGG CCCTGGCCGA GGGGATCAGC
GTCAACGTCA CCCTGATCTT CGGCCTGGAC CGCTATTCGG CGGTGATGGA GGCGTTCCTG
GCCGGCCTGG AGCAGGCCAA GGCGAACGGC CACGACCTGT CCAAGATCGG CTCAGTGGCG
TCGTTCTTCG TCTCCCGGGT CGACACCGAG GTCGACAAGC GGCTGGAGAA GATCGGCTCG
GAGCAGGCCA GCAAGCTGCG CGGTCGGGCC GCGGTCGCCA ACGCCCGACT GGCCTACGAG
CGCTACAGCC AGGTCTTCGC CTCCGACCGG TGGCAGGCGC TCGCCGACGC CGGGGCGCAC
CCGCAGCGAC CGCTGTGGGC CTCCACCTCG ACGAAGAACC CGGACTACCG GGACGTGATC
TACGTCGAAG AGCTGATCGC CCCCGGCACG GTCAACACGA TGCCCGAGCC GGTGATCAAC
GCCTACGCCG AGCACGGCGA GACCAGCGGC GACACGGTGA CTGCGGCCTA CGACGAGGCC
CGGACGGTCT TCGCGGGCCT GGCGTCGGCG GGTGTCGACA TGACCGACGT GATCGACACC
CTGGAACGCG AGGGGGTGGA GAAGTTCGAG GCGAGCTGGA ACCAGCTACT CGAAGGCGTC
CGCAGGTCCC TCGCCGCCGC CGACCAGGGC ACCGACCACC CCGGCGACGC CGCCAGAAGC
AACGCGCAGG CCGCCGAGCG GGCGGGGGGC AACGCGTGA
 
Protein sequence
MTDRLGELTA AGVAVWLDDL SRIRLSSGEL DRLRREKHLV GVTTNPTIFA KALGDAEEYD 
WQLHDLAMRG IAVEEAVRNL TAYDVRWACD VMRPAYEASA GVDGRVSLEV DPRLAYETDK
TVAEARALWW LVDRPNLFIK IPATEAGLPA ITAALAEGIS VNVTLIFGLD RYSAVMEAFL
AGLEQAKANG HDLSKIGSVA SFFVSRVDTE VDKRLEKIGS EQASKLRGRA AVANARLAYE
RYSQVFASDR WQALADAGAH PQRPLWASTS TKNPDYRDVI YVEELIAPGT VNTMPEPVIN
AYAEHGETSG DTVTAAYDEA RTVFAGLASA GVDMTDVIDT LEREGVEKFE ASWNQLLEGV
RRSLAAADQG TDHPGDAARS NAQAAERAGG NA
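
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch, the code below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. It assumes that the amino acid assignments of table 11 match those of the standard genetic code (the two tables differ in which codons may act as start codons), and the `translate` helper is a hypothetical name introduced here for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first 20 residues of the protein sequence.
first_60_bases = "ATGACGGACA GGCTGGGTGA GCTCACCGCC GCGGGCGTGG CGGTCTGGCT CGATGATCTT"
print(translate(first_60_bases))  # MTDRLGELTAAGVAVWLDDL
```

The same function can be applied to the full gene sequence; the final codon, TGA, is a stop codon and therefore does not appear in the protein sequence.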