Gene Sare_0838 details

Gene Information

Locus tag: Sare_0838
Symbol:
ID: 5707274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 939399
End bp: 941129
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 63%
IMG OID: 641270356
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001535747
Protein GI: 159036494
COG category: [G] Carbohydrate transport and metabolism
    [H] Coenzyme transport and metabolism
    [R] General function prediction only
COG ID: [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes
TIGRFAM ID:
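
The coordinates and lengths listed in this record are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 939399
end_bp = 941129

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1731
print(protein_length)  # 576
```

Both results match the Gene Length and Protein Length fields, which suggests the inclusive-coordinate assumption holds for this record.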


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00172744
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.208315
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGGCA GCGGGGGGTC GGTCACAGTC GGCGAACTGT TGCTTGGTCG CCTCCACGAC 
CTCGGCGTGC GTCATGTTTT TGGAGTGCCC GGCGACTATG CAATGGACTT CATAGATCAG
ATCATGACGT TCGATGGCAT CGACTGGATC GGTAGCTCCA GCGAGTTCAA CGCCGGCTGC
AGTGCGGACG GCTACGCCCG AGTTGCTGGC ATAGGTGCCA TTGTTACCCA ATTTGGTGTG
GGCGAACTGT CGACCATGAA TGCATTGGCT GGCGCAATGG CTGAGTCGGT GCCTATCGTC
TCGGTCGTGG GTGGCCCGAT GCTGGAAGTC ATGCGGCAGC GCACGTCGAT TCACCACTCA
CTCGCGGATG GCGATTCCGA GCGTTGGATT CGGATGGCCC GCGAGGTGAC GGTTGCCCAA
GCCTCGTTGA CGCCCGAATG TGCACTGCAG GAGATCGACC GGGTGTTGGC CGAGTGCTGG
TCCCAGCAGC GTCCCGTATA CATTCGAATT CCCGGTGATG TGGCCATAGC TCCCGTCTCC
CGACCGTCGC GACGCTTCAC CCGCCCAAAT CCGGTCGTGT TGCCCGCACA ACTGGACGCG
TTCGCCGCCG CTGCTCAGCG CCTGCTCGCT GGTGCCGAAC GGCCAGCCTT GCTGGTGGGG
AATCTACCGA TACGCCTCGG TCTTGGTGCG GCTGTCGCCG CGCTCGCCAA CGAGCGAAAC
TGGCCGATCG CCACTCAGAT GCTCGGCCGA GGGCTGGTTG ACGAGACAGA TCCCCACTAC
ATCGGCATCT ACAACGGGGC CGAAAGTTCG GCTCCGGTCC GCGAGGTGGT CGAAGGCGCC
GACGTCCTGG TCTGTCTGGG AACCACCTTT TTCGACTGGA ATGGTCTGTT CACCGCTGAA
CTGGATCCTG CCCGGATCAT AAACCTGAGG CGGGACGGCG CTGTGGTCGG CGGAACCTGT
TTCGCCCCGG TATCCATGGC CGCGGCGCTG GATCGGTTGC ACGAGATGGC TGCCAGTCGT
TCGGTTGGCT GGCCGAGCGC TGCCCTGTTG CACGACCCGC CCGAAATCGA CAGGGCAAGC
ACCGACCCAA TCCGTCAGGA ACGACTCTGG TCGGCGGTTC AGGACGTACT GCGTCCGGGG
GATATTTTGG TCTCGGAGGT CGGCACCGCG TTCTTCGGTG CGGCAACAAT GCGCCTACCG
GCTGGGACGA CTGTATTGGC GGCGCCAATC TGGAGCTTGG CCGGCTATAC CACGCCTGCT
GCTTTCGGCG CCGGGATCGC CGCGCCTGAC CGGCGGGTGG TGTCGATCAC CGGCGACGGG
GGAATACAAA TTTCCCCACA GGAGATAAGC CGAATGTTCG TCTTTGATCA ACATCCGATC
ATATTCGTGG TGAATAACGG TGGCTACAGT AGCGAGCGAG CGCTTGAAAA AGCGGTCGGG
GAAGAAACTC AGGCGTACAC CCAGATCCCT GACTGGCGAT ATAGTGAGAT ACCGGCGGTG
TTTGCGCCGG AGGGAACCTT CGTTGCGCAT GTCGCCCGCA CCGAGGCCGA GTTGGCTGGG
ATCTTGGCCG GTGTCGACGG GCGTACGGAC CGGCTGACTC TCATTGAGGT GATCGTCGAC
CCGACGGATC TGCCTCCAGG GTTGCCGCAG TGGAGCCAGG AGGCATCCGC CTTTATCTAC
CACGCGCAGT TCCCTGCCCC GGCCAGTCTC CCCTGGGGCG GGCTCGGCTA G
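
The GC content field in this record can in principle be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and the database may compute or round the value differently.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First 10-base block of the gene sequence: 8 of its 10 bases are G or C.
print(gc_percent("GTGGGCGGCA"))  # 80.0
```

Passing the full gene sequence to the same function gives a whole-gene value to compare against the reported figure.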
 
Protein sequence
MGGSGGSVTV GELLLGRLHD LGVRHVFGVP GDYAMDFIDQ IMTFDGIDWI GSSSEFNAGC 
SADGYARVAG IGAIVTQFGV GELSTMNALA GAMAESVPIV SVVGGPMLEV MRQRTSIHHS
LADGDSERWI RMAREVTVAQ ASLTPECALQ EIDRVLAECW SQQRPVYIRI PGDVAIAPVS
RPSRRFTRPN PVVLPAQLDA FAAAAQRLLA GAERPALLVG NLPIRLGLGA AVAALANERN
WPIATQMLGR GLVDETDPHY IGIYNGAESS APVREVVEGA DVLVCLGTTF FDWNGLFTAE
LDPARIINLR RDGAVVGGTC FAPVSMAAAL DRLHEMAASR SVGWPSAALL HDPPEIDRAS
TDPIRQERLW SAVQDVLRPG DILVSEVGTA FFGAATMRLP AGTTVLAAPI WSLAGYTTPA
AFGAGIAAPD RRVVSITGDG GIQISPQEIS RMFVFDQHPI IFVVNNGGYS SERALEKAVG
EETQAYTQIP DWRYSEIPAV FAPEGTFVAH VARTEAELAG ILAGVDGRTD RLTLIEVIVD
PTDLPPGLPQ WSQEASAFIY HAQFPAPASL PWGGLG
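
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (the bacterial code named in the record), GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases; `translate` and `START_CODONS` are hypothetical names, and the start-codon set is a deliberately small subset chosen for the example, not the full list defined for table 11.

```python
from itertools import product

# Standard amino-acid assignments in NCBI codon order (bases T, C, A, G).
# Table 11 uses the same assignments for non-initiating codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the initiation codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is always read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("GTGGGCGGCA GCGGGGGGTC GGTCACAGTC"))  # MGGSGGSVTV
```

The output matches the first ten residues of the protein sequence above. Only the first codon gets the special treatment: the internal GTC and GTG codons still translate to V.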