Gene Sare_2955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2955 
Symbol 
ID5707809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3351680 
End bp3353377 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content72% 
IMG OID641272404 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001537772 
Protein GI159038519 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.836437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0528961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGCA TCCCGGGTCA CCCGCCCGGC CAACCGCGCA CGGCGGCCAC CACCCTCGTG 
GCCGCCCTGC TCGGACACGA CGTCGACCGG GTGTTCTGCG TGGCCGGCGA GAGCTACCTG
GCGGTCCTCG ACGCCCTGTA CGACACCCCG ACCGTCGAGG TGGTGACCTG CCGGCACGAG
GCGTCGGCGG CGTTCGCCGC CGTCGCCGAC GCCAAGCTGA CTGGTCGGGC GGGTGTCTGC
CTGGTCAGTC GCGGCCCCGG GGCCACCAAC GCAGGCATCG CCGTGCACTC GGCCGCCCAG
GACGCCACCC CGCTGGTCCT GCTCGTCGGC CACGTGCCGC GGTCCGAGAT CGGCACCGAC
GCGTTTCAGG AGATCGACCC GCGCGCCTTC TCCGGTCTGG CCAAACAGGT ACTGGTGCTG
CTGGATCCGG CACGCACCGG CGAGTTCGTC GCCCGGGCCT TCCGGGTCGC CGAGGCCGGT
ACCCGCGGGC CGGTGGTGCT GGTTCTTCCC GAGGATGTCC TGGCCATGTC AGATCCGGTC
ACGCCAGTGC CCGCCCGCTG GGCGGCAGCC GCGCCGGTGG CCGCCGCCGA GGATCTACAG
GCGGTGCGGG CGCTGCTGGC ACGGTCACGA CGACCGTTGC TCGTGGCGGG CGGCGACCTC
TCCGGCGACC GGGGCCGGTG TCTGCTGCGC GAGGTGGCGC ACCGACACCG GTTTCCGGTG
GTGACCAGCA ACAAACGGCA GGACCTCCTC GACAACCGGG ACTCCTGCTA CGCCGGCCAT
CTGCACAACA ACACCCAGGA GAGGCAGATC GCGGCACTGG ATCGGGCGGA TCTCGTCCTG
GCGGTCGGAA CCCGGCTGGA CGACGTGACC ACGTGTGGCC GGCGGCTGCC GCGCCCCGGT
CGGCCCGATC AGCCGTTGGT GCATGTGCAC GCCGATCCGC AGCGGCTCGG GCGGACCCAC
CCGCCGGCCG TCGGGTTGGC CTGCGATCCG GTCGCCTTCC TCGGCCAACT GGCACTGGAG
CCCGCGTACC CGGACGCCGG CCGGGAGACC TGGATCGACG AGTTGCACGC GATCGAGGTC
GAGAAGGCCG TCTGGTCCGA GCATCCGAGC GACGACGGCG TCGCGTTCGG TGCCGTGGTC
GCCGGCCTCG ACGAGCTCAC CGACGGCGAC GTGGTCGTTG CCGTCGACTC CGGCACCTTC
ACCAGTTGGC TGTACCGCTA CCTGCGGCTG AGCGGCGAGG GGCGGATGCT CGGAGTCGGA
TCCAGCGCGA TGGGTTTCGG CGTCCCGGCC GGCGTGGCCG CTGCACTGCG GACACGCCGC
CCGGTCGTGG TGGTCGTCGG CGACGGCGGG TTCCTGATGA CGGGCAGCGA ACTGGCCACA
GCGGTGAGCC ACCGGCTGCC CCTGGTCGTT CTCGTCGCCA ACAACGGCAG CTACGGCACG
ATCCGCCTGC ACCAGGAACG GGAGTTTCCC GGGCGGGTCA TCGCCACCGA TCTGAGTAAC
CCCGACTTCG TCCAGCTCGC CCGCGCGTTC GGCGCGCTGG GCCTGATCGT GCAGGCCGAG
GAGGACGTCG AGCCCTGCCT GGCCCGGGCA CTCGCCCACG GGGGGCCGGT CGTGGTCGAC
GTACGGACCA GCCTGAGCTG GATCACCGCC TACCGACGGA TGCGAACGCG GGTGGCCGCG
GATGTGGGGT CGGCATGA
 
Protein sequence
MNGIPGHPPG QPRTAATTLV AALLGHDVDR VFCVAGESYL AVLDALYDTP TVEVVTCRHE 
ASAAFAAVAD AKLTGRAGVC LVSRGPGATN AGIAVHSAAQ DATPLVLLVG HVPRSEIGTD
AFQEIDPRAF SGLAKQVLVL LDPARTGEFV ARAFRVAEAG TRGPVVLVLP EDVLAMSDPV
TPVPARWAAA APVAAAEDLQ AVRALLARSR RPLLVAGGDL SGDRGRCLLR EVAHRHRFPV
VTSNKRQDLL DNRDSCYAGH LHNNTQERQI AALDRADLVL AVGTRLDDVT TCGRRLPRPG
RPDQPLVHVH ADPQRLGRTH PPAVGLACDP VAFLGQLALE PAYPDAGRET WIDELHAIEV
EKAVWSEHPS DDGVAFGAVV AGLDELTDGD VVVAVDSGTF TSWLYRYLRL SGEGRMLGVG
SSAMGFGVPA GVAAALRTRR PVVVVVGDGG FLMTGSELAT AVSHRLPLVV LVANNGSYGT
IRLHQEREFP GRVIATDLSN PDFVQLARAF GALGLIVQAE EDVEPCLARA LAHGGPVVVD
VRTSLSWITA YRRMRTRVAA DVGSA