Gene Sare_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2067 
Symbol 
ID5703278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2366876 
End bp2368582 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content70% 
IMG OID641271553 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001536924 
Protein GI159037671 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.246741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGGCC AGGTACGCGT TGTGGATCGC ATCGCTGCGA CACTGGCCCG GTTGGGTGTC 
CGCCACGTCT TCGGCGTCAG CGGCGCCAAC ATCGAGGACC TGTACGACGC GCTGCGCGGC
ACCGACGGTG CGACCTGCGG TGTAGTGGCC AAACACGAGT TCTCTGCGGC CACCATGGCA
GACGGCTCAG CCCGCGTCAC CGGTCGCTTC GGCGTGGTGT CGACGACCTC CGGCGGCGCC
GCGATGAATC TGGTGCCTGG GCTGGCGGAG GCGTATGCCT CGCGGGTGCC GATGTTGGCC
CTGGTCGGCC AGCCACCCAC GGCCCAGGAG GGACGTGGGG CGTTCCAGGA AACCAGCGGT
CTGGCGGGCT CGTTCGACGC GATGGCGGTA CTCGACCCTG TCTCCCGGTT CTGTGCCCGG
GTGGAGGATC CGGCTCGTAT CGACGCTGCC CTCACCGCAG CGATCTCCGC GGCACACCAG
GACCCGAAGG GGCCGGCGGT GCTGCTGCTA CCCAAGGATG TGCAGCAGGC ACTGGTCGAC
GACTCCCCGA GTCGTGTCCT CACCGTCGCG GCGCCGGCCA GCCCGGCACC GACGCCGGCC
CTGGACCAGG CGGCAGCTCT GCTCCGCGAG GCACGGCAGG CTCTGGTCAT CGCCGGTGCG
GGTGTGGCCT CGTCGGGTGG CCGACGGGAA CTGGATCGTC TGGTGGGGCG CCTCGGCGCA
TGGGTGGCGG CCACCCCAGA CGCCAGGGAT GCCTTCGACA ACCGTCATCC GGCATTCGCT
GGTGTCGCCG GGGTGATGGG ACACGACGCC GTCGGCGAGC TACTGCAACG AGCCGACCTG
TGCCTGCTGG CGGGCACCCG GCTGCCCGCC CTGGCGCGCA ACGGACTGGA AGAGGCGCTG
GCGGGGATGC CGGTGATCTG CGTCGACCCC GAGCCACCGC ACATCCCGGG CCTCGCACTG
ATGGGTTGCC CGCAAGCCAC ACTGCGCGCC TTGTCGCTGC GTTTGGGCGC TCACCGACGG
TCATGTCCGC CTCACCCTGG GCCTGTCCTG CTCTCGGGCG GTACGTCGTC CGGGGAGACG
CTGCGGGCAT ACGCCGATGC CCTCCACGTG ATCTCCACCG TCCTGGCGCC CGACGTTCAC
GTCTTCGTCG ATGCCGGCAA TGCGGCTGCG GCCGCGATCC ATGCGCTGTC GCCTGCCCCT
CGAGGACGTT TCGTCGTGGC GCTGGGAATG GGTGGTATGG GCTACACCTT CGGAGCGGGG
ATCGGCGCGG CGCTGGCCAC TGGACGGCGT ACGTACGTCC TGGCAGGAGA TGGTGCGTTC
TATGCACACG GCACCGAGGT GCACACCGCA CTGGAAGCCG CCGCCCCCGT CACCTTCGTG
ATCTTCAACA ACAACGCGCA CGCCATGTGC GTCACCCGCG AGGACCTGTT CCAAGGCGGC
GCCAGCGGCG TCAACGCCTT CCGGCCGTCG GACATCGCCG CCGGTGTATC CGCGATGTTT
CCGGGTCTTC GGGCAACCCG CGCCAGCACC GCGCCCCAGT TGCGTGCGGC CCTGCTGGCG
GGTCAGGCCG GTGGCGGTCC GGCTCTCGTG GCCATGGACT TCGATCCCGC TGAACTACCG
CCGTTTCGTC CGTTCCTGGC CGCGGGCCAG GCACCCGTCA ACAACCAGGA GGGTGATCAT
GACGACCGCG CCGTCCACGT TGGCTGA
 
Protein sequence
MTGQVRVVDR IAATLARLGV RHVFGVSGAN IEDLYDALRG TDGATCGVVA KHEFSAATMA 
DGSARVTGRF GVVSTTSGGA AMNLVPGLAE AYASRVPMLA LVGQPPTAQE GRGAFQETSG
LAGSFDAMAV LDPVSRFCAR VEDPARIDAA LTAAISAAHQ DPKGPAVLLL PKDVQQALVD
DSPSRVLTVA APASPAPTPA LDQAAALLRE ARQALVIAGA GVASSGGRRE LDRLVGRLGA
WVAATPDARD AFDNRHPAFA GVAGVMGHDA VGELLQRADL CLLAGTRLPA LARNGLEEAL
AGMPVICVDP EPPHIPGLAL MGCPQATLRA LSLRLGAHRR SCPPHPGPVL LSGGTSSGET
LRAYADALHV ISTVLAPDVH VFVDAGNAAA AAIHALSPAP RGRFVVALGM GGMGYTFGAG
IGAALATGRR TYVLAGDGAF YAHGTEVHTA LEAAAPVTFV IFNNNAHAMC VTREDLFQGG
ASGVNAFRPS DIAAGVSAMF PGLRATRAST APQLRAALLA GQAGGGPALV AMDFDPAELP
PFRPFLAAGQ APVNNQEGDH DDRAVHVG