Gene Sare_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1100 
Symbol 
ID5707021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1236804 
End bp1238426 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content73% 
IMG OID641270615 
Producthypothetical protein 
Protein accessionYP_001535999 
Protein GI159036746 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.575745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0877269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAGC GGATCGAGGG GCACGGCGGG GCGCTCGCGC TCGCGGCGCT GCGTGGGCAC 
GGCGTTCGGG AGATGTTCAC CCTCTCCGGG GGGCATGTCT TCCCGCTCTA CGACGCCGCG
CACCAGGCCG GGTTCCCGCT CTACGACGTC CGGCACGAGC AGTCGGCGGT TTTCGCGGCC
GAGGCTGTCG CGAAGCTTCA GCGCCGTCCC AGCCTCGCCG TGCTGACCGC CGGCCCGGGG
GTCACCAACG GCATCTCCGG ATTGACCAGC TCCTACTTCA ACGCGTCACC GGTCCTGGTG
CTCGGTGGGC GAGCGCCGCA GTTCCGGTGG GGGGCGGGCA GCCTCCAGGA GTTGGACCAC
CTGCCGTTGG TCACCCCGGT GACGAAGCAC GCCGAGACGA TCGCGCATGC GGCTGACGTC
CCCCGGGCGG TGGGCGTCGC GTTGACCACG GCGCTCACCC CGCACCGCGG CCCGGTCTTC
TTGGACCTTC CACTGGAGGT GGCCTTCTCG GTGGCCGACG CCGACCTGCC CGCTGTCGCC
CCGATTGCCC CGATCGAGGC GGACCCGGCG GAGGTCGCGG AGGCCGCTGC GCTGATCGCC
GAGGGGCGGC GTCCCGTGCT CGTCGCCGGC TCCGACGTCT ACGCCGGCGA CGCTGTCGAA
CCCCTGCGCG CGGCGGCCGA GGCGCTCGCG GTGCCCGTCT TCACCAACGG TCAGGGCCGA
GGCGCGCTGC CGCCGGAGCA CCCGCTCGCC TTCACCAGGT CCCGCCGGGT GGCCCTCCAG
AAGGCCGACG TGGTGGTGGT CGTCGGCACG CCGCTCGACT TCCGGCTCAA CTTCGGGGAC
TTCGGTGCCG CGACCGTGGT GCACGTGGTG GACGCGCCGA GTCAGCGCGC CGGGCATGTC
CAACCTGCGG TCGCGCCCGC GGGTGACCTG CGGCTGATCC TGTCCGCGTT CGCGGAATAC
TCCGGGGACC GGGTCGACCA CACGGAGTGG GTCGCCGAGC TGCGGGCTGT CGAGGACGCC
GCCCGGGCCC GCGACGCCGT GGCGATGGCC GCGGAGACCG ACCCGATTCG GCCCGCCCGC
GTCTACGGCG AGCTGCGTCG GGTGCTGACC CGGGACGCGA TCACCATTGG TGACGGCGGC
GACTTCGTTT CGTACGCGGG GCGCTACCTG GAGCCCGCCC AGCCCGGCAC CTGGCTGGAC
CCCGGTCCGT ACGGCTGCCT CGGCACCGGT ATGGGCTATG CCATGGGTGC TCGGGTGACC
CACCCCGACC GGCAGGTCTG CGTCCTGATG GGGGACGGTG CGGCCGGCTT CTCGCTGCTG
GACGTGGAGT CCCTGGTCCG GCAGCGGCTG CCGGTGGTGA TTGTCGTCGG CAACAACGGA
ATCTGGGGAC TGGAGAAGCA TCCCATGCGG GCGATGTACG GCTACGATGT GGCCGCGGAC
CTCCAGCCGG AGCTGCGCTA CGACCAGGTG GTCCAGGCAC TGGGCGGTGC GGGCGAGACG
GTGGCGAAGG CTGCCGACCT TGGGCCGGCG CTGACGCGCG CGTTCGAGGC CGGGGTGCCG
TACCTGGTCA ACGTGCTGAC CGACCCGGCC GACGCGTACC CCCGCTCGTC GAACCTCGCC
TGA
 
Protein sequence
MSERIEGHGG ALALAALRGH GVREMFTLSG GHVFPLYDAA HQAGFPLYDV RHEQSAVFAA 
EAVAKLQRRP SLAVLTAGPG VTNGISGLTS SYFNASPVLV LGGRAPQFRW GAGSLQELDH
LPLVTPVTKH AETIAHAADV PRAVGVALTT ALTPHRGPVF LDLPLEVAFS VADADLPAVA
PIAPIEADPA EVAEAAALIA EGRRPVLVAG SDVYAGDAVE PLRAAAEALA VPVFTNGQGR
GALPPEHPLA FTRSRRVALQ KADVVVVVGT PLDFRLNFGD FGAATVVHVV DAPSQRAGHV
QPAVAPAGDL RLILSAFAEY SGDRVDHTEW VAELRAVEDA ARARDAVAMA AETDPIRPAR
VYGELRRVLT RDAITIGDGG DFVSYAGRYL EPAQPGTWLD PGPYGCLGTG MGYAMGARVT
HPDRQVCVLM GDGAAGFSLL DVESLVRQRL PVVIVVGNNG IWGLEKHPMR AMYGYDVAAD
LQPELRYDQV VQALGGAGET VAKAADLGPA LTRAFEAGVP YLVNVLTDPA DAYPRSSNLA