Gene Sare_0511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0511 
Symbol 
ID5705529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp581066 
End bp582172 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content70% 
IMG OID641270037 
Productcitrate synthase 2 
Protein accessionYP_001535431 
Protein GI159036178 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.043222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0026302 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGACT TCAAACCGGG ACTGGAGGGC GTCGTAGCCT TCGAGACCGA GATCGCCGAA 
CCCGATCGGG AGGGTGGTTC GCTGCGCTAT CGCGGCGTCG ATATCGAAGA TCTTATCGGT
CAGGTCTCGT TCGGCAACGT CTGGGCCCTG TTGGCGGATG GGCGCTTCGG CCCGGGACTG
CCGCCGGCCG AGCCGTTCCC GGTCCCGGTG CACTCCGGCG ACATCCGGGT CGACGTGCAG
TCCGCTGTGG CGATGCTCGC CCCGTACTGG GGTCTCCACC AGCTGCTCGA CATCTCCGAC
GAGCAGGCCC GCGAGGACCT CGCCCGGGTC TCGGTGACCG CGCTCTCCTT CGTCGCCCAG
TCCGCGCGGG GTCTGGGCCT GCCGGCAGTG CCGCAGAAGG AGATCGACAA GGCGTCCACC
ATCGTCGAAC GCTTCATGAA GCGCTGGCGG GGCGAACCGG ACCCGCGGCA CGTCAAGGCC
GTCGACGCCT ACTTCATCTC CGCCGCCGAG CACGGCCTGA ACGCCTCCAC CTTCACCGCC
CGCATCGTGG CCTCCACCGG CGCGGACGCG GCGGCCTGCA TCTCCTCCGG CATCGGCGCA
CTCTCCGGGC CGCTACACGG CGGTGCGCCC TCCCGGGTAC TGAACATGCT CGAGGCGGTT
GAGCGCAGTG GTGACGCCGA GGGGTACGTA CGGGGCGTAC TCGACCGCGG TGAGCGGCTG
ATGGGTTTCG GTCATCGGGT CTACCGCGCC GAGGACCCGC GGGCCAGGGT GCTCCGCCGC
ACCGCCAAGG AACTGGGTGC CCCGCGCTTC GAAATCGCGG AGGCGCTGGA GAAGGCCGCC
CTGACCGAAC TGCACAGCCG CAAGCCGGAC CGGATTCTCG CCACCAACGT CGAGTTCTGG
TCGGCGGTCG TGCTGGACTT CGCCGAGGTA CCCGCCCATA TGTTCACCTC GATGTTCACC
TGCGCCCGAA TGGGCGGCTG GAGCGCGCAC ATTCTGGAAC AGAAGAAGCT GCAGCGACTC
GTCCGCCCGT CCGCCCGCTA CGTCGGGCCC GGCCCCCGCA GGCCGCACGA GGTCGAGGGC
TGGGACCAGG TCCCGCACGG CGTCTGA
 
Protein sequence
MADFKPGLEG VVAFETEIAE PDREGGSLRY RGVDIEDLIG QVSFGNVWAL LADGRFGPGL 
PPAEPFPVPV HSGDIRVDVQ SAVAMLAPYW GLHQLLDISD EQAREDLARV SVTALSFVAQ
SARGLGLPAV PQKEIDKAST IVERFMKRWR GEPDPRHVKA VDAYFISAAE HGLNASTFTA
RIVASTGADA AACISSGIGA LSGPLHGGAP SRVLNMLEAV ERSGDAEGYV RGVLDRGERL
MGFGHRVYRA EDPRARVLRR TAKELGAPRF EIAEALEKAA LTELHSRKPD RILATNVEFW
SAVVLDFAEV PAHMFTSMFT CARMGGWSAH ILEQKKLQRL VRPSARYVGP GPRRPHEVEG
WDQVPHGV