Gene Sare_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2284 
Symbol 
ID5706043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2623755 
End bp2625038 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content64% 
IMG OID641271762 
Productcitrate synthase I 
Protein accessionYP_001537133 
Protein GI159037880 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.193692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0343529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAG TCAAGCTCGA TCATCCCGGT GGGCAACTGT CGATGCCGGT GAACACGGCG 
ATCGAGGGTC CTGCCGGCAT CGGGGTGAGC AAGCTGCTCA AGGAAACCGG GATGACGACC
TACGATCCTG GCTTCGTGAA TACGGCTGCC TGCTCGTCCG CGATCACCTA CATCGACGGC
GACGCGGGGA TCCTGCGCTA CCGTGGGTAT CCGATCGAGC AACTGGCGGA GAAGTCCTCC
TTTCTGGAGG TCTCGTACCT GTTGATTTAT GGCGAACTGC CCACCCAGCA GCAGCTGGTC
GAGTTCACCG AGCGAATCCG ACGGCACTCG TTGCTGCACG AGGAGATGCG TCGGTTCTTC
GATGGCTTCC CCAGGGACGC GCACCCGATG GCGGTGCTCT CCTCGGCGGT CAGTGCCATC
TCCACCTTCT ACCAGGACAG CCTGGATCCG TTCGACGGCG CGCACGTGGA GATGTCGACG
GTCCGGCTGA TGGCCAAGGT GCCGACCATC GCCTCGTACG CCTACAAGAA GTCGATCGGT
CAGCCGCTGT TGTACCCGGA CAACTCCCTC GGGTACGTGG AGAACTTCCT CCGGATGACC
TTCGGTGTGC CGGCGGAGCA GTACGACGTC GACCCGGTCA TCGCCCGGGT GCTGGACATG
CTCTTCATCC TGCACGCCGA CCACGAGCAG AACTGCTCCA CCTCGACGGT ACGGCTGGTC
GGTTCCAGCA ACGCGAACCT CTTCGCCTCG GTCTCGGCCG GTGTGAACGC CTTGTTCGGT
CCGCTGCACG GCGGCGCCAA CCAGGCGGTG TTGGAGATGC TGGAAGCCAT CCAGGCCGAC
GGCGGCGATG TCCGTTCCTT CGTGCGGCGG GTCAAGGACC GGCAGGCCGG CGCCAAGCTG
ATGGGCTTCG GTCATCGGGT CTACAAGAAC TACGACCCGC GGGCGACGAT CGTGAAGAAG
GCCGCCCAGG ACGTGCTCGG CCGGATGGCC GCGTCGGACC CGATGCTGGA CCTCGCGATG
GAACTGGAGG AGATCGCGCT CGCTGACGAC TTCTTCGTCT CCCGCAGGCT CTACCCGAAC
GTGGACTTCT ACACCGGTCT GATCTACAAG GCCATGGGTT TCCCGACGAA GATGTTCACG
GTGCTGTTCG CGTTGGGGCG CCTCCCCGGC TGGATCGCCC AGTGGAGTGA GATGATCAAA
GACCCGGAGA CGAAGATCGG ACGCCCGCGG CAGATCTACA CCGGCTCCGC CCAGCGCGAC
TACGTCGACG TCGGACAGCG CTGA
 
Protein sequence
MTEVKLDHPG GQLSMPVNTA IEGPAGIGVS KLLKETGMTT YDPGFVNTAA CSSAITYIDG 
DAGILRYRGY PIEQLAEKSS FLEVSYLLIY GELPTQQQLV EFTERIRRHS LLHEEMRRFF
DGFPRDAHPM AVLSSAVSAI STFYQDSLDP FDGAHVEMST VRLMAKVPTI ASYAYKKSIG
QPLLYPDNSL GYVENFLRMT FGVPAEQYDV DPVIARVLDM LFILHADHEQ NCSTSTVRLV
GSSNANLFAS VSAGVNALFG PLHGGANQAV LEMLEAIQAD GGDVRSFVRR VKDRQAGAKL
MGFGHRVYKN YDPRATIVKK AAQDVLGRMA ASDPMLDLAM ELEEIALADD FFVSRRLYPN
VDFYTGLIYK AMGFPTKMFT VLFALGRLPG WIAQWSEMIK DPETKIGRPR QIYTGSAQRD
YVDVGQR