Gene Sare_4173 details


Gene Information

Locus tag: Sare_4173
Symbol:
ID: 5703961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4739739
End bp: 4740953
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 66%
IMG OID: 641273600
Product: isocitrate dehydrogenase
Protein accession: YP_001538953
Protein GI: 159039700
COG category: [C] Energy production and conversion
COG ID: [COG0538] Isocitrate dehydrogenases
TIGRFAM ID: [TIGR00127] isocitrate dehydrogenase, NADP-dependent, eukaryotic type
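
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 4739739
END_BP = 4740953
GENE_LENGTH_BP = 1215
PROTEIN_LENGTH_AA = 404


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```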


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.374158
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00222704
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGAAGA TCAAGGTAAG TAACCCGGTC GTGGAGCTCG ACGGCGACGA GATGACCCGG 
ATCATCTGGA AGCAGATCCG GGAGCAGCTG ATCCTGCCCT ACCTCGACGT CGACCTGCGG
TACTACGACC TCTCGATCCA GCACCGTGAC GAGACCGACG ACCAGGCCAC CGTCGACGCC
GCCAACGCGA TCAAGGAGCA CGGTGTCGGC GTCAAGTGCG CCACGATCAC CCCGGACGAG
GCACGGGTCG AGGAGTTCGG CCTGAAGAAG ATGTGGCGGT CGCCGAACGG CACCATCCGT
AACATCCTCG GTGGCGTCGT CTTCCGCGAG CCCATCATCA TGTCCAACGT CCCGCGGCTG
GTGCCGGGCT GGACGAAGCC GATCATCATC GGCCGGCACG CCCACGGCGA CCAGTACAAG
GCCAGCGACT TCGTCGTCCC CGGGCCGGGC AAGGTCACCA TCACCTACAC CCCGGCCGAC
GGGGCGCAGC CGATCGAGAT GGAGATCGCC GACTTCCCCG GCGGCGGAGT CGCCATGGGC
ATGTACAACT TTGACGAGTC GATCCGGGAC TTCGCCCGTG CCTCCATGCG GTACGGCCTC
GACCGCGGCT ACCCGGTCTA CCTGTCGACC AAGAACACCA TCCTCAAGGC GTACGACGGT
CGGTTCAAGG ACATCTTCGC CGAGGTCTAC GAGAACGAGT TCAAGGCCGA CTTCGAGGCT
GCCGGCATCA GCTACGAGCA CCGACTGATC GACGACATGG TCGCTGCGGC GCTCAAGTGG
GAGGGCGGCT TCGTCTGGGC CTGCAAGAAC TACGACGGTG ACGTGCAGTC CGACACCGTC
GCGCAGGGCT TCGGCTCGCT CGGTCTGATG ACGTCCGTGC TGATGACCCC CGACGGCCGC
ACCGTCGAGG CCGAGGCCGC GCACGGCACC GTCACCCGGC ACTACCGGCA GTACCAGAAG
GGCGAGAAGA CCTCGACCAA CCCGATCGCC TCGATCTACG CCTGGACCCG GGGCCTGGCC
CACCGAGGCA AGCTGGACGG CACCCCGGCG GTTTCCGAGT TCGCCAACAC CCTGGAGAAG
GTCATCGTCG AGACCGTCGA GGGTGGCCAG ATGACCAAGG ACCTCGCGCT GCTCATCTCG
CGGGATGCCC CGTGGCTGAC CACCGACGAG TTCATGAACG CGCTGGACGA GAACCTGGCC
CGCAAGCTCG CCTGA
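
A GC content figure such as the one reported in the gene information can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is an illustrative helper, not a function from the source database.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage on the first 10-base block of the gene sequence only;
# pass the full sequence text to get the value for the whole gene.
first_block = "ATGGCGAAGA"
print(f"{gc_fraction(first_block):.0%}")
```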
 
Protein sequence
MAKIKVSNPV VELDGDEMTR IIWKQIREQL ILPYLDVDLR YYDLSIQHRD ETDDQATVDA 
ANAIKEHGVG VKCATITPDE ARVEEFGLKK MWRSPNGTIR NILGGVVFRE PIIMSNVPRL
VPGWTKPIII GRHAHGDQYK ASDFVVPGPG KVTITYTPAD GAQPIEMEIA DFPGGGVAMG
MYNFDESIRD FARASMRYGL DRGYPVYLST KNTILKAYDG RFKDIFAEVY ENEFKADFEA
AGISYEHRLI DDMVAAALKW EGGFVWACKN YDGDVQSDTV AQGFGSLGLM TSVLMTPDGR
TVEAEAAHGT VTRHYRQYQK GEKTSTNPIA SIYAWTRGLA HRGKLDGTPA VSEFANTLEK
VIVETVEGGQ MTKDLALLIS RDAPWLTTDE FMNALDENLA RKLA
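
The protein sequence can be related back to the gene sequence by translating codons. Below is a minimal sketch, assuming the sense-codon assignments of NCBI translation table 11, which match the standard code; it does not handle the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGGCGAAGA TCAAGGTAAG TAACCCGGTC"))  # MAKIKVSNPV
```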