Gene Rcas_1617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1617 
Symbol 
ID5539093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2086594 
End bp2088651 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content62% 
IMG OID640893754 
Productshikimate/quinate 5-dehydrogenase 
Protein accessionYP_001431727 
Protein GI156741598 
COG category[R] General function prediction only 
COG ID[COG5322] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGA TTGTAAGCAT CAGTCTTGGT TCCGCACGAC GCGATTATCA GTTCGTGACA 
ACCGTCCTGG GGCAACAGAT CGAGGTGCGG CGCATTGGCG CCAATGGCGA TACGGCGCTC
GCAGCCTCGC TGGTGCGCGA CTTCGACGGC AAAGTGGATG CTATCGGATT GGGAGGTCTG
ACTCCCGTGT TCCACGTCGG CGCAGCGCGC TACCCGCACC ACGAAGCACT CACGATTGCG
CGCGAGGCAC GCCGTACACC GGTCGTCGCC GGCAACCTGA TCAAGCAGAC CCTCGACCGG
TGGTGCGTGC AGCAGGCGAA CCAGGCGCAG CGCGGTATGT TCAACTATCG CCGCATCCTG
GTAACGAGCG GCATCGACCG TTATGCGCTG GCGCAGGCAC TGGCGCAGTA CGACGCAGAG
TTGCGCTTTG CCGATCCGGT CGTTCACAGC GGACTGCCGT TTCTACCGCT GCCGCGGTCC
ATCCCGCAGC TCGACATGTA TGCCGCAACA ACGCTGCCAA TCGCAGCGCT CCTGCCCTAC
CGCGTGCTGC ATCCGGTGGC GCTGGGCAGC GAAGGGCACG ATGCGCGCGC CGAAAAACAC
TTCGCCTGGG CGGATGTCAT TGCCGGCGAT TTTGCCTACA TCCGGCGTTT CGCCCCGCGC
GATCTGAACA ATAAAGTCAT TGTCACCGAT GATCCGTCGC CGGAAGAGAT CGAGGACCTG
CGCGCGCGTG GCGTCCGCAC GCTCGTTACG CTCTCGCCGC GCCTGGAGAG CGCCGATAAG
ACGCACCGCC CCTTCGTCGC CGCTGATGTG CTCGAAGCGA TTGTCGTGGC GATTCTTGAG
TCAGGTCCGA CGCCAACTGA GGGCGATATC ATCAATTTCA TCGATGAGGC GGATTGGGAA
CCGGAGGTGA TGACGTTGAG CGACACGGCG GAGAAGCCAC GGTTTGCCTT CGTCATCCAT
CCGCTGTCGC CGAAGTATAT CGCCAATCAC CCGCGTTTCC GTTTTACGCG CTACCTGCCA
GAACGCCTGG TCGAACGGGT GGCGGCGCAT TTCCCGCCCA TGTACATCTC GAAAATCCGC
GGCATTCGCT CACAGGCGAC CGGTGAGGAA ATCGAAGGAT TGTTGTTCAC CCTGGGGGCG
ACGCCACGCG AGTTGATGCG CCGCGACACG GCGTTCACCT ATCGCCGCCT GATCAAGTGC
GCTCGCATGG CGGAGCGGAT GGGCGCGAAA ATCATGGGGC TTGGCGCGTT CACGTCGGTG
GTCGGTGATG CAGGCATTAC CGTTGCACAG AAGAGCGACA TCGGCATTAC ATCGGGCAAC
TCGCTGACGG TTGCGGCGAC GCTCGAAGCC GCCAAGCAGG CGGTCATCAA GATGGGCGCG
ACCGACCTGA CGCGCGGCAA GGCAATGGTC GTCGGCGCAA CCGGTTCAAT CGGCTCGGTC
TGCGCCCGGT TGCTGGCACA GGCGATCGGT GATGTCGTGC TGGTTGCACC GCGTGTCGAA
CGTCTGCTGG CGCTCAAGAA GCAGATCGAG GCGGAAACGC CAGGCGCGCG GGTGACGGCG
GCGACCAGCG CCGATGCGTA CCTGGCGGAG TGCGATCTGA TCGTCACCAC CACGTCAGCG
TTGAGCGGTC GAGTGATCAA TGTCGATAAG CTCAAGCCCG GCGCCGTTGT GTGCGATGTG
GCCCGCCCGC CCGATGTCAA AAAAGAAGAT GCCGCCCGCC GCCCCGATGT GCTGGTGATC
GAGTCAGGTG AAATCCTGCT GCCGGGTGAA CCCGATTTTG GCTTCGACAT CGGACTGCCG
CCGGGCACGG CATATGCCTG CCTCTCCGAG ACGGCGCTGT TGACGATGGA ACATATGTAC
GGCGATTACA CCCTTGGGCG GAATATCGAC ATTGAGAAGG TCAAGGAGAT GTATCGCCTG
ATGAAGAAAC ATGGGCTGCA ACTCGCCGGT CTGCGTTCGT TCGATGAGTA TATCACCGAC
GAGATGATCG CAGAAAAACG ACGCCTGGCG GACGAGCGAC GTCGTCAACT CGGGATGCCA
GTTGCAGCAA CACGGTAA
 
Protein sequence
MKRIVSISLG SARRDYQFVT TVLGQQIEVR RIGANGDTAL AASLVRDFDG KVDAIGLGGL 
TPVFHVGAAR YPHHEALTIA REARRTPVVA GNLIKQTLDR WCVQQANQAQ RGMFNYRRIL
VTSGIDRYAL AQALAQYDAE LRFADPVVHS GLPFLPLPRS IPQLDMYAAT TLPIAALLPY
RVLHPVALGS EGHDARAEKH FAWADVIAGD FAYIRRFAPR DLNNKVIVTD DPSPEEIEDL
RARGVRTLVT LSPRLESADK THRPFVAADV LEAIVVAILE SGPTPTEGDI INFIDEADWE
PEVMTLSDTA EKPRFAFVIH PLSPKYIANH PRFRFTRYLP ERLVERVAAH FPPMYISKIR
GIRSQATGEE IEGLLFTLGA TPRELMRRDT AFTYRRLIKC ARMAERMGAK IMGLGAFTSV
VGDAGITVAQ KSDIGITSGN SLTVAATLEA AKQAVIKMGA TDLTRGKAMV VGATGSIGSV
CARLLAQAIG DVVLVAPRVE RLLALKKQIE AETPGARVTA ATSADAYLAE CDLIVTTTSA
LSGRVINVDK LKPGAVVCDV ARPPDVKKED AARRPDVLVI ESGEILLPGE PDFGFDIGLP
PGTAYACLSE TALLTMEHMY GDYTLGRNID IEKVKEMYRL MKKHGLQLAG LRSFDEYITD
EMIAEKRRLA DERRRQLGMP VAATR