Gene Rcas_2979 details

Gene Information

Locus tag: Rcas_2979
Symbol:
ID: 5540471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3863087
End bp: 3864766
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 64%
IMG OID: 640895097
Product: dihydroxy-acid dehydratase
Protein accession: YP_001433054
Protein GI: 156742925
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
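
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3863087, 3864766)
print(length, protein_length_aa(length))  # 1680 559
```

Under these assumptions the computed values agree with the listed gene length (1680 bp) and protein length (559 aa).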


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.633648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000413857
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCAGCG ACCTCAAACG CCACAGTCGC ACGATTACCG ATGGGCGCAC CCGCGCGGGG 
GCGCGCGCGA TGCTCAAGGC AATCGGCTTT ACCGACGAGG ACCTGGCAAA GCCGATCATT
GGCATTGCCA ACACCTGGAT CGAGACGATG CCGTGCAACA TCAACCTGCG CGCGCTGGCG
GCGCGGGTCA AGGAGGGTGT GCGCGCAGCA GGCGGCACGC CGATGGAGTT CAACACCGTC
GCCATTTCCG ATGGCGTCAC GATGGGCACG GAAGGAATGA AGGCATCATT GATCAGCCGC
GACCTGATCG CCGATTCCAT CGAACTGATG GGGCGCGGCT ATATGTTCGA CGCGATTATT
GCGCTGGTGG CGTGCGATAA AACGATCCCC GGCGCGGCGA TGGGGTTGAC GCGCCTGAAC
GTCCCCGGCT TCCTGCTCTA CGGCGGATCG ATTGCTCCTG GTCACTGGCG CGGCAAAGAG
ATCACGATTC AGCACGTGTA CGAGGCGATT GGTGCGGTTG CTGCCGGTAA AATGACCGAT
GAGGAATTGA AAGAGATCGA GGATGCGGCA TGTCCCGGTC CTGGCGCGTG CGGCGGTCAG
TACACCGCCA ACACAATGGC GACGGTCATG GAGATTATCG GGTTGTCGCC CATTGGCACG
GCAGCAGTGC CGGCCGCCGA CCCACGCAAG GACTCGGTCG GTTATCGTGC CGGTCAGTTG
ATCATGGATG TGTTGCGGCG CGACCTGAAG CCGCGCGATA TTCTGACGCG CGCTGCGTTC
GAGAATGCGA TTGCCAGCGT GGCATTGACC GGCGGTTCGA CCAATGCGGT GCTCCACCTG
CTGGCGTTGG CGCGGGAGGC CGGCGTGCCT CTGACGCTCG ACGACTTCGA CACAATCAGC
CGCCGCACCC CGCTCTGCTG CGACCTCATG CCGAGCGGGA AGTACTCTGC CATTCACGTC
GATCAGGCAG GCGGCATCCA GGTGATCGCC AAACGGCTCG TCGATGGCGG CTTTGCCCAC
GGCGACGCAA TCACCGTCAC CGGGCGCACA CTGGCGGAAG AGGCAGCGGA CGCCGTCGAA
ACACCCGGTC AGGATGTGAT CCGTCCGCTC GACAATCCGA TCAAACCGAC CGGCGGGTTG
CTGGTGCTGC GCGGCAACCT GGCGCCCGAA GGGTCGGTCG TCAAACTGTT CGGCTACGAA
CGCACCTACC ACCGCGGTCC GGCGAGGGTC TTCGATAGCG AAGAGGCGGC AATGGCTGCG
ATTGTCGGCG GCGAAATCCG GCCGGATGAC ATTGTTGTTA TCCGCTACGA AGGACCGCGC
GGCGGTCCTG GCATGCGTGA GATGCTTGGC GTTACCTCGG CAATCGTCGG CGCCGGTCTT
GGTCAGTCGG TGTCGCTCGT TACCGATGGG CGCTTCAGTG GTGCGACGCG CGGCGTGATG
ATCGGGCATG TGGCGCCGGA AGCGGCGCGT GGCGGCCCGC TTGCGATTGT TCAGGAAGGG
GATGAGATCG AAATCAATCT GGATGAGCGG CGCGTCGATC TGGTGCTTTC GGAAGAAGAG
ATCGCAGATC GATTGCTCGC CTGGCAGCCA CCAGCGCCGC GCTTCGAGTG GGGCGTAATG
GCGCGCTACA GCGCGCTGGT GTCGTCGGCA TCCGAGGGTG CAGTGCTGGT GACGCCGTAA
 
Protein sequence
MSSDLKRHSR TITDGRTRAG ARAMLKAIGF TDEDLAKPII GIANTWIETM PCNINLRALA 
ARVKEGVRAA GGTPMEFNTV AISDGVTMGT EGMKASLISR DLIADSIELM GRGYMFDAII
ALVACDKTIP GAAMGLTRLN VPGFLLYGGS IAPGHWRGKE ITIQHVYEAI GAVAAGKMTD
EELKEIEDAA CPGPGACGGQ YTANTMATVM EIIGLSPIGT AAVPAADPRK DSVGYRAGQL
IMDVLRRDLK PRDILTRAAF ENAIASVALT GGSTNAVLHL LALAREAGVP LTLDDFDTIS
RRTPLCCDLM PSGKYSAIHV DQAGGIQVIA KRLVDGGFAH GDAITVTGRT LAEEAADAVE
TPGQDVIRPL DNPIKPTGGL LVLRGNLAPE GSVVKLFGYE RTYHRGPARV FDSEEAAMAA
IVGGEIRPDD IVVIRYEGPR GGPGMREMLG VTSAIVGAGL GQSVSLVTDG RFSGATRGVM
IGHVAPEAAR GGPLAIVQEG DEIEINLDER RVDLVLSEEE IADRLLAWQP PAPRFEWGVM
ARYSALVSSA SEGAVLVTP
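
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the blocks might be joined and how a GC percentage could be computed for comparison with the "GC content" field; the helper names are hypothetical, and the database may compute or round its own figure differently.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGTCCAGCG ACCTCAAACG CCACAGTCGC ACGATTACCG ATGGGCGCAC CCGCGCGGGG"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence block through `clean_sequence` should give a string of 1680 bases if the record is internally consistent.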