Gene RoseRS_3155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3155 
Symbol 
ID5210125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3971409 
End bp3973088 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content63% 
IMG OID640596746 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001277466 
Protein GI148657261 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAGCG ACCTCAAACG CCGCAGTCGC ACCATTACCG ATGGGCGCAC CCGCGCCGGG 
GCGCGCGCCA TGCTCAAGGC AATCGGCTTC ACCGACGAAG ACCTGGCAAA ACCGATCATC
GGCATCGCCA ATACCTGGAT CGAGACGATG CCATGCAACA TCAACCTGCG CGCGCTGGCG
GCGCGGGTCA AGGAAGGCGT GCGCGCCGCC GGCGGCACGC CGATGGAGTT CAACACCGTC
GCTATCGCCG ATGGCGTCAC CATGGGTACG GAAGGGATGA AGGCGTCGCT GATCAGCCGC
GACCTGATTG CCGACTCGAT CGAACTGATG GGGCGCGGGT ATATGTTCGA CGCGATCATC
GCGCTGGTGG CATGCGACAA AACGATCCCC GGAGCAGCGA TGGGATTGAC GCGCCTGAAC
ATCCCCGGCT TCCTGCTCTA CGGCGGATCG ATTGCTCCCG GTCACTGGCG CGGCAAAGAG
ATCACCATCC AGCATGTGTA CGAAGCGATC GGTGCAGTTG CCGCCGGTAA GATGACCGAC
GAAGAACTGA AAGAGATCGA AGATGCCGCG TGCCCCGGTC CGGGAGCATG CGGAGGTCAG
TACACTGCCA ACACGATGGC AACCGTGATG GAAATTATCG GTCTCTCGCC GATGGGGACG
GCTGCGGTGC CGGCTGCCGA CCCGCGCAAA GATTCGGTTG GATACCGCGC CGGGCAGTTG
ATTATGGATG TGCTGCGACG CGATCTGAAG CCGCGCGATA TTCTGACACG GGCGGCATTC
GAGAATGCGA TCGCCAGCGT GGCGTTGACC GGCGGCTCGA CCAACGCGGT ACTCCATCTG
CTGGCGCTGG CGCGTGAGGC GGGCGTGCCG CTAACGCTCG ACGATTTCGA CGCGATCAGT
CGTCGTACAC CGCTCTGCTG CGACCTGATG CCGAGCGGGA AGTACTCTGC CATTCACGTC
GATCAGGCGG GCGGCATTCA GGTGATTGCG AGACGCCTGG TCGACGGCGG TTTCGCGCAC
GGCGATGCGA TCACCGTCAC TGGTCGCACC CTGGCGGAAG AGGCGGCGGA TGCTGTCGAA
ACGCCAGGTC AGGATGTGAT CCGTCCCCTC GATAATCCGA TCAAGCCGAC AGGCGGGTTG
CTGGTGCTGC GCGGCAATCT GGCGCCCGAA GGATCGGTGG TCAAACTGTT CGGTTATGAG
CGCACCTACC ACCGCGGTCC GGCGCGCGTC TTCGATGGCG AAGAGGCGGC AATGGCCGCT
ATCGTCGGCG GTGAAATTCG ACCGGACGAT ATTGTGATTA TTCGCTATGA AGGTCCGCGC
GGCGGTCCTG GCATGCGCGA GATGCTCGGC GTTACTTCCG CGATTGTCGG CGCCGGACTT
GGGCAGTCGG TCTCGCTCAT TACTGATGGA CGCTTCAGCG GCGCGACGCG CGGTGTGATG
ATCGGACACG TGGCGCCGGA GGCGGCGCGA GGCGGACCGC TTGCGATTGT GCAGGAGGGG
GATGAGATCG AGATCAATCT CGACGAGCGC CGTGTTGATC TGCTGCTGTC AGAAGAAGAG
ATTGCCGACC GGTTGCTCGC CTGGCAGCCG CCAGCGCCGC GTTATGAGTG GGGCGTGATG
GCGCGCTACG GTGCGCTGGT GTCGTCAGCG TCCGAAGGCG CCGTGCTGGT GACACCATGA
 
Protein sequence
MSSDLKRRSR TITDGRTRAG ARAMLKAIGF TDEDLAKPII GIANTWIETM PCNINLRALA 
ARVKEGVRAA GGTPMEFNTV AIADGVTMGT EGMKASLISR DLIADSIELM GRGYMFDAII
ALVACDKTIP GAAMGLTRLN IPGFLLYGGS IAPGHWRGKE ITIQHVYEAI GAVAAGKMTD
EELKEIEDAA CPGPGACGGQ YTANTMATVM EIIGLSPMGT AAVPAADPRK DSVGYRAGQL
IMDVLRRDLK PRDILTRAAF ENAIASVALT GGSTNAVLHL LALAREAGVP LTLDDFDAIS
RRTPLCCDLM PSGKYSAIHV DQAGGIQVIA RRLVDGGFAH GDAITVTGRT LAEEAADAVE
TPGQDVIRPL DNPIKPTGGL LVLRGNLAPE GSVVKLFGYE RTYHRGPARV FDGEEAAMAA
IVGGEIRPDD IVIIRYEGPR GGPGMREMLG VTSAIVGAGL GQSVSLITDG RFSGATRGVM
IGHVAPEAAR GGPLAIVQEG DEIEINLDER RVDLLLSEEE IADRLLAWQP PAPRYEWGVM
ARYGALVSSA SEGAVLVTP