Gene RoseRS_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1053 
Symbol 
ID5207999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1291295 
End bp1292560 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID640594667 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_001275412 
Protein GI148655207 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCAAA CGCTTGCAGA ACAGATCCTT TCACACGCTG CCGGGCGTCC GGTCTCCGCC 
GGCGAGAATA TCGTCGCCAG GATTGATCTG GCGATGATGC ACGACAGCAT CTCACCGAGC
ATCATCAAAA TCCTGCACCA CGAATTGGGC GCCGAACGGG TGTGGGATCG CGACCGTGTG
GCAGTGGTGG TGGACCACGT TGCACCGGCG GCGACGATCC AGAATGCTGA GCATCAACTG
GCGCTGCGAC GGTGGGTGCG TCAGCAGCAG ATCACGCATT TCTTCGATGT CGGGCGCGGC
ATTTCGCATC CGGTACTGGT CGAGGAGGGG CTGGCGCGCC CCGGTATGCT GATCATCGGC
AGCGACTCGC ACTCAACGGC GTATGGCGCG GTCGGGGCGT TTGGCGCCGG CATGGGTTCC
ACCGATATGG CGCTGGCGCT GGCGACCGGG CAGACGTGGC TGCGTGTGCC GGAGACGGTG
CGCATCCTGG CGCGTGGGCG GTTTCAACCT GGGGTGGGCG CAAAGGATCT GGGGCTGCGC
GTAGCGCGTC TGATCGGCGC CGATGGCGCG ACCTATCAGT CGGTCGAGTG GCACGGCGTC
GAGGAATTGA GCATCGGCGA ACGGATGACG CTCGCCACGC TCTCTGTTGA AATCGGTGCG
AAGGCAGGGA TCATTCCGCC TGTCGGTCCC GGCTGGGAAG AGCACGCCAC CCGTCGCGGC
ATCACCGTCC CGTCATGGTT GCGCGTCGAA GAGGGAGCGC GCTACAGTCG CACCGTGGAG
GTCGATCTCG ACACCCTTGA GCCGCAGGTG AGCGTGCCGC ACTTCGTGGA CAATGTGCGT
GACCTGAGCG ATCTGGGGCG TGTTGAGGTG GATGTGGTCT ATATTGGCAC CTGCACGAAC
GGTCATGCGA ACGACCTTGC TGCTGCTGCG CGCATCCTGA AGGGGCGCAA AGTGGCGCGC
GGTGTGCGCC TGCTCGTGGT GCCGGCGTCG AGCGAAGCGT TGCAGCAGGC GACGGCGGAT
GGAACGCTGG CGACACTCCT TGAATCCGGC GCAGCCATTG GCACCCCTGG ATGCGGCGCA
TGTATCGGGC GGCATATGGG CGTCCTGGCG CCCGGCGAGG TCTGCCTGTT CACCGGCAAC
CGTAACTTCC GCGGGCGCAT GGGATCGCCG GAGGCGCAGA TCTACCTGGC GTCGCCTGAA
GTTGCAGCGG CGACGGCAGT GCTCGGGTAT ATCGCCCATC CGGCGGAGGT GGTGAACGGG
CGGTGA
 
Protein sequence
MGQTLAEQIL SHAAGRPVSA GENIVARIDL AMMHDSISPS IIKILHHELG AERVWDRDRV 
AVVVDHVAPA ATIQNAEHQL ALRRWVRQQQ ITHFFDVGRG ISHPVLVEEG LARPGMLIIG
SDSHSTAYGA VGAFGAGMGS TDMALALATG QTWLRVPETV RILARGRFQP GVGAKDLGLR
VARLIGADGA TYQSVEWHGV EELSIGERMT LATLSVEIGA KAGIIPPVGP GWEEHATRRG
ITVPSWLRVE EGARYSRTVE VDLDTLEPQV SVPHFVDNVR DLSDLGRVEV DVVYIGTCTN
GHANDLAAAA RILKGRKVAR GVRLLVVPAS SEALQQATAD GTLATLLESG AAIGTPGCGA
CIGRHMGVLA PGEVCLFTGN RNFRGRMGSP EAQIYLASPE VAAATAVLGY IAHPAEVVNG
R