Gene Rcas_1549 details

Gene Information

Locus tag: Rcas_1549
Symbol:
ID: 5539025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1988333
End bp: 1989751
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 64%
IMG OID: 640893687
Product: isopropylmalate isomerase large subunit
Protein accession: YP_001431660
Protein GI: 156741531
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
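
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single trailing stop codon, which is an assumption about how this record reports positions. The function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one trailing stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(1988333, 1989751))  # → (1419, 472)
```

Under these assumptions the result agrees with the listed gene length of 1419 bp and protein length of 472 aa.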


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.269438
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.14031
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCTCGCA CACTGTTCGA CAAAATCTGG GACACCCACA TCGTGCGCCC GCAAACGGCG 
GAAACGCCTG CGGTGCTTTA CATCGATCTC CATCTCATTC ACGAAGTAAC CTCGCCCCAG
GCATTCACCG AACTACGCCG GCGCGGGTTG AAAGTCCGCC GTCCGGATCG CACGCTGGCG
ACGATGGACC ACTCGACCCC AACCACGCCG CGCGGACCGG ATGGGATCAT TCCGGTGGTC
GATGCGCAGG CGGCGGCGCA ACTTCGCCAA CTCGAACAGA ACTGCGCCGA TTTCGGCATC
CCGCTCTTCG CGCTTGGCAG CGAGCGCCAG GGGATCGTCC ACGTCATCGG TCCTGAGCAG
GGGTTGACCC AACCCGGCAT GACCATCGTC TGTGGTGATA GCCACACCAG CACCCACGGC
GCTTTTGGTG CGCTGGCGTT CGGCATCGGC ACCTCGGAAG TCGCGCACGT CCTGGCGACG
CAATGCCTGA TCCAGAACCG CCCCAAAACG ATGGAGGTGC GCGTCGATGG TCGCCTGAAG
CCGGGAGTGA CTGCCAAGGA CATTATTCTG GCGATCATTG CGCGATACGG CGTCGGCGCC
GGCGTCGGGC ATGTGTTCGA GTACACCGGC GAGGCAATCC GCGCGCTCTC GATGGAAGAG
CGCATGACGA TCTGCAACAT GTCAATCGAA GGCGGCGCGC GCGCCGGGAT GATTGCGCCC
GATGACACGA CGTTCCAGTA CATTGCCGGG CGCCCCTTCG CGCCGAAAGG CGCAGCATGG
GACGAGGCCG TGGCATACTG GCGCACTCTG CCGACCGACG ATGGCGCGGT GTACGACCGG
ACGATCACCC TGGATGCATC GCAACTGACG CCGATGATCA CCTACGGCAC CAACCCCGGC
ATGGGGATAC CGATTGACGG TCGCATCCCG ACACCGGAGG AACTGCCCGA TCAGGCAGCG
CGCCAGGCGC TCGACAAGGC GCTGCGCTAT ATGGACCTGC GACCCGGTCA ACCGCTACTC
GGTCAAAAGG TCGATGTCGT CTTCCTGGGT AGTTGCACCA ACTCGCGCAT TTCGGACCTC
CGCATGGCGG CGAGTGTGCT GAAAGGGCGC AAGATCGCCG AAGGCGTGCG CATGATGGTC
GTGCCCGGCT CGCAGCAGGT GAAGAAGCAG GCGGAAGCCG AAGGACTGGA CCGTATTTTC
CGCGAGGCAG GCGCCGAATG GCGCGAAGCC GGCTGCTCTG CCTGCCTCGG CATGAACGAC
GACAAGGTTC CGCCGGGCAA ATATGCTGTC TCGACTAGCA ACCGCAACTT CGAGGGGCGC
CAGGGACCCG GCGCGCGCAC GCTTCTTGCC AGCCCGCTCA CCGCCGTTGC ATCCGCCATC
GAAGGCGTTG TCGCTGATCC GCGGAAATAT GTGGGATAG
 
Protein sequence
MPRTLFDKIW DTHIVRPQTA ETPAVLYIDL HLIHEVTSPQ AFTELRRRGL KVRRPDRTLA 
TMDHSTPTTP RGPDGIIPVV DAQAAAQLRQ LEQNCADFGI PLFALGSERQ GIVHVIGPEQ
GLTQPGMTIV CGDSHTSTHG AFGALAFGIG TSEVAHVLAT QCLIQNRPKT MEVRVDGRLK
PGVTAKDIIL AIIARYGVGA GVGHVFEYTG EAIRALSMEE RMTICNMSIE GGARAGMIAP
DDTTFQYIAG RPFAPKGAAW DEAVAYWRTL PTDDGAVYDR TITLDASQLT PMITYGTNPG
MGIPIDGRIP TPEELPDQAA RQALDKALRY MDLRPGQPLL GQKVDVVFLG SCTNSRISDL
RMAASVLKGR KIAEGVRMMV VPGSQQVKKQ AEAEGLDRIF REAGAEWREA GCSACLGMND
DKVPPGKYAV STSNRNFEGR QGPGARTLLA SPLTAVASAI EGVVADPRKY VG
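
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it does not handle alternative start codons or ambiguous bases. The helper names clean, gc_content, and translate are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGCCTCGCA CACTGTTCGA CAAAATCTGG GACACCCACA TCGTGCGCCC GCAAACGGCG"
print(translate(first_line))  # → MPRTLFDKIWDTHIVRPQTA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to gc_content and translate would allow the same comparison against the reported GC content and the complete protein.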