Gene Rcas_3516 details

Gene Information

Locus tag: Rcas_3516
Symbol:
ID: 5541015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4581228
End bp: 4582460
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 57%
IMG OID: 640895634
Product: 3-isopropylmalate dehydratase
Protein accession: YP_001433584
Protein GI: 156743455
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
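
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Rcas_3516.
length = gene_length_bp(4581228, 4582460)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields listed in the record.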


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCAAA CGATTGCAGA AAAGGTTGTG TCGCACCATG CAGGGCGTCA GGTCATGGCG 
AACGAGATTG CAATTGTTGC GATTGATGGC GCAATGGCAA CCGACGCAAC CGCTCCATTG
GCGATTAAGG CCTTCCGTGA GATGGGTGGG GTTCGTTTGT GGGATCCGTC GCGTGTTGTG
TTGGTGATTG ACCATGCCGC TCCGGCGCCG AATGAGCAGG TGAGCAATCT TCACGCTCTG
ATGCGCGCCT TTGGGCGTGA GATGGGATGT GTCTTATATG ATGTTGGCGA GGGTATCTGC
CATCAGTTGA TGGTTGAATA TGATCACGTG CGTCCTGGCC AGATCATTCT TGGCGCCGAT
TCCCATACTC CAACGTATGG GGCGCTTGGC GCGTTTGCTA TGGGTGTCGG CTCAACCGAT
CTGGCAGCCG CATGGTTGAC CGGAAAGACG TGGCTCAAGA CGCCTGCAAG TATCAAGATT
GTGCTGGACG GCACGTTGCG CACCGGTGTG AGCGCGAAGG ATCTCGTCTT ATTTCTGGTC
AGGCAGATTG GCGCTGATGG CGCACGGTAT CAGGCGGTTG AGTTCACCGG TTCGGCAATT
CGCTCATTGA GCCTCGCTTC GCGAATGACG CTGGCCAATA TGACTGCTGA AATGGGGGCG
CTGACGGCAT TTGTCGACCT GCAAGGATTA GACTTGCCAT ACCGATTCGA TCCAATTCAC
CCCGATCCGG ATGCGGTCTA TAGTGTTGTC TATTCATTCA ACGTGGACCA TCTGCTCCCA
CAAGTGGCTA TTCCGCATGC GCCCAGCAAT GTGGTTCCCA TCGATGAAGT AGCGGGTACG
CCGATCCAGA TGGCATTCAT CGGTTCTTGC ACCAACAGTC GTCTCGAAGA TCTGCGCGCA
GCAGCAGCGG TATTACAAGG GCGCAAACTC GCCCCCGGCG TGCGCCTCAT TATCGCGCCT
GCATCACGGC AAGTCTTCAT GATGGCGCTG CAAGACGGCA CTATTGCCAC TCTCACCGAA
TCGGGCGCAA CCTTCATCAC CGCCGGGTGT GGTCCTTGCG TCGGTACCCA TCAGGGGATT
CCCGGTAATG GCGAGAATGT CATCACCAGC ACGAATCGCA ACTTCCGGGG ACGTATGGGT
AATCCGCACG CCAGCATTTA TCTCGCGTCG CCGGCAGTTG TGGCAGCTTC GGCACTGCGC
GGCGTCATTA CCGATCCTGC TGACGTACTC TGA
 
Protein sequence
MGQTIAEKVV SHHAGRQVMA NEIAIVAIDG AMATDATAPL AIKAFREMGG VRLWDPSRVV 
LVIDHAAPAP NEQVSNLHAL MRAFGREMGC VLYDVGEGIC HQLMVEYDHV RPGQIILGAD
SHTPTYGALG AFAMGVGSTD LAAAWLTGKT WLKTPASIKI VLDGTLRTGV SAKDLVLFLV
RQIGADGARY QAVEFTGSAI RSLSLASRMT LANMTAEMGA LTAFVDLQGL DLPYRFDPIH
PDPDAVYSVV YSFNVDHLLP QVAIPHAPSN VVPIDEVAGT PIQMAFIGSC TNSRLEDLRA
AAAVLQGRKL APGVRLIIAP ASRQVFMMAL QDGTIATLTE SGATFITAGC GPCVGTHQGI
PGNGENVITS TNRNFRGRMG NPHASIYLAS PAVVAASALR GVITDPADVL