Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3471 |
Symbol | |
ID | 5540970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4531309 |
End bp | 4532391 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640895589 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_001433539 |
Protein GI | 156743410 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase [TIGR02088] isopropylmalate/isohomocitrate dehydrogenases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.023683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCGC CACCGTTCAC CATTCTTGTC ATACCCGGCG ATGGCATCGG GCGTGAGGTC ATTCCGGCAG CCGTCGCTGT GCTTCAGGCG ACCAATCTGC CGTTGCAGTT CGTGGAAGCC GACGCGGGGT GGGACTGTTT CCAGCGCTGC GGTGACGCAT TGCCGCATGA AACGCTCGAC GCAGCGCGCG CGGCAGATGC CGTGCTGTTT GGCGCCGTCG CGTCTCCGAG TTACCCGGTT GCCGGGTACC GCAGCCCGAT TGTGCGATTG CGCCGCGAAC TCGATCTGTA CGCCAATATC CGTCCGGTCT TCGATGATCC GCCTCCTGGC GATCCGCGCG CGCGCCGGGT CGATCTGGCG GTGGTGCGCG AAAATACCGA AGGGCTGTAC GCCGGGCGCG AACGGGTCGA AGATGGCGGC GCAACTGCGA TTGCCGAGCG CGTGATTACC CGGCGCGCCA GCGAACGAAT TGCGCGGGTC GCCTTTGAAC TGGCGCGCGC TCGTCGTGCT GCGCGCCGCG CCGACGATGC GCCGCCGGGA AGAGTGACCA TTGTCCACAA GGCGAACGTT CTGCGCGAAA CCTGTGGATT GTTCCGCACC ATCGCTCTGG ACGTCGCACA GGCGTACCCC GACGTGCAGG CGGACGAAAT GCTCGTCGAT GCCTGCGCGC TGCACCTGGC GACGCGCCCG GAACGCTTCG ATGTGATTGT CACCACGAAC CTGTTCGGTG ATATCCTGTC GGATGTCGCC TGCGCCTGGG GTGGCGGGTT GGGTCTCGCG CCATCGGCGA ACCTGGGTGA ACGACACGCG CTGTTCGAAC CGGTCCACGG CGCCGCGCCC GACATCGCGG GGAAAGGAAT CGCCAATCCG CTTGCGGCGA TCGGGTGCGC TGCGTTATTG CTCGACCATC TTGCCAGCCG CGCGTCGCCC GATACTGCGT CTGCCATCCG CGCCTGGAGC GAACGCATCA GGCGCGCCGT TCGCCACGTG CGCGCCGTCG GACCGCACAC ACCCGACTTA GGAGGCGACG CTGCCACATC TGAGACGACT GCCGCAGTCA TTGCCCATAT GCTCACGTTG TAA
|
Protein sequence | MSAPPFTILV IPGDGIGREV IPAAVAVLQA TNLPLQFVEA DAGWDCFQRC GDALPHETLD AARAADAVLF GAVASPSYPV AGYRSPIVRL RRELDLYANI RPVFDDPPPG DPRARRVDLA VVRENTEGLY AGRERVEDGG ATAIAERVIT RRASERIARV AFELARARRA ARRADDAPPG RVTIVHKANV LRETCGLFRT IALDVAQAYP DVQADEMLVD ACALHLATRP ERFDVIVTTN LFGDILSDVA CAWGGGLGLA PSANLGERHA LFEPVHGAAP DIAGKGIANP LAAIGCAALL LDHLASRASP DTASAIRAWS ERIRRAVRHV RAVGPHTPDL GGDAATSETT AAVIAHMLTL
|
| |