Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0646 |
Symbol | |
ID | 6092063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 655009 |
End bp | 656265 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642487832 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_001738682 |
Protein GI | 170288444 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00455068 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAGA CACTCGCAGA AAAGATCTTT TCTGAACATG TTGGAAGAGA CGTGAAAGCA GGTGAAATCG TACTCGCGAG AGTGGATATA GCCATGGCCC AGGATGGAAC AGGCCCTCTG ATGATAAACG AATTCAGAGA ACTCGGTTTC AAAGAAGTGA AGGTCCCGAA GGCCTTCCTC TTCATCGATC ATGCTTCTCC GAGCCCGAGG AAAGAGCTTT CGAACTCGCA GAAGATGATG AGAGAATTTG GAAAAGAGAT GGGAGTCAAG GTTTTCGATG CGGGAGACGG GATATCCCAC CAGATCCTCG CGGAAAAATA CGTGAAACCC GGCGATCTGG TAGCAGGTGC GGATTCGCAC ACCTGCACTG CCGGTGGGCT CGGTGCTTTC GGAACGGGAA TGGGGTCCAC AGATGTTGCG ATCATCTTCG GGCTTGGACA GAACTGGTTC AAAGTACCTG AGTCGATCAA AGTTGTGGTG AACGGGAAGT TACAGGATGG AGTTTACGCG AAAGACATTA TTCTCGAGAT CGCGAGAATT CTGGGAAGCG ACGGCGCAAC TTACAAAGCG TTGGAGTTCC ACGGAAGCTG TATCGAAAAT ATGAATGTGG AGGACAGACT CACCATTTCC AACATGGCGG TGGAAGTGGG AGCGAAAGCA GGTCTCATGC CTTCTGATGA GAAGACCAGA GAGTTCCTGA AAAAGATGGG AAGAGAGGAG GACTTCAGAG AGTTGAAAGC AGATCCAGAC GCGGTTTACG AGACAGAGAT AGAGATAGAT GCCACCACAC TCGAACCACT CGTCTCTTTG CCTCACTATG TGGACAACGT GAGAAAGGTA AGCGAGGTTG AAAAGGAAAA GATAAAGATA GATCAAGTGT TCATAGGAAC CTGTACGAAC GGAAGACTCC AGGATCTTGA GATCGCTTTG AAGATTCTTG AGAAACACGG AAAGCATCCG GATGTGAGGC TGATCGTTGG CCCTGCTTCA AGGAAGGTCT ACATGGACGC CCTTGAAAAG GGAATAATCA AGAAATTCGT TGAACTCGGA GCGGCAGTTA TACCACCAGG TTGCGGCCCG TGTGTTGGAA TTCACATGGG TGTTCTTGGA GACGGAGAGA GGGTACTTTC CACGCAGAAC AGAAACTTCA AGGGAAGGAT GGGGAATCCC AATGCGGAGA TATACCTTGC TTCTCCTGCA ACGGCGGCAG CCACCGCGGT AACCGGATAC ATCACAGATC CGAGAAGGTT CATTTGA
|
Protein sequence | MGKTLAEKIF SEHVGRDVKA GEIVLARVDI AMAQDGTGPL MINEFRELGF KEVKVPKAFL FIDHASPSPR KELSNSQKMM REFGKEMGVK VFDAGDGISH QILAEKYVKP GDLVAGADSH TCTAGGLGAF GTGMGSTDVA IIFGLGQNWF KVPESIKVVV NGKLQDGVYA KDIILEIARI LGSDGATYKA LEFHGSCIEN MNVEDRLTIS NMAVEVGAKA GLMPSDEKTR EFLKKMGREE DFRELKADPD AVYETEIEID ATTLEPLVSL PHYVDNVRKV SEVEKEKIKI DQVFIGTCTN GRLQDLEIAL KILEKHGKHP DVRLIVGPAS RKVYMDALEK GIIKKFVELG AAVIPPGCGP CVGIHMGVLG DGERVLSTQN RNFKGRMGNP NAEIYLASPA TAAATAVTGY ITDPRRFI
|
| |