Gene Information |
Locus tag | TRQ2_1844 |
Symbol | |
ID | 6093295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1869411 |
End bp | 1870628 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642489038 |
Product | amidohydrolase |
Protein accession | YP_001739855 |
Protein GI | 170289617 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
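The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but it is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1869411, 1870628)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

Both values agree with the Gene Length and Protein Length fields in the record.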
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCTGG GAAACTGCCT CATACTGAAG GATTTTTCTT CTGAACCGTT CTTTGGCGCC GTTGAAATAG AATCGGGGAT CATAAAGCGG GTGATTCAGG GAAAGACAAA GGTCGACGTG GACCTTTCTG GAAAGATGAT CATGCCCGCC CTTTTCAACA CGCACACGCA CGCTCCAATG ACCCTTCTGA GAGGGGTGGC AGAAGATCTC AGTTTTGAAG ACTGGTTGTT TTCCAGGGTC CTTCCTTTGG AGGACAGACT GACAGAAAAG ATGATCTACT ACGGCACGAT TCTTGCACAG ATGGAGATGG CAAGGCATGG AACAGCGGGC TTTGTCGACA TGTACTTTCA CGAAGAATGG GTTGCAAAGG CAGTCAGAGA CTTCGGAATG AGAGCACTTC TCACACGTGG CCTTGTCGAC GATCATGGAG ACGACGGAGG GCGTCTCGAT GAAAACTTAA AGCTCTACCG TGAGTGGAAC GGATTCGACG GAAGGATCCT GGTCGGTTTC GGTCCACATT CACCGTATCT GTGTTCAAAG GAGTACCTAA AAAGGATCTT CGATGTTGCA AAATCCTTGG ATGCCCCCAT AACCATCCAT CTTTACGAAA CGTCGAAGGA AAACTACGAT CTTTCAGAGT TACTGGAGCT GGGCATGAAG AACGTGAAAA CGATAGCTGC CCACTGCGTT TACCTTCCAG AGGAACACTT TCGTTCGCTG AAGGATCTGC CTTTCTTTGT CTCGCACAAT CCTGCCAGCA ATCTGAAACT CGGAAACGGC ATCGCTTCTG TCTGGAAGAT GATAGAACGT GGTGTGAAAG TCACGCTCGG AACGGATGGA TCCGCGAGCA ACAACTCTCT GAACCTCTTC TTCGAGATGA GAGTTGCCAG TCTCCTTCAG AAAATGGAAG ATCCACGCAG GATGGATGTT GAAACGTGTC TGAAGATGGT AACGATCAAT GGGGCGAGGG CGATGGGTTT CAAGAGTGGA AAACTGGAAG AAGGATGGAA CGCAGACCTT GTGGTGATCG ATCTGGAACT TCCAGAGATG TTTCCCTCCA GGCACATCAA GAGTCATCTC GTCCATGCCT TTTCCGGAAA CGTCTTTGCC ACCATGGTGG CAGGAAGGTG GATCTACTAC GATGGAAAAT ACCCAACCAT AGACGAGAAT GAAGTGAAAA GAGAGTTGAA GAGAATCGAA AAAGAACTCT ACTCTTGA
Protein sequence | MILGNCLILK DFSSEPFFGA VEIESGIIKR VIQGKTKVDV DLSGKMIMPA LFNTHTHAPM TLLRGVAEDL SFEDWLFSRV LPLEDRLTEK MIYYGTILAQ MEMARHGTAG FVDMYFHEEW VAKAVRDFGM RALLTRGLVD DHGDDGGRLD ENLKLYREWN GFDGRILVGF GPHSPYLCSK EYLKRIFDVA KSLDAPITIH LYETSKENYD LSELLELGMK NVKTIAAHCV YLPEEHFRSL KDLPFFVSHN PASNLKLGNG IASVWKMIER GVKVTLGTDG SASNNSLNLF FEMRVASLLQ KMEDPRRMDV ETCLKMVTIN GARAMGFKSG KLEEGWNADL VVIDLELPEM FPSRHIKSHL VHAFSGNVFA TMVAGRWIYY DGKYPTIDEN EVKRELKRIE KELYS
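The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1; the two tables differ only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in the record.
print(translate("ATGATCCTGG GAAACTGCCT CATACTGAAG"))  # MILGNCLILK
print(translate("AAAGAACTCT ACTCTTGA"))               # KELYS
```

The two outputs match the first ten and last five residues of the protein sequence in the record; passing the full gene sequence to `translate` would be the way to check the remaining residues.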