Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1811 |
Symbol | |
ID | 6093262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1834430 |
End bp | 1839034 |
Gene Length | 4605 bp |
Protein Length | 1534 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642489008 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001739825 |
Protein GI | 170289587 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0726728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAT TGATTTTCTT GGTTTTTCTT CTGATCAGCT TCAGCCTTTT TGGAGGATAC GCCTATTTCT CTAGATATCC TGTTCTGCAT CCAGATGAGG GACTCTTCTT TGTGATCTCA GATCTGGAGG ATGTCACCCT GAACGTGTGG AAGATAAGTG AAGAAGACTT TTTGAAAGCA GTTTTTGATC CAGAAAACTT CAACTTCTCG CTGTTAGAGA TAACACGTCC TATCTACAGT AAAAAGTTCT CCTCAGAGGA GTGGAAAGAA TTCTCATTTC CACTGAAAGA CAGGGGATTT TACTTTGCGA CTCTGGTTTC CAACGAGGGG ACGGTTTTCA GAAAGGTGAT GGACAGGGGC CTGTTCATCG TCACCGATCT GGAGGTGATT TACTTTTCCG ACAGCGAAAA GCTGAGGCTC CACGTGTTCG ACTCAGACGG TGATTTTGTG GAAGGAGCGG AGGTCCTTCT CTTTGAAGAT TCGAAACTGA TCGACAGAGT TTTCACCGGC AAAGACGGGG TCGTTTCTAT CACGAAACAC TTCGACACGT TCTACATCAG GTACGGGGAC TCTCGATTTT TGGGAGGGGT GTACTTTTCA GGTGGAGGGC TTGAAAGAGA AAAGCTTTTC TTTGTCACAG ACAGGCCGAT CTACAAACCT TCCGACACGG TCCACTTCAG AGGTCAGATC TTCTCTTTTG AAGAGGGTCT CTACAGAGCC TTTGAGAAAA CGAAAGTAAC CGTTTCCATT TTCGACACAA AGAACAACGA AGTTTACAGA TCGGAGCTTG AAACCGACGA GCTCGGTGGA TTCAACGGTT TTATGAAGCT TCCAGACACA GCCCCGGTCG GACTCTACAG GGTGAAGGTC GATCATGGAG GAAGACGCTA CTACGAATAT TTTCTGGTGG AAGAATACAG AAAGCCCGAG TACAAAGTCG AAATCGAAAC GGATAAAGAC TTGTACATAT CCGGCGAAGT TGTGAACTAC CTTGTCAGGG TGAAGTACTT CAACGACCAG CCTGTTGCGA AAGCGCAGGT TGCCTACTAC GTTCGAGCCT TTCCAGAGGA AGGAAGCGGA TATCTGGTTT ACAGGGGAAC GGACTTCACA GACGAAGAGG GAAACCTCAG ACTCGGTGTG AAAACAGAAG AAGGATTTCA GGGGTTCTAC CGGCTGGAGG TGATCGTGAC GGATGAAAGC CAGCGTCAGA TTGAGGAAAC AAGGTCTGTG AAGGTGTACG CAGACAACGT TCTGATATCT CCGATGAATC GGTACGTTTC CACCTCACCG GGCAAGCAGG TGAGGGTGAA GGTGAAAGTG ACGGATCTTT CCGGAAATCC TTTAGATGGA CTGCTCACCG TCTCTTCTGA AGATTCAACG AGCACGGTGG TTGTGGAAAA CGGTGAGACG ATCGTCACTT TCACTCCAAA AGAGCCAAAA AGTTACAGAA TAGAACTCTC CTTCGGAAAG GCGAAAACTC ACCTCTACGT GTACGCTTAC TACGGTGCAG GAACAAGCAG TGAGTTCGTC ATTAATCCAG CAACGAACAC GGTGAAACCT GGAGATGAAC TTTCGGTTCA GATCCTTGCA CCTGGTAAGG TGATGGGAGT TCTGGGAATC GTCTCAAACA GGGTTTACGA CACTATTCCT GTCTCCTTCA CCGGGTCTGT CAACCTGCGT GTCAGAATAC CGAAAGATAT CCCTGAGAAG AATCTCTTCC TCAGCTTCAT AGGACTCGAC GACAATGGGC GTATCTACAA GCTGGAAAGA CTGAACGTTC TGCTCGACAC GAATTTCACC ACCATGAAGA TTCTGTTCGA CAAGGATCAG TACGAACCTG GAGAAATGGC ACAGATCACG ATCGAATCGA ATGTGGACAG AGTCTGTCTT TTCCTCGTTG ATGAAGCGAT ATACGCCATG GTTGGAGCAG AACCACCGGT GCTCGAAAAC TTCCTCTATC CTCACATGAA CTATCCCCAA ACAAGAGGAG GATTTCCGCA TTACTGGAGA CTCTATGTTT CAAGGGATTC GTTCCGAAAC AAACTCGCTT CCCTCCCGGA GGAGAAGACC TTCGCCGATT TCAAACAGAA CGCCATTCCA TCTAAGTTGA ACGTCAGGGA GTACTTCCCG GACACGGCCC TCTGGATTCC TTCACTGGAG CTTCACAACG GAATCGCGAG GGTGAGCTTC AAGGTCCCAG ACAGCATCAC TTCTTTCAGG GCAACGGCCT ACGGTTTCTC AAAGGATCGA TTCTCCCAGG CAGAAAGCGA AATGGTCGTT TCCAAAAAGT TCTATCTGAT GCCTCACCTT CCGTCTTTTT TGAGGGAGGG TGATGTGATA AAAATATCCG CAACCGTTTT CAACAGGACT TCGAAGACGC TTCCGGTTCA ACTCATGGTG GAACTTCCCG AGAACATAGA ACTCCTCGAG GGGAGTTCCT CAAGACGCTT TTTGATGGAG GCGAACTCCT CACACACAGA GACCTGGACA GTGAAGGCTG TCTTTGCTTC TGAAGGAAGT TTTGTGAAAT TTACTGCGGT TGGAGGTGAT CTGAGCGACG CGGTCTCCAT GAGACTGCCC GTTGAAAGAT TCGCTTTTGA AAGGGAATTC TACCGCATCA TGCTCTTGGA CGGGAAAGAG ACGCTGAAGA TCCCGGAGCA GTTCATCTCA TCGAGGATAA GGTTTCTGGA CAGCATCGTT CCGCTCGTCG AGGATAGCCT GAAAAGGCTG GTAGACTTCC CGTACGGTTG TGTCGAACAG ACCATGAGCC GGTTCTTCCC GGCCGTGGTT GCAGCAAGTG TAGGAATAGA GGTGGAGAAC CTGGAAGAGA TCATCCAGAG GGGGCTGTTC AGACTCTACT CCTACCAGCA CAACGATGGC GGTTGGGGAT GGTTCAGATT CGACGAATCC GATGACTTCA TGACCTGCTA CGTGATGGAA GGGCTGTACT TCACCATGAA GGCGGGATAC GATGTCGCAG AAAGCGTCCT GCAGAGAGGA ATAGAGTATC TCAGGGAACA TCCGTCGGCC TACGGATCGT ACGTTCTCGA TCTGTACGGA GTAGATCACG AGCCGTTCAG GCCAGAAAGC GAAGTGGATC TGGTGTTTCT GAGTTTGAGT TCAAAAGAGG CTCTGAAACA GCTGATGAAC TACGTCGTCC AGGACGAGCA GAAGGCCTAT CTGAAAATAT CTTCCGATAA CCCCCTCATC AGTGAAATCC AGCTCAACAG CGTTTTCCTC AGGGCTCTTG CAAAGTGGAA AGAATTTCCG GAACTGGTAA GAAAAGTGAC AAATTACCTT CTCTTGAAAA AAGACAGCGC TTTCTGGACT TCCACAAAGG ACACGTCCTT TGTTATTCTG GCTCTCCTTG AGGCGATGCC GGAGTACGCT TCAACCACAC TGAAAGTCAT CAACTCCGAA AACACCTTCG AACTGAAGCC AGGTGAAGAA AGCTCCCTCG TTCCCGGTTC ACTGACCGTC TCTGGAAAAG GCATTGTGGA AGTGGAGGTA GTTTACATCG AAGTTCCGAA AGAGGCTGTG AGCGAAGGTT TGAAGATAAA AAGAGAATTC TACAAAAGGT ACGAACTGCT CATAGAGGAG AATAAAATGA TTGTGGATGC CTTCGTGCCG ATCGGGAGAG GATACGTACC ACACTCGATA CACCCTGTCG AGAAAGAACA AAACGAAGAA CTCTACATCC TGCCGTACAA GTACTGGAAG AAGACAATCG AATACAGAGG ATTTCCCCTC GAGATAGACG GTGCAGAAGT GAAAATAAAA GGAGAGACTT ACACGTTCTT CAGGATCGAA ACGTTCAACG GCCTGATTCT TGTAGTTCTC AGAAACGAAG CGCTCGTCTA TGATACGGAA AAGAACACTA TCACCAGGTA TCTGGATGTA ACAGACGCAG GTTTCATGAG AAGTGGTCCT GTTTTTCTCA TGAAGGGATT CGTGCTGGTC GGTGATGAAA AGATACCCGT TCCTGAAGAC GTTACGGGGC TGTCCTGCAC GATGGATGAG ATCCTGCTGA GGGGAGAAAA CAAAACGTAC TGGTACAGGA ACGGAGAGTT CGTGGATCTT CCGTTCGTTG CCAGAAGGGT ATTCTTCTGG GATGGGAAGA GGCTGGTTGC AGAGAGCATA CGCTTCAGCG GGTCTTCAAA GACTCTTCGG AACAGAGTTT TCGAGGTGGT CTTCGATGTT GAAGATGTGA GGATAGAGTT GGGAGACATA ATCAAAACGG TGGTGAGGGT TGAGGGAGAT GGAAATTATC TCATAGTGGA GGATTTCATC CCGTCCTGCG CGCAGGTGCT CTCGAACTAC AGAGAAAAAG GGATCGAGGA AAACAAGTTC TCGTACAGCT GGTACTTTTC ATGGAACGCA TGGTACTCTG GAAGGGAGAT TCGAACGGAC AGGGTTGCGC TCTTTGCCAG ATACCTTTAC GGTGATAGCT TTGACTACGT CTGGAGGGCG ACTGCAGAGG GGGTGTTTCA TCTTCTTCCG GCACGGGTTT ATCCGATGTA TTCTCGTGAT CTCTATGCTC ATACAGATCC AGATGTGCTT TTCATCGGGG CGGATTTTAT CGATGGAAGA GATGATCAAC CTTGA
|
Protein sequence | MKRLIFLVFL LISFSLFGGY AYFSRYPVLH PDEGLFFVIS DLEDVTLNVW KISEEDFLKA VFDPENFNFS LLEITRPIYS KKFSSEEWKE FSFPLKDRGF YFATLVSNEG TVFRKVMDRG LFIVTDLEVI YFSDSEKLRL HVFDSDGDFV EGAEVLLFED SKLIDRVFTG KDGVVSITKH FDTFYIRYGD SRFLGGVYFS GGGLEREKLF FVTDRPIYKP SDTVHFRGQI FSFEEGLYRA FEKTKVTVSI FDTKNNEVYR SELETDELGG FNGFMKLPDT APVGLYRVKV DHGGRRYYEY FLVEEYRKPE YKVEIETDKD LYISGEVVNY LVRVKYFNDQ PVAKAQVAYY VRAFPEEGSG YLVYRGTDFT DEEGNLRLGV KTEEGFQGFY RLEVIVTDES QRQIEETRSV KVYADNVLIS PMNRYVSTSP GKQVRVKVKV TDLSGNPLDG LLTVSSEDST STVVVENGET IVTFTPKEPK SYRIELSFGK AKTHLYVYAY YGAGTSSEFV INPATNTVKP GDELSVQILA PGKVMGVLGI VSNRVYDTIP VSFTGSVNLR VRIPKDIPEK NLFLSFIGLD DNGRIYKLER LNVLLDTNFT TMKILFDKDQ YEPGEMAQIT IESNVDRVCL FLVDEAIYAM VGAEPPVLEN FLYPHMNYPQ TRGGFPHYWR LYVSRDSFRN KLASLPEEKT FADFKQNAIP SKLNVREYFP DTALWIPSLE LHNGIARVSF KVPDSITSFR ATAYGFSKDR FSQAESEMVV SKKFYLMPHL PSFLREGDVI KISATVFNRT SKTLPVQLMV ELPENIELLE GSSSRRFLME ANSSHTETWT VKAVFASEGS FVKFTAVGGD LSDAVSMRLP VERFAFEREF YRIMLLDGKE TLKIPEQFIS SRIRFLDSIV PLVEDSLKRL VDFPYGCVEQ TMSRFFPAVV AASVGIEVEN LEEIIQRGLF RLYSYQHNDG GWGWFRFDES DDFMTCYVME GLYFTMKAGY DVAESVLQRG IEYLREHPSA YGSYVLDLYG VDHEPFRPES EVDLVFLSLS SKEALKQLMN YVVQDEQKAY LKISSDNPLI SEIQLNSVFL RALAKWKEFP ELVRKVTNYL LLKKDSAFWT STKDTSFVIL ALLEAMPEYA STTLKVINSE NTFELKPGEE SSLVPGSLTV SGKGIVEVEV VYIEVPKEAV SEGLKIKREF YKRYELLIEE NKMIVDAFVP IGRGYVPHSI HPVEKEQNEE LYILPYKYWK KTIEYRGFPL EIDGAEVKIK GETYTFFRIE TFNGLILVVL RNEALVYDTE KNTITRYLDV TDAGFMRSGP VFLMKGFVLV GDEKIPVPED VTGLSCTMDE ILLRGENKTY WYRNGEFVDL PFVARRVFFW DGKRLVAESI RFSGSSKTLR NRVFEVVFDV EDVRIELGDI IKTVVRVEGD GNYLIVEDFI PSCAQVLSNY REKGIEENKF SYSWYFSWNA WYSGREIRTD RVALFARYLY GDSFDYVWRA TAEGVFHLLP ARVYPMYSRD LYAHTDPDVL FIGADFIDGR DDQP
|
| |