Gene Information |
Locus tag | Tpet_1757 |
Symbol | |
ID | 5170509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 1758233 |
End bp | 1762837 |
Gene Length | 4605 bp |
Protein Length | 1534 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640564279 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001245334 |
Protein GI | 148270874 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
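
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Tpet_1757.
start_bp, end_bp = 1758233, 1762837

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 4605, matches "Gene Length"
print(protein_length_aa(length))  # 1534, matches "Protein Length"
```

The strand field (-) does not affect this calculation, since the length depends only on the span between the two coordinates.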
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000289734 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAT TGATTTTCTT GGTTTTTCTT CTGATCAGCT TCAGTCTTTT TGGAGGATAC GCCTATTTCT CTAGATATCC TGTTCTGCAT CCAGATGATG GACTCTTCTT TGTGATCTCA GATCTGGAGG ATGTCACCCT GAACGTGTGG AAGATAAGTG AAGAAGACTT TTTGAAGGCA GTTTTTGATC CAGAAAACTT CAACTTCTCG CTGTTAGAGA TAACACGTCC TATCTACAGT AAAAAGTTCT CCTCAGAGGA GTGGAAAGAA TTCTCATTTC CACTGAAAGA CAGGGGATTT TACTTTGCGA CTCTGGTTTC CAACGAGGGG ACGGTTTTCA GAAAGGTGAT GGACAGGGGC CTGTTCATCG TCACCGATCT GGAGGTGATT TACTTTTCCG ACAGCGAAAA GCTGAGGCTC CACGTGTTCG ACTCAGATGG TGATTTTGTG GAAGGAGCGG AGGTCCTTCC CTTTGAAGAT TCGAAACTGA TCGACAGAGT TTTCACCGGC AAAGACGGGG TCGTTTCTAT CACGAAACAC TTCGACACGT TCTACATCAG GTACGGGGAC TCTCGATTTT TGGGAGGGGT GTACTTTTCA GGTGGAGGGC TTGAAAGAGA AAAGCTTTTC TTTGTCACAG ACAGGCCGAT CTACAAACCT TCCGACACGG TCCACTTCAG AGGTCAGATC TTCTCTTTTG AAGAGGGTCT CTACAGAGCC TTTGAGAAAA CGAAAGTAAC CGTTTCCATT TTCGACACAA AGAACAACGA AGTTTACAGA TCGGAGCTTG AAACCGACGA GCTCGGTGGA TTCAACGGTT TTATGAAGCT TCCAGACACA GCCCCGGTCG GACTCTACAG GGTGAAGGTC GATCATGGAG GAAGACGCTA CTACGAATAT TTTCTGGTGG AAGAATACAG AAAGCCCGAG TACAAAGTCG AAATCGAAAC GGATAAAGAC TTGTACATAT CCGGCGAAGT TGTGAACTAC CTTGTCAGGG TGAAGTACTT CAACGACCAG CCTGTTGCGA AAGCGCAGGT TGCCTACTAC GTTCGAGCCT TTCCAGAGGA AGGAAGCGGA TATCTGGTTT ACAGGGGAAC GGACTTCACA GACGAAGAGG GAAACCTCAG ACTCGGTGTG AAAACAGAAG AAGGATTTCA GGGGTTCTAC CGGCTGGAGG TGATCGTGAC GGATGAAAGC CAGCGTCAGA TTGAGGAAAC AAGGTCTGTG AAGGTGTACG CAGACAACGT TCTGATATCT CCGATGAATC GGTACGTTTT CGCTTCACCG GGCAAGCAGG TGAGGGTGAA GGTGAAAGTG ACGGATCTTT CCGGAAATCC TTTAGATGGA CTGCTCACCG TCTCTTCCAA AGATTCAACG AGTACGGTGG TCGTGGAAAA CGGCGAAGCG ATCGTTACTT TCACTCCAAA AGAACCAGAA AGTTACAGAA TAGAACTCTC CTTCGGAAAG GCGAAAACTC ACCTCTACGT GTACGCTTAC TACGGTGCAG GAACAAGCAG CGAGTTCGTC ATTAATCCAG CAACGAACAC GGTGAAACCT GGAGATGAAC TTTCGGTTCA GATCCTTGCA CCTGGTAAGG TGATGGGAGT TCTGGGAATC GTCTCAAACA GGGTTTACGA CACTATTCCT GTCTCCTTCA CCGGGTCTGT CAACCTGCGT GTCAGAATAC CGAAAGATAT CCCTGAGAAG AATCTCTTCC TCAGCTTCAT AGGACTCGAC GACAATGGGC GTATCTACAA GCTGGAAAGG CTGAACGTTC TGCTCGACAC GAATTTCACC 
ACCATGAAGA TTCTGTTCGA CAAGGATCAG TACGAACCTG GAGAAATGGC ACAGATCACG ATCGAATCGA ATGTGGACAG AGTCTGTCTT TTCCTCGTTG ATGAAGCGAT ATACGCCATG GTTGGAGCAG AACCACCGGT GCTCGAAAAC TTCCTCTATC CTCACATGAA CTATCCCCAA ACAAGAGGAG GATTTCCGCA TTACTGGAGA CTCTATGTTT CAAGGGATTC GTTCCGAAAC AAACTCGCTT CCCTCCCGGA GGAGAAGACC TTCGCCGATT TCAAACAGAA CGCCATTCCA TCTAAGTTGA ACGTCAGGGA GTACTTCCCG GACACGGCCC TCTGGATTCC TTCACTGGAG CTTCACAACG GAATCGCGAG GGTGAGCTTC AAGGTCCCAG ACAGCATCAC TTCTTTCAGG GCAACGGCCT ACGGTTTCTC AAAGGATCGA TTCTCCCAGG CAGAAAGCGA AATGGTCGTT TCCAAAAAGT TCTATCTGAT GCCTCACCTT CCGTCTTTTT TGAGGGAGGG TGATGTGATA AAAATATCCG CAACCGTTTT CAACAGGACT TCGAAGACGC TTCCGGTTCA ACTCATGGTG GAACTTCCCG AGAACATAGA ACTCCTCGAG GGGAGTTCCT CAAGACGCTT TTTGATGGAG GCGAACTCCT CACACACAGA GACCTGGACA GTGAAGGCTG TCTTTGCTTC TGAAGGAAGT TTTGTGAAAT TTACTGCGGT TGGAGGTGAT CTGAGCGACG CGGTCTCCAT GAGACTGCCC GTTGAAAGAT TCGCTTTTGA AAGGGAATTC TACCGCATCA TGCTCTTGGA CGGGAAAGAG ACGCTGAAGA TCCCGGAGCA GTTCATCTCA TCGAGGATAA GGTTTCTGGA CAGCATCGTT CCGCTCGTCG AGGATAGCCT GAAAAGGCTG GTAGACTTCC CGTACGGTTG TGTCGAACAG ACCATGAGCC GGTTCTTCCC GGCCGTGGTT GCAGCAAGTG TAGGAATAGA GGTGGAGAAC CTGGAAGAGA TCATCCAGAG GGGGCTGTTC AGACTCTACT CCTACCAGCA CAACGATGGC GGTTGGGGAT GGTTCAGATT CGACGAATCC GATGACTTCA TGACCTGCTA CGTGATGGAA GGGCTGTACT TCACCATGAA GGCGGGATAC GATGTCGCAG AAAGCGTCCT GCAGAGAGGA ATAGAGTATC TCAGGGAACA TCCGTCGGCC TACGGATCGT ACGTTCTCGA TCTGTACGGA GTAGATCACG AGCCGTTCAG GCCAGAAAGC GAAGTGGATC TGGTGTTTCT GAGTTTGAGT TCAAAAGAGG CTCTGAAACA GCTGATGAAC TACGTCGTCC AGGACGAGCA GAAGGCCTAT CTGAAAATAT CTTCCGATAA CCCCCTCATC AGTGAAATCC AGCTCAACAG CGTTTTCCTC AGGGCTCTTG CAAAGTGGAA AGAATTTCCG GAACTGGAAA GAAAACTGGC AAATTACCTT CTCTTGAAAA AAGACAGCGC CTTCTGGACT TCCACAAAGG ACACGTCCTT TGTTATTCTG GCTCTCCTTG AGGCGATGCC GGAGTACGCT TCAACCACAC TGAAAGTCAT CAACTCCGAA AACACCTTCG AACTGAAGCC AGGTGAAGAA AGATCTCTCG TTCCCGGTTC ACTGATCGTT TCTGGAAAAG GCATCGTGGA GGTGGAGATA ACCTACGTCG AAGTTCCGAA AGAGTCTGTG AGTGAAGGCC TGGAGATAAA AAGAGAATTC TACAAAAGGT ACGAACTCCT GATAGAGGAG AAAAAAATGA 
TTGTAGATGC CTTCGTGCCG ATCGGGAGAG GATACGTACC GCACTCGATA CACCCTGTCG AGAAAGAGCA AAACGAAGAA CTCTACATCC TGCCGTACGA GTACTGGAAG AAGACAATCG AATACAGAGG ATTTCCCCTC GAGATAGACG GTGCAGAAGT GAAAATAAAA GGAGAGACTT ACACGTTCTT CAGGATCGAA ACGTTCAACG GCCTGATTCT TGTAGTTCTC AGAAACGAAG CGCTCGTCTA TGATACGGAA AAGAACACTA TCACCAGGTA TCTGGACGTG ATGGACGCAG GTTTCATGAA GAGTGGTCTT GTCTTTCTCA TGAAAGGATT CGTGCTGATC GGTGATGAAA AGATACCCGT TCCTGAAGAC GTTACGGGGC TGTCCTGCAC GATGGATGAG ATCCTGCTGA GGGGAGAAAA CAAAACGTAC TGGTACAGGA ACGGAGAGTT CGTGGATCTT CCGTTCGTTG CCAGAAGGGT ATTCTTCTGG GATGGAAAAA AGCTGGTTGC GGAGAACATA CGCTTCAGCG GGTCTTCAAA GACTCTTCGG AACAGAGTTT TCGAGGTGGT CTTCGATGTT GAAGATGTGA GGATAGAGTT GGGAGACATA ATCAAAACGG TGGTGAGGGT TGAAGGAGAC GGGAACTACC TCATAGTGGA GGATTTCATT CCGTCCTGCG CGCAGGTGCT CTCGAACTAC AGAGAAAAAG GGATCGAGGA AAACAAGTTC TCGTACAGCT GGTACTCTTC ATGGGACGCG TGGTACTCTG GAAGGGAGAT TCGAACGGAC AGGGTTGCGC TCTTTGCCCG ATACCTTTAC GGTGATAGTT TTGACTACGT CTGGAGGGCG ACTGCAGAGG GGGTGTTTCA TCTTCTTCCA GCACGGATTT ATCCGATGTA TTCTCGTGAT CTCTATGCTC ATACAGATCC AGATGTGCTT TTCATCGGGG CGGATTTTAT CGATGGAAGA GATGATCAAC CTTGA
|
Protein sequence | MKRLIFLVFL LISFSLFGGY AYFSRYPVLH PDDGLFFVIS DLEDVTLNVW KISEEDFLKA VFDPENFNFS LLEITRPIYS KKFSSEEWKE FSFPLKDRGF YFATLVSNEG TVFRKVMDRG LFIVTDLEVI YFSDSEKLRL HVFDSDGDFV EGAEVLPFED SKLIDRVFTG KDGVVSITKH FDTFYIRYGD SRFLGGVYFS GGGLEREKLF FVTDRPIYKP SDTVHFRGQI FSFEEGLYRA FEKTKVTVSI FDTKNNEVYR SELETDELGG FNGFMKLPDT APVGLYRVKV DHGGRRYYEY FLVEEYRKPE YKVEIETDKD LYISGEVVNY LVRVKYFNDQ PVAKAQVAYY VRAFPEEGSG YLVYRGTDFT DEEGNLRLGV KTEEGFQGFY RLEVIVTDES QRQIEETRSV KVYADNVLIS PMNRYVFASP GKQVRVKVKV TDLSGNPLDG LLTVSSKDST STVVVENGEA IVTFTPKEPE SYRIELSFGK AKTHLYVYAY YGAGTSSEFV INPATNTVKP GDELSVQILA PGKVMGVLGI VSNRVYDTIP VSFTGSVNLR VRIPKDIPEK NLFLSFIGLD DNGRIYKLER LNVLLDTNFT TMKILFDKDQ YEPGEMAQIT IESNVDRVCL FLVDEAIYAM VGAEPPVLEN FLYPHMNYPQ TRGGFPHYWR LYVSRDSFRN KLASLPEEKT FADFKQNAIP SKLNVREYFP DTALWIPSLE LHNGIARVSF KVPDSITSFR ATAYGFSKDR FSQAESEMVV SKKFYLMPHL PSFLREGDVI KISATVFNRT SKTLPVQLMV ELPENIELLE GSSSRRFLME ANSSHTETWT VKAVFASEGS FVKFTAVGGD LSDAVSMRLP VERFAFEREF YRIMLLDGKE TLKIPEQFIS SRIRFLDSIV PLVEDSLKRL VDFPYGCVEQ TMSRFFPAVV AASVGIEVEN LEEIIQRGLF RLYSYQHNDG GWGWFRFDES DDFMTCYVME GLYFTMKAGY DVAESVLQRG IEYLREHPSA YGSYVLDLYG VDHEPFRPES EVDLVFLSLS SKEALKQLMN YVVQDEQKAY LKISSDNPLI SEIQLNSVFL RALAKWKEFP ELERKLANYL LLKKDSAFWT STKDTSFVIL ALLEAMPEYA STTLKVINSE NTFELKPGEE RSLVPGSLIV SGKGIVEVEI TYVEVPKESV SEGLEIKREF YKRYELLIEE KKMIVDAFVP IGRGYVPHSI HPVEKEQNEE LYILPYEYWK KTIEYRGFPL EIDGAEVKIK GETYTFFRIE TFNGLILVVL RNEALVYDTE KNTITRYLDV MDAGFMKSGL VFLMKGFVLI GDEKIPVPED VTGLSCTMDE ILLRGENKTY WYRNGEFVDL PFVARRVFFW DGKKLVAENI RFSGSSKTLR NRVFEVVFDV EDVRIELGDI IKTVVRVEGD GNYLIVEDFI PSCAQVLSNY REKGIEENKF SYSWYSSWDA WYSGREIRTD RVALFARYLY GDSFDYVWRA TAEGVFHLLP ARIYPMYSRD LYAHTDPDVL FIGADFIDGR DDQP
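
As a spot check on the two sequence fields, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. The sketch below is an illustration under stated assumptions, not the database's own procedure: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons), and `translate` is a hypothetical helper that ignores the spaces separating the 10-base blocks.

```python
from itertools import product

# NCBI-style codon table in TCAG order; '*' marks a stop codon.
# Table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-separating spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final 15 bases of the gene sequence.
first_blocks = "ATGAAGAGAT TGATTTTCTT GGTTTTTCTT"
last_blocks = "GATGATCAAC CTTGA"

print(translate(first_blocks))  # MKRLIFLVFL
print(translate(last_blocks))   # DDQP
```

Both results agree with the start (MKRLIFLVFL) and end (DDQP) of the protein sequence listed above. Because the gene sequence is already given in the coding direction (it begins with ATG and ends with TGA), no reverse complement is needed despite the gene lying on the minus strand.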