Gene Information |
Locus tag | Tpet_1778 |
Symbol | |
ID | 5171404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 1784464 |
End bp | 1788522 |
Gene Length | 4059 bp |
Protein Length | 1352 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640564299 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001245354 |
Protein GI | 148270894 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
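The coordinate and length fields above are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1784464, 1788522)
print(length, protein_length(length))  # prints "4059 1352"
```

The result matches the Gene Length (4059 bp) and Protein Length (1352 aa) fields.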
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000145479 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGTT TGTTTTTTTC TTTCATGGTC TTTATGGCTG CTTTTCTCTT TGGAGGATAT GCGTATTTTT CCGGAGATCC TATACTGCGA GTTGGTGAAG GAATTTCTTT CGTCGTTTCG GACATGGAAA ATGTTGTCTT GAATGTGTGG AAAGTGGAAG ACGAAGAGAC ATTCTTGAAA GCCGTCCTCA CCGAAGAAAA TTTCGATTTC TGGTGGCAGG AGAACAATCC CCTTGTTTAC AGAAAAACAT TCTCTTCAAG GAACGAGTGG AAAGAATTCT CCGTTCCTTT GAAAGAAAAG GGCTTTTATT TTGCCACTTT GACAACCCCG TTGGCAGGAA CGTCTGTGAT AACAAGAGAA CTCGACAGGG GATTGTTCAT CGTTACCGAT CTTGAGGTGA TTTACTTTTC CGATGGAAGC AGGACCATGC TCCATGTTTT TGATATCGAC GATGGTTTCG CGGAAAAAGC AGAAGTGTTC TTGTACAGAG ATTCAATATT AGTTGGCAGA CTTTTCACCG ATGAATATGG AACCGTCGAA GTCAACGATC GTTTCGATAC GGTTTATGTC AGGTATAAAG ACTCTCGCTT CATTGGTAAT GTGTATTCCC CTAGAAGATT TGTTGAAGAT GAGAAACTCT TTCTCATCAC GGACCGACCC ATTTATAAAC CGTCCGATAC TGTTTACTTC AGAGGACAAC TTTTCAAGGT TGATGGGAAC GTGTACAGGG CTTTTGAAAG CACAGGAGTA ACCGTGACTG TTTTCGACAC GAAGGAAAAC GAGATCTACA GGTCAACTTT TGAAACCAGT GCACTCGGAG GCTTCAGCGG AGATTTGAAA TTACCAGACA CGGCTCCAGT TGGTCTTTAC AGGATAAAGA TCGAGCATGG TGAGAACACG TTCTGGGAGA ATTTCCTGGT TGAGGAGTAC AGAAAGCCAG AATACAGAGT AAACATCGAA ACCGACAGGG AAACGTACAT ATCTGGAGAA GTTATAAACT ATCGGATCAG GGTAAAGTAT TTCAACGATC AGCCGGTCGA AAAAGCAGAG GTTGCCTATT ACGTTCATGC GTTTCCGGAA GACGAAAATG GTTATCTGGT CTACAGAGGA ACAGGCTTTA CGAACGAAAA GGGAGTTCTG GAGTTTGGCG TGAAGACCCA GGAAGGTTTT CAGGGAGTCT ACGCTCTGGA GGTCATCGTG GTAGATGTGA GTCAGAGACA GATCGAAGAA AAAAAGACGG TAAAGGTATA CGCTGATGAC GTTTTAATCT CTCCACAGGA CAGGTTCGTT TACACATCTC CTGGAAATCA GGTGAGGATG ACTGTGAAGG TAACGGATCT GTCAGGAAAC CCTCTGGAAG GTGTTCTCTA CGTCCTGCAC GATGATTCAA CGAGCACAGC GGTTGTGGAA AACGGTGAAG CAACTTTCAC GTTCGTTCCA TATGAAGTCA GAGACTATAA ACTGGAACTG TCCTTCAAGA AGGCAAAGAC ACATGTTTAC GTGTATGCCT ACCATGGTGC AAGAACGAGC AATGAGTTTG TGATCGTCGC AGAAAGCGAC ACGGTGAAAC CTGGTGATAG AATACCGGTT CGTCTCCTTG CCCCTGGTAA TGTGAAGGGT GTTCTGGGAA TTACTTCGAA GAGGATCTAC AAAACGATTC CCGTTATCTT CACAGGTTCC ACCGAATTTC TCGTTGAAAT CCCTGAAAAT ATCCTGGAAA AGAACATTTT CTTTGTTTTT ACTGGATTCG ACGAAAAAGG CGAAATCTAT CTGACGAAAA AACTGGATGT TTCTTTGAAT 
ACCAATTTCA CCGACATGGA GATAAAGTTC GACAAAGACC AGTACGAACC AGGAGAAATC GCACAGATCA CCATAAAATC CAGTGTGGAT GAGATCTGCA TCTCGGTGGT CGATGAGGCG ATATACGATC TTGTGGGGAC AGAGCCACCT GTTCTCGAAG AATTTCTGTA TCCTTCCATG AATTATCCAC TGGTGAGAGG AAACTTCGCG TCCGGATGGA TACTTTACGT TTCGAGGAAT TCGATCCGAA ACAAACTTGC TTCGTTGCCC GAAGAGAAAA CCTTTGCCGA TTTCAAACAG AATGCTTTCC CCAGTAGGGT GAATGTCAGA GAATACTTTC CTGACACAGT CCTGTGGATA CCTGATGTGA AACTCCACGA CGGTACGGCG AGGATAAGTT TCAAGGTGCC AGACAGCATA ACCTCGTTCA GAGCAACTGC CTATGGTTTT TCAAAGGACA GATTTTCTCA AGGAGAAGAA ACCATCGTTG TTTCTAAGGA TTTCTACATA ACACCACATC TTCCCTCCTT CCTGAGGGAA GGAGACATCA TGAGACTATC TGCAACTGTT TTCAACAGAA CGGGGAAGGA GCTTTTCGTC GAGATCAGAA TAGAGCTGCC TGATAACATA AAACTCGTTG AAGGAAACTC GTCGAGACAT TTCTTGATGG AGGCGAACTC ATCTCACACT GAGACCTGGA CCGTGAAGGC GATTTCTCCT TCCGAGGAAA GTTTCGTGCA ATTCTTTGCA AGCGGAGGCG ATCTGAGTGA TGCGATTTCA CTGAAAGTTC CTGTCAGAAG GTTCGCTTTT GAGAGGGAAT TTTATAGGAT TATGTTCATA GACGGAGAAG AAACAGTGAC GCTTCCAGAA GGGCCGTTTG TTTTCTCGAA GATAAGGTTT TTGAGCGGCA TAACACCACT CATCGAGGAT AGTTTGAGGG AACTCATAGG ATTTCCGTAC GGATGTGTTG AGCAGACCAT GAGTCGGTTC TTCCCTGCGG TGGTTGCGGC CAACCTGGGG TTGAAGGTGG AAGATCTGGA TGAAATCATA CGAAAAGGAT TGTTCAGAAT ATATTACTAT CAGCATGCTG ACGGTGGATG GGGATGGTTT GTTACAGACA AAACCAACAA TTTCATGACA TGCTATGTTA TGGAAGGGTT GTATTTCACA ATGAAAGCAG GTTATAATGT GGCAGAAAGT GTTTTGGAGA GAGGAATAGA GTACCTCAAA GCCCATCCAT CGGCTTACGG TTCCTACGTT TTGGATCTGT ACGACATCGA GCACGAACCG TTCGAACCCC AGACCCCCGC AGATCTGGTT TTCCTGAGTA TGGAGTCAAA AGAAGCGTTG ACGAAGGTGC TGAAGTACGT AGTCCAGGAC GATCAAAAGG CCTATCTGAA GGTGACTTCT GAAGACCCCC TCATCAGTGA GATACAACTC AACAGTGTCC TGCTCAAAGC CCTTGTGAAA TGGAAAGAGT TCCCTGAACT CCAGAAAAAA CTGGTTAACT ACCTTCTTTT GAAGAAAGAG GGATACTTCT GGACTTCCAC GAAGGACACC TCTTTTGCGA TCCTTGCACT CCTTGAAGCG ATGCCAGTGT ACGAATCTAC CTCTTTGAGG GTTTTAAACT CTGGAAAAGA GTTCGTACTG AGATCAGGAG AGGAGAGCCA ACTTGTCCCT GGACCTCTGA TTGTCTCGGG CAATGGTGTG GTGGAGGTGC ACGTTGTACA TCCAGTTGTA CCGAAAGAGT CTGTGAGTGA GGGCATGGAG ATAAAAAGAA AGTTTTACAA GAGGTATGAG CTTTTCATAG 
AAGAGATAAG AAGCCTCGTG GATGCCTTCG TGCCGCTTGG AAGAGGCTAT GTACCGCATT CGATACACAG GGTTGAGGAA GAAAAGGACG ATGAACTCTT CATCTTACCT TATGAACTCC GGAATGAAAC TGTCGAGTAT GGAGGAATCA CCATGAAAGT GAATGGTAAC AGAGTCGAGA TAAAAGGAGA GACGTACTCG TTTTCCAGGA TAAAGACACA AAACGGTTTG ATTCTCATAG TTCTTGGAAA TGAAGCTCTC GTCTATGATC CGGGAAAAAA CACAGTCACA AGGTATCTGG AGGTTCTGGA CGGAGATTTC GTTGGAGACA GGGGGGAGAC AGATTCTACT ATGTGGCTGC AGAAGGGTAC GACCTGGAAA AATTATCGGA TGTGTCTTTT TCGGAAGAAG AAATCAGAGA ATGGACAAAA GGAGAATGTT GTTCCATAA
|
Protein sequence | MRSLFFSFMV FMAAFLFGGY AYFSGDPILR VGEGISFVVS DMENVVLNVW KVEDEETFLK AVLTEENFDF WWQENNPLVY RKTFSSRNEW KEFSVPLKEK GFYFATLTTP LAGTSVITRE LDRGLFIVTD LEVIYFSDGS RTMLHVFDID DGFAEKAEVF LYRDSILVGR LFTDEYGTVE VNDRFDTVYV RYKDSRFIGN VYSPRRFVED EKLFLITDRP IYKPSDTVYF RGQLFKVDGN VYRAFESTGV TVTVFDTKEN EIYRSTFETS ALGGFSGDLK LPDTAPVGLY RIKIEHGENT FWENFLVEEY RKPEYRVNIE TDRETYISGE VINYRIRVKY FNDQPVEKAE VAYYVHAFPE DENGYLVYRG TGFTNEKGVL EFGVKTQEGF QGVYALEVIV VDVSQRQIEE KKTVKVYADD VLISPQDRFV YTSPGNQVRM TVKVTDLSGN PLEGVLYVLH DDSTSTAVVE NGEATFTFVP YEVRDYKLEL SFKKAKTHVY VYAYHGARTS NEFVIVAESD TVKPGDRIPV RLLAPGNVKG VLGITSKRIY KTIPVIFTGS TEFLVEIPEN ILEKNIFFVF TGFDEKGEIY LTKKLDVSLN TNFTDMEIKF DKDQYEPGEI AQITIKSSVD EICISVVDEA IYDLVGTEPP VLEEFLYPSM NYPLVRGNFA SGWILYVSRN SIRNKLASLP EEKTFADFKQ NAFPSRVNVR EYFPDTVLWI PDVKLHDGTA RISFKVPDSI TSFRATAYGF SKDRFSQGEE TIVVSKDFYI TPHLPSFLRE GDIMRLSATV FNRTGKELFV EIRIELPDNI KLVEGNSSRH FLMEANSSHT ETWTVKAISP SEESFVQFFA SGGDLSDAIS LKVPVRRFAF EREFYRIMFI DGEETVTLPE GPFVFSKIRF LSGITPLIED SLRELIGFPY GCVEQTMSRF FPAVVAANLG LKVEDLDEII RKGLFRIYYY QHADGGWGWF VTDKTNNFMT CYVMEGLYFT MKAGYNVAES VLERGIEYLK AHPSAYGSYV LDLYDIEHEP FEPQTPADLV FLSMESKEAL TKVLKYVVQD DQKAYLKVTS EDPLISEIQL NSVLLKALVK WKEFPELQKK LVNYLLLKKE GYFWTSTKDT SFAILALLEA MPVYESTSLR VLNSGKEFVL RSGEESQLVP GPLIVSGNGV VEVHVVHPVV PKESVSEGME IKRKFYKRYE LFIEEIRSLV DAFVPLGRGY VPHSIHRVEE EKDDELFILP YELRNETVEY GGITMKVNGN RVEIKGETYS FSRIKTQNGL ILIVLGNEAL VYDPGKNTVT RYLEVLDGDF VGDRGETDST MWLQKGTTWK NYRMCLFRKK KSENGQKENV VP
|
| |
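The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any per-base statistic such as GC content is computed. The following is a minimal sketch of that step, not the method the database itself uses. The function name is hypothetical, and the example input is only the first 30 bases of the gene sequence, so its result is not the 45% reported for the full gene.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases in a sequence that may contain whitespace."""
    # Remove the spaces and line breaks used to group the bases for display.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return gc_count / len(bases)


# First three ten-base blocks of the gene sequence in this record.
fragment = "ATGAGAAGTT TGTTTTTTTC TTTCATGGTC"
print(round(gc_fraction(fragment), 2))
```

In this fragment 9 of the 30 bases are G or C. Passing the full gene sequence string to the same function would give the gene-wide fraction.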