Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0210 |
Symbol | |
ID | 4602221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 188817 |
End bp | 190646 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639772964 |
Product | glycoside hydrolase family protein |
Protein accession | YP_919623 |
Protein GI | 119719128 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.044129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAAAG TATCACCTGG GAAAATCGGA GCTGTAATCT TACAGGGAAG CAAGGTATCG CTGAACTTTA CCGTGGAGAA CGATGAAGGC ACTCCCGTAA AGGGCACTCC TTTCTTCAAC CTAAAGGCTC AGGGGAAGCA ACAGATATAC GTTGTGTTCG AGCAAGCGGT TATTCAGCCT GGAGAAAAAG CAACGTTCAA CTTTGAAGTT AAATTCGACG TACCGGTTGG AGAGGCTGTC GCGTACTTCT GCTTCGAGAC GGACAAGGCT GTATGTGAAG AGAGGCAGGT CTACATCGCG GGGGAGGGTG AAGCCATCTA CGTAGCTTTC GTGTGGCATC ACCACCAGGC CCCCCAGTTT TACCCCGACG GGAGCCTAAA GGACGAGTGG GCATTTATAC ACGTCGCTAA GGGCGACTTC TACGCGTATT CGGGGGGACC CTACAAGGTA CACATGGAGA CGCACAAACG TATACCAGGC TTTATCGATG TAGACCACTT CTCCCCGTCG CTCCTGGAGC AGTGGTTGCT TTTTCTATCG GGTAAGCTAA GGTCCTCGAC AGCAACCAAG GAGGACGTAG AGGGGTTGCT GAGCTTTCTT AGGGAGAAGA TCCGGGAAGG CATCGTGGAG CCCCTGGGAA GCGTCTACGC CCACACGGTT CTCGGCCTCG TTCTGAAGAA GGCTAAGCAG AGGGGCCTCG ACGAGATAGC GAAGAAGTTG ATCGAGTGGG AGCTCCGGGA GGGCTTGAAG ATTGTGGAGG AAGCCCTGGG CCGCCGGCCG AGCGGGCTTT GGACACCCGA AATGTTCTGG CACATGGACC TCGTCGACCT CTACGGCTCG CTCGGAGTAA GGTACACCGT CCTGTGCGAG CAACACTTTA CGCGCGCTGG TGGGGACAAG GAGAACATTT ACGAGCCATA CGTCGTCGAG GACCCTATTT CCGGGAGATC CGTCGTCGTC TTCTTCAGGG ACCTAAAGCT CAGCAACTGG ATTAGCTTCA AGGTTGACTT CAAGAATCCA GAGGAAGCGG ACAACGAAGC TAGAAGGTTT GTTATAGAGC TGGCTAAGAG GAGGGAGGTC GCGCCAGGCG GGATTGTAAC CATAGCTCTC GACGGCGAGA ACTGGATGAT CATGCCGGCG TACAGGAAGT ACGCGCCGTA CTTCCTGGAA AAAGTGCTCG AGTACATCTT GGAGAGCCAG GTAATAAGGC TCACGACTTT ATCGAAGTAC CTCGACCAGA ACCCGCCGAA ACGCGTCCTT GACTACATAC CGTCGGGCTC CTGGGTAGAG CTCTCGGATA AGCAGTGGAC CGGGGGCGCA AAGGACGAGC TCTGGAACGA GGCAATGGAA ACTTTAGTCT ACGTTGAATC AGCGTATAGA TTACTGGAAC CGGAGGCTGA GCGCCTGCTC GCAGATCCCG ACTCACCACT CTACAGGCTG TTCAAAGCCA TCGCTATAGC CATAGACAGC GACTTCTACT GGTACGGAGA GTTGGAAAGA GAACGGGAGT TTATAAAGGA GTGGTTAGCA GAGGCGCGGA AGATAGCTGG AGAAATACTC GGAAACCTTA AGGCAAGGGA AGTCGGGAGG ACGAACAACC ATGCTTTCAT CGAAATTGAA AACAGAAATC TCTTCTCGGC GAAGGTGAGG ATAGTTTCAG AGGCTAAGAG CCGAGTCGAT GAAACGAGTA TAATTATCCC AGCGTCCTCG AAGGTAACCC TTCCCGTCTA CGTCGGTGAC TCAAATACCC TTGTGAGAGT GGTTTCCGGT AAAGTAACTC TTCAAGTTGT TGGCGGCGTT GACGGTTCTC CGGCGCCCCT CTCTCTTTAG
|
Protein sequence | MVKVSPGKIG AVILQGSKVS LNFTVENDEG TPVKGTPFFN LKAQGKQQIY VVFEQAVIQP GEKATFNFEV KFDVPVGEAV AYFCFETDKA VCEERQVYIA GEGEAIYVAF VWHHHQAPQF YPDGSLKDEW AFIHVAKGDF YAYSGGPYKV HMETHKRIPG FIDVDHFSPS LLEQWLLFLS GKLRSSTATK EDVEGLLSFL REKIREGIVE PLGSVYAHTV LGLVLKKAKQ RGLDEIAKKL IEWELREGLK IVEEALGRRP SGLWTPEMFW HMDLVDLYGS LGVRYTVLCE QHFTRAGGDK ENIYEPYVVE DPISGRSVVV FFRDLKLSNW ISFKVDFKNP EEADNEARRF VIELAKRREV APGGIVTIAL DGENWMIMPA YRKYAPYFLE KVLEYILESQ VIRLTTLSKY LDQNPPKRVL DYIPSGSWVE LSDKQWTGGA KDELWNEAME TLVYVESAYR LLEPEAERLL ADPDSPLYRL FKAIAIAIDS DFYWYGELER EREFIKEWLA EARKIAGEIL GNLKAREVGR TNNHAFIEIE NRNLFSAKVR IVSEAKSRVD ETSIIIPASS KVTLPVYVGD SNTLVRVVSG KVTLQVVGGV DGSPAPLSL
|
| |