Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1808 |
Symbol | |
ID | 4601801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1748148 |
End bp | 1749692 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639774581 |
Product | extracellular solute-binding protein |
Protein accession | YP_921206 |
Protein GI | 119720711 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA AGGTTCTCCT GGTACTAGCA GTACTCGTAC TCGTAGCCCT GGGAGTAGCT CTGGTAGTCC TACAGCAGAA AGCCCCTCCG AGCGCTCCGC CGGCTAACCA GACCCAGCAG CCTTCCCCTG GACCTGTTAA TCAAACGCCT TCCACGAAGC CGCCGGCTAA CCAGACCCAG CAGCCACCCG CGGGTGAGGG TATAACCCTT TACGTTATAA CACGGCACGA GCAGACAATC CAAGACTTGA CGAGAACAAT GTTCCTTAGT AGTAGCGTGG CGAAAAAGTA CAACATCAAG AACATAGTCT TCCTGCCTAT AAACGCGGAG CAGTGGCCCG ACTATATCAA GAACGCCGCC CAGAAGGGGC AGGGTATAGA CGTCGCCTGG GGAGGTGGAC CAACGCTGTT CAACCTGATA GACGACATGG GCTTGATCGA GCCGATAGAC TTGGACAAGC ACCCAGAGTT CAAGATAGTG ATGGACGAGG TCGCCAAGCT ACCCAAGACG ATTGCAGGGG CGGAGACGTA CAAGGTTGGA AGTGACGGAA AGATACACTG GATAGGCGCG AGCGTGAGTA GCTTCGGCTT CACGGTAAAC AGGGATCTTC TCAACAGGTA TAAGCTACCA ATGCCTCAGC GCTGGGCGGA TCTCGGGAAC CCCGCGTACG CCGTGACGTT GCCCGCGCTA CAGCTCGTAG GAATAGCAGA CCCCACGATG AGCACGAGCA ATACCAGGAT ATTCGAGATT ATACTCCAGG CGTACGGGTG GGATAAGGGT TGGCGCACAC TCACGCTCAT CGCGGCGAAC TCCAAGATAT TCAGCGGTAG TAGCGACGTA AGAGACGCTG TCATACGCGG GGATATCGCG ATAGGTACTA CGATAGACTT CTACGGCTAC ACGGCGCAGC AGCAGAACCC CTCGTGCCTC TACATAATAC CGCCAGGCGA GAGCATAGTG AACGCAGACC CCATAGCGAT CCTTAAGGGT TCGCGGCACC CCGAAGCGGC CGCCGCCTTC GTGGCCTGGG TTCTCAACGA GACGGGAGGA CAGCTGGTGT GGCTTGACCC GAACATAAAC AGGCTCCCGA TAAACCCGAG GGTTTTCGAC ACGCCGCAGG GCGCCCGGAG ACCCGACTTG AAGAAAGCAC TTGAGGACGT TATCAACGCC GGGGGTATAG CGTTCAACGA GTCACTCTCC TCCGCGTGGG TTACCGCCGT TGTAGACTAC TTCAAGGCGA CGCTCGTAAA CGCGCACGAC GACCTTCAGC CCGTCTGGGC CCAGATAGTG AAGGCTTACA GGGATAAGAA GATAACCGAG GCGCAGTTCC AACAACTAGT CACGCAGCTG ACAGACTTCG TTACGTTCAC GGACCCGCTT ACTGGGCAAC AGACGACCTT CACGCTCGAC TACGCTATAA AGATAAGCCC GAAGCTCGCC AGCGACCCCT CCATATACCA GAGGCTGATG AACGACTGGA CAAGCGCGGC GAGGGCAAAG TACCTAAAGG TGCAGTCGTT GCTAAAGCAG ATGACCGGAG GCTAG
|
Protein sequence | MNSKVLLVLA VLVLVALGVA LVVLQQKAPP SAPPANQTQQ PSPGPVNQTP STKPPANQTQ QPPAGEGITL YVITRHEQTI QDLTRTMFLS SSVAKKYNIK NIVFLPINAE QWPDYIKNAA QKGQGIDVAW GGGPTLFNLI DDMGLIEPID LDKHPEFKIV MDEVAKLPKT IAGAETYKVG SDGKIHWIGA SVSSFGFTVN RDLLNRYKLP MPQRWADLGN PAYAVTLPAL QLVGIADPTM STSNTRIFEI ILQAYGWDKG WRTLTLIAAN SKIFSGSSDV RDAVIRGDIA IGTTIDFYGY TAQQQNPSCL YIIPPGESIV NADPIAILKG SRHPEAAAAF VAWVLNETGG QLVWLDPNIN RLPINPRVFD TPQGARRPDL KKALEDVINA GGIAFNESLS SAWVTAVVDY FKATLVNAHD DLQPVWAQIV KAYRDKKITE AQFQQLVTQL TDFVTFTDPL TGQQTTFTLD YAIKISPKLA SDPSIYQRLM NDWTSAARAK YLKVQSLLKQ MTGG
|
| |