Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1554 |
Symbol | |
ID | 4600903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1504467 |
End bp | 1505531 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639774328 |
Product | extracellular solute-binding protein |
Protein accession | YP_920953 |
Protein GI | 119720458 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.202841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAGC TCAAAGTCCT AGGAATACTT CTCGTAATCG TACTAGCAGT AGCCATCGGC GCATTGGTAT ACTTTGGCTC TAAACCTTCT CAGCCGGCTA CACCGGCAAA GGCTGTCGTC GAGCTACCGA CCGGCGAAAG AATAGAGGTT CCGGAGAACG TCGCCGGCAA GGTGGTATTC TACACAAGTA TACCAGACGT CATCGTAAAT TCTTGGAAGG GTAACTGGAG CAAGTACTTC GGGTCTACGA TATCGCTGGA GGTCTGGAGG TCCGGGACGG GGAAAGTGGT CGCTAAGCTC CTTGCGGAGA AAAAGGCCGG TAGCGTGGAG GCAGACGTTG TCTACATAGC ATCACCCTTC GAGTTCGAGA CCTTGATAAA CGAGAGCATC ATAGAGAAGT TCCCCGACAT TCCCGAGCTG AAATACATCC CCCAGGAGTA CAGGGATCCC AGAGGGTACT ACGTGTGGGG AAGAGTGCTT GTAATGGTGA TAGTGTATAA CCCGAACATA GTGACTGACC CGCCGAAGTC GTGGCAGGAC TTGGTGAAGC CTGAGTGGAA AGGTAAGGTA GTGATAGCCA ACCCACTCTA CTCGGGATCC ACGCAGGTCG CAGTAGCGGC CCTCGCCTCA AAGTTCGGGT GGAGCTACTT CGAAAAGCTG AAGGAAAACG ATGTACTCGT TGTACAAGAC GTGCCCGACG TCGCAAGGGT TGTTGCCACG GGTGAGAGAC CCGTAGGCGT GACGCTTACA ATGTACCTCG GGGCGTACCC CACGCTAAAG TTCGTAGCAC CGGAGGAGGG GGCCATAGCG ATACCCAGCC CCGTAGGCCT AGTAAAGAAC GCCAAGCACC CGGAGGACGC TAAGGTGTTC TTGAGGTTCC TGCTCTCCAA GCTAGGGGCC CAGGCCTTAA CCGATGCCTA CACCTACTCT ACCAGGATTG ATGCCCCTGC GCCGAAGGGT CTTCCACCGC TCTCCCAGCT GAAGATCCTG AAGGTAAGCA TGGACGAGCT AAGGCCCATT GTGAGCCAAA TAAGGGATAA GTGGACGCAG ATATTCGGTG GATAA
|
Protein sequence | MPKLKVLGIL LVIVLAVAIG ALVYFGSKPS QPATPAKAVV ELPTGERIEV PENVAGKVVF YTSIPDVIVN SWKGNWSKYF GSTISLEVWR SGTGKVVAKL LAEKKAGSVE ADVVYIASPF EFETLINESI IEKFPDIPEL KYIPQEYRDP RGYYVWGRVL VMVIVYNPNI VTDPPKSWQD LVKPEWKGKV VIANPLYSGS TQVAVAALAS KFGWSYFEKL KENDVLVVQD VPDVARVVAT GERPVGVTLT MYLGAYPTLK FVAPEEGAIA IPSPVGLVKN AKHPEDAKVF LRFLLSKLGA QALTDAYTYS TRIDAPAPKG LPPLSQLKIL KVSMDELRPI VSQIRDKWTQ IFGG
|
| |