Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1578 |
Symbol | |
ID | 4600564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1530543 |
End bp | 1531577 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774351 |
Product | inner-membrane translocator |
Protein accession | YP_920976 |
Protein GI | 119720481 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4177] ABC-type branched-chain amino acid transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0304091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGACG TGGTGGGGTT CCTAATTAGC TGGCTGACCC TGCTCGGGAT CTACGCTATA CTCGCGTTAA CGCTGAACTT CGAGTCGGGT ACGAGCGGGG TGACGAACTT CGGGAAGGTC TTCTTCTACG GGCTCGGCGC GTACGTAGGG GCAACGCTCA CCGTTTACCT CTTCCTACTG CTGAACGGGG TCGATGTGTC GAAGACCCCG CCCTACGACG TCAGCGGCAT AGTGTTGCTC GGGGAGCTCG CAGGGAAGAA CCCAGCCCTG AACCTGGGGC TCCTAGCACT CTCGCTCGCC ACAGCCTTCC TAGTTGCAGG CGTGATCGGC TACCTGCTGA GCTACCCGAT CATCAGGGTC GGCCCGGCAT TCGTGGGCTT CACGCTTCTA AGCACGGGCG AGCTTTTCCG CATATTCCTC CAGCACTATG AGCCCGTCGG GGGTAGCAGG GGGCTCATGT CCATACCCGG GCCGTTCTCC TGGGTGCCCC AGCCGAGGCT CAGGGAGGTG CTCTTCCTAG CACTCGTGCT CGCAGTACTC GCGGCTACGT ACCTCGTGAT GGACAGGTTG ACGAACTCGC CCCTCGGCAG GACTCTGAGG GCAGTCAGGG ACGACGAAGT AGCGGCGCTC TGCATGGGTA AGCACGTGCC GAAGCTTAAG GCGACGGTCC TCTTCATAGG CTCCGGTTTC AGCGGGGTTG CCGGCGTGCT CCTCGCCTAC TACCTAACCT CCGTGAACCC AGACATGTTC GTCCCCGCAG TTACGTTCAA CGTCTGGGCG ATGATAATCC TGGGCGGTAT GGGCAACCTC AGGGGGGCGC TACTCGGCGC GTCCATATTC ACGTTCATCG ATAGGGCTCT CTCCTTCGTA ACACCGCAGC TAGGCGTCAC GGTGATATCT CCCGACTACG TGAGGGGGTT CGCGGTGGGG CTCGTGATGG TCCTCGTACT GCTCTACAGG CCGCAGGGGC TACTACCCGA GGGGAGGGTC GAAACTGTGG CTTGGGAGGA GCTCGGGGGT GGTGCTAGTG GCTAA
|
Protein sequence | MIDVVGFLIS WLTLLGIYAI LALTLNFESG TSGVTNFGKV FFYGLGAYVG ATLTVYLFLL LNGVDVSKTP PYDVSGIVLL GELAGKNPAL NLGLLALSLA TAFLVAGVIG YLLSYPIIRV GPAFVGFTLL STGELFRIFL QHYEPVGGSR GLMSIPGPFS WVPQPRLREV LFLALVLAVL AATYLVMDRL TNSPLGRTLR AVRDDEVAAL CMGKHVPKLK ATVLFIGSGF SGVAGVLLAY YLTSVNPDMF VPAVTFNVWA MIILGGMGNL RGALLGASIF TFIDRALSFV TPQLGVTVIS PDYVRGFAVG LVMVLVLLYR PQGLLPEGRV ETVAWEELGG GASG
|
| |