Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1196 |
Symbol | |
ID | 4600388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1134804 |
End bp | 1136171 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773972 |
Product | amino acid permease-associated region |
Protein accession | YP_920597 |
Protein GI | 119720102 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGAGC AGAGGCTACG TAGAAGGATC GGGTTGCTCG AGGCCTTCAG CTTCGGCTAC GCGGACGTAG GGGCAGGTAT CTACATGACC CTGGGGCTCG TCGCCGCCTA CGCCGGGCCA GCGACACCCC TAGCATTCGC AGTCGCGTCG GTGTCGTACC TTTTCACGGC TCTCAGCTAC GCGGAGCTCA GCGCAGCCTA CCCTGAGGCT GGGGGCGGGA TGGTATTCGC GGATAGGGCT TTTGGAAGGC TGGCCGCTTT CATAGCCGGG TGGAGCCTCC TGCTGGACTA CGTCGTTACC GGCTCTATAT TCGCTCTCTC GACTACGGGC TACCTGGGGC ACCTCTTCCC CTTGCTGAAG CGGGACGAGT TCTTCGGGCC CGTAGCGGCT CTGCTCGTCT TCTTCCTCGT CGTGCTGAAC ATTCTCGGCA TCAGGGAGTC CGCGGCCTTT AGCTCTGCGC TAGTCCTGCT CGACATCGCC GGTCTAAGCG TGATAATGGG TATAGGCTAC CTGACGAGCT TCAAACCATT CTTCGACAAG GTAAACCTGG GGGTGAACCC GGATTGGCAG AGCTTCATGT ACGGCTCGAC GCTCGCGATG GCGTCGTACC TCGGGATAGA GGTAATCTCG CAGACGGCCG AGGAGACCAG GAGGGCGGGG GCTACGATAC CGAGGGCTGT GAAGCTCGTG AGCGTAGTTG TCATCTTCTT CGCGCTTCTC TTCTCGACGC TCGCTGTCGG CACTGTCGGG TGGGAGGTTC TCGCCGCCTC CCAGAAGGAC CCGGCGGCTG TGGTGGCGGA GCACCTGCCT TACGGATCGG TGCTCGCACT GTGGGTCTCC GTGATAGGTA TGACGGTCTG CTACGCCGCG ACGAACACCG GGATCGTGGG GGTGTCGAGG ATGGTTTACG CGATGGGGAG GGAGGGGATG CTTCCCCGCT GGTTGACGGA GCTACACGGC CGCTTCAAGA CCCCCTACAG GGCTATAGTG GTCTTCGCGG TAATCCAGCT ACTGCTAGCC TACGTCGGGC ACCTCGGGTT AGCGGCAGAC CTCTACAACT TCGGCGCCCT GCTATCCTAC ATGGTCGTCA ACCTCTCGGT CCTGGCGCTC CGCGTTAAGG ACCCGCACAG GTACAGGCCG TACAAGGTTC CGGGCAACGT CCCGCTCAGA GTGGGCGGCA GGAAGGTGTA CGTGCCCCTG GGGGCCGTCC TAGGCTTCCT GACGAACTTG GCGATGTGGC TCATGGTGGT ATCCACGCAC AAGGAGGGCA GGCTCGTAGG GTTCGCCTGG CTCCTCGCAG GCCTCCTGGT CTACGCCGTT TACTCTAGGA GGCGCCGAGC CGAGCCCCTC ACTCCCTCGT CGCCCTGA
|
Protein sequence | MGEQRLRRRI GLLEAFSFGY ADVGAGIYMT LGLVAAYAGP ATPLAFAVAS VSYLFTALSY AELSAAYPEA GGGMVFADRA FGRLAAFIAG WSLLLDYVVT GSIFALSTTG YLGHLFPLLK RDEFFGPVAA LLVFFLVVLN ILGIRESAAF SSALVLLDIA GLSVIMGIGY LTSFKPFFDK VNLGVNPDWQ SFMYGSTLAM ASYLGIEVIS QTAEETRRAG ATIPRAVKLV SVVVIFFALL FSTLAVGTVG WEVLAASQKD PAAVVAEHLP YGSVLALWVS VIGMTVCYAA TNTGIVGVSR MVYAMGREGM LPRWLTELHG RFKTPYRAIV VFAVIQLLLA YVGHLGLAAD LYNFGALLSY MVVNLSVLAL RVKDPHRYRP YKVPGNVPLR VGGRKVYVPL GAVLGFLTNL AMWLMVVSTH KEGRLVGFAW LLAGLLVYAV YSRRRRAEPL TPSSP
|
| |