Gene Information |
Locus tag | Tpen_1177 |
Symbol | |
ID | 4602043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1118351 |
End bp | 1119421 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773953 |
Product | ABC transporter related |
Protein accession | YP_920578 |
Protein GI | 119720083 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3839] ABC-type sugar transport systems, ATPase components |
TIGRFAM ID | |
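The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, an unspliced gene, and a gene length that includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Tpen_1177).
length = gene_length_bp(1118351, 1119421)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```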
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.675733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAGGG TTGTACTGGA AAACGTCTCG AAAACCTTTA AGGGAGGAGT AAACGCCGTG AAGAACCTCA ACCTCACGAT TAACGACAAG GAGTTCATGG TCCTTCTGGG ACCCTCGGGT TGCGGCAAGA CCACCACTCT CCTCATGATC GCGGGAGTAT ACAAGCCGAC GAGCGGCTAC ATATACTTCG ACGACAGGAT AGTGAACGAC CTCGAACCGA AGGATCGCAA CGTAGGCATG GTCTTCCAGA GCTACGCCCT CTACCCGCAC ATGACCGTCT ACGAGAACAT CGCCTTCCCG CTGAAGCTGA AGAAGCTCCC GAAGGGGGAG ATAGACAGGA GAGTGAAGGA GGTAGCCTCA ATGCTTAGGA TCGACAACCT GCTGGACAGG TACCCGAGGC AGCTGAGCGG CGGGCAACAG CAGAGAGTAG CGCTCGCGAG GGCTATAGCC AAGCAACCCG ACATCTTCCT GATGGACGAG CCCCTCAGCA ACCTCGACGC GAAGATACGC GTCGAAGTGA GGGCTGAGCT GAAGAGGCTT CAGAGAGAGC TCGGTATAAC AACGATCTAC GTTACCCACG ACCAAGCGGA AGCGCTAAGC CTGGCCGACA GGATAGCCGT AATGAACGAG GGCGTCCTCC AGCAGGTAGG CACCCCGGAC GACCTCTACA ACAGGCCCGC GAACACCTTC GTTGCAGGCT TCATAGGCTC GCCTGCCGCC AACCTGGTGG ACGCCGACGT AGTCGAGGCC GGCGGGGAGT ACTACCTGGA GATGCTCGGG TCGAGGTTTA AGTTACCGAG CGACCTAGCG GCGATCGTCA AGGGTGAGAG CAGGGTCATC TTCATGGTGA GACCGGAGGA CGTGAAGGTC GTCGAGGGTC AGGGCTTCCT CGTGTACTCC GTCGAGTGGC TGGGAAGGGA GGCTTTAGCG CACGTAAGGG CCCCCGACGG CACGTTGCTC AGAGTGCTAC TCCCGCCGGA GTCGAAGCTC ACAATAGGCG CGGAGGTATC GGTAACCTTT AACTACGCCA AGGTGCACGT GTACAAACCC TCGGGCGAGC TCATAGCTTA A
|
Protein sequence | MVRVVLENVS KTFKGGVNAV KNLNLTINDK EFMVLLGPSG CGKTTTLLMI AGVYKPTSGY IYFDDRIVND LEPKDRNVGM VFQSYALYPH MTVYENIAFP LKLKKLPKGE IDRRVKEVAS MLRIDNLLDR YPRQLSGGQQ QRVALARAIA KQPDIFLMDE PLSNLDAKIR VEVRAELKRL QRELGITTIY VTHDQAEALS LADRIAVMNE GVLQQVGTPD DLYNRPANTF VAGFIGSPAA NLVDADVVEA GGEYYLEMLG SRFKLPSDLA AIVKGESRVI FMVRPEDVKV VEGQGFLVYS VEWLGREALA HVRAPDGTLL RVLLPPESKL TIGAEVSVTF NYAKVHVYKP SGELIA
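The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG. Only the first 30 bases of the gene are used; the helper names (clean, translate, gc_fraction) are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
prefix = "ATGGTTAGGG TTGTACTGGA AAACGTCTCG"
print(translate(prefix))  # MVRVVLENVS, the first block of the protein sequence
```

Passing the full gene sequence to gc_fraction would give the value that the GC content field reports as a rounded percentage.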