Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1547 |
Symbol | |
ID | 4600840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1495826 |
End bp | 1496887 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774321 |
Product | ABC transporter related |
Protein accession | YP_920946 |
Protein GI | 119720451 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3839] ABC-type sugar transport systems, ATPase components |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGGGG TTAGGCTGGA GCACGTTACC AAGCGGTACG GGAAGGTAGT CGCGGTGGAC GACGTTAGCC TGGAGGTCAA GGAGGGAGAG TTCTTCGTAC TCCTGGGGCC CAGCGGTTGC GGCAAGACAA CGACTCTCCG AATAATCGCG GGCCTAGAGG AGCCGGACGA GGGGAGGGTG TTCTTCGGAG AGGAGGACGT GACGAGGCTA CCCCCCGGCA AGAGGAAGAT CTCGATGGTG TTCCAGAGCT ACGCCGTGTG GCCGCACATG AAGGTGTACG ATAACATTGC GTTACCACTC AAGGTGCAGG GCTACCCTCC GGAGGAGATC GAGCGCAGGG TTAGGGAGGC GGCGCGGCTC GTGCAGATAG AGGATCTCCT CGACAGGTAC CCGCAGCAAC TCTCGGGTGG GCAGAGACAG AGGGTCGCGG TTGCCAGGGC GCTGGCAGTG ACGCCGCGCG TACTGCTCAT GGACGAGCCT CTGAGCAACC TCGATGCCCT CCTCAGGGTG CAGGCGAGGG CGGAGCTAAA AAGGCTCCAG CGCGATACCC GCCTCACCAC GATCTACGTA ACGCACGACC AGGTGGAAGC AATGGTTCTA GCCGACAGGG TAGCGGTAAT GAACAGAGGA AGGGTACTCC AAGTGGGACC CCCGGAGGAG ATATACAGCA AGCCTTCGAG CAGGTTTGTC GCGCACTTCA TAGGGTCGCC CCCCATAAAC CTCTTCGAAG GCACGGTTAC GCCTAGAGGT GTCGACGTCG GCTTCGTGGT TCTGCCTGTG GGCGCGGGGA GTCTCGAGGA GTACAGCGGC AAGAGGGTCA TAGTGGGCAT AAGGCCCGAG GACGTCCATT TAACGGCCTC CCCAGGCTTC ATAGAAGTCG AGGGAAGCGT CTGGGTCACG GAGAACCTGG GAGGAGAGTA CGTAGTGCTG GTAAGCGTGG GCGATGTCAT CCTCAGGGCG AGGAGTCGAG AGAAGCCCGA GTCGGAGAGA GTCAAGGTAT ACCTTGACCC CTCGAAGCTA CACTTCTTCG ACAAAGAGAC AGAGGCCAGG ATCTTCAGCT AA
|
Protein sequence | MVGVRLEHVT KRYGKVVAVD DVSLEVKEGE FFVLLGPSGC GKTTTLRIIA GLEEPDEGRV FFGEEDVTRL PPGKRKISMV FQSYAVWPHM KVYDNIALPL KVQGYPPEEI ERRVREAARL VQIEDLLDRY PQQLSGGQRQ RVAVARALAV TPRVLLMDEP LSNLDALLRV QARAELKRLQ RDTRLTTIYV THDQVEAMVL ADRVAVMNRG RVLQVGPPEE IYSKPSSRFV AHFIGSPPIN LFEGTVTPRG VDVGFVVLPV GAGSLEEYSG KRVIVGIRPE DVHLTASPGF IEVEGSVWVT ENLGGEYVVL VSVGDVILRA RSREKPESER VKVYLDPSKL HFFDKETEAR IFS
|
| |