Gene Information |
Locus tag | Tpen_1246 |
Symbol | |
ID | 4600541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1181816 |
End bp | 1182874 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774022 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_920647 |
Protein GI | 119720152 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
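The coordinates and lengths in this record are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp / End bp fields of this record.
length_bp = gene_length(1181816, 1182874)
print(length_bp, protein_length(length_bp))
```

For Tpen_1246 this gives 1059 bp and 352 aa, matching the Gene Length and Protein Length fields.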
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCTAG CTAGCTATCT GGTGAGGAGG CTTGCAACGT TTCTCCCGAG CGTCCTCGGG GCGCTCCTCA TAACCTACCT CATAGCCTAC GTTATCCCCA CGGACCCCGT GAGGGCGTGG GTAGGGGAGA AGCTCATGGA CCCCTCGACC CTAGAGAGGC TGAGGAAGGA GTACAAGTTC GACGCGCCGT GGTACGAGCA GTTCGCCTTC CTGGTCGAGA AGCTCCTAAC GGGCACGCTC GTAGACCCCA CCAGGGGTAT CCCCGTCGTC CAGCAGGTGG CGCAGAGGTT CCCGATAACC GTCGAGCTGG CTATATTCGG CATGCTGTTC ACAGTGGCTA TAGGCATACC GCTGGGGATA CTGGCAGCGG CGAAGAAGGA CAGCTTCGTG GACTTCTTCG TAAGGGTATT CGCGCTCTTC GGGAGCTCCA TGCCGGCCTT CGTGCTCTAC TACTTCCTGA TACTGGCGTT CTACGTCTAC GTGAGAGCCT CGCTACTAGC CGGGGTTCCC TCCCTGTCTC CAGCGTGCGC CGCCAGCCTG GACTCCGTCA GGAACGCGGT TCCCCTCCTG GGCTACGTCG TGTGGGCGGT AGGCCAGGTC CCGATGTTCG GCGGTCTCAT GTGCGGGGAT CTGGGGGTTG TCTCTGCGAC GTTCGTTAGG ATGTGGCTTC CGGGGTTGGC GCTGGGGCTC CTCTCCGGCG GCTTCATAGC GAGGATAGTT AGGAACAGCT TGCTCGACGC GCTGAGTTCG GACGCGATCC TCTTTGCAAG GGCAAGGGGC CTTACGAGCG GCAGGATATG GCGGCACGCC TTGAAGAACG CGTTCGCGCC TATAGTCACG ATTCTCGGCC TCAACTTCGC CGGCCTGCTC ACGGGCGCCG TGATAGCGGA GACTGTCTTC AATATCCCCG GCATGGGGCT CTACATGTAC CAGGGGATCA CGAGGCTGAA CTTCCCGATA ATAATCGCCG GGACGTTCAT ATTCTCGGTG ATATACATCG TGATGAACCT CCTGGTAGAC CTCGTCTACG CGCTGATAGA CCCGCGCGTC AGGTACTAG
|
Protein sequence | MGLASYLVRR LATFLPSVLG ALLITYLIAY VIPTDPVRAW VGEKLMDPST LERLRKEYKF DAPWYEQFAF LVEKLLTGTL VDPTRGIPVV QQVAQRFPIT VELAIFGMLF TVAIGIPLGI LAAAKKDSFV DFFVRVFALF GSSMPAFVLY YFLILAFYVY VRASLLAGVP SLSPACAASL DSVRNAVPLL GYVVWAVGQV PMFGGLMCGD LGVVSATFVR MWLPGLALGL LSGGFIARIV RNSLLDALSS DAILFARARG LTSGRIWRHA LKNAFAPIVT ILGLNFAGLL TGAVIAETVF NIPGMGLYMY QGITRLNFPI IIAGTFIFSV IYIVMNLLVD LVYALIDPRV RY
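The gene and protein sequences above can be cross-checked with a short sketch that strips the 10-base spacing, computes GC content, and translates the CDS. The helper names are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-residue grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 9 bases of the gene sequence in this record.
print(translate("ATGGGTCTAG CTAGCTATCT GGTGAGGAGG"))
print(translate("AGGTACTAG"))
```

The two calls print `MGLASYLVRR` and `RY`, which agree with the first ten and last two residues of the listed protein. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.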