Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0191 |
Symbol | |
ID | 4600624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 168513 |
End bp | 169661 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639772945 |
Product | formate/nitrite transporter |
Protein accession | YP_919604 |
Protein GI | 119719109 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2116] Formate/nitrite family of transporters |
TIGRFAM ID | [TIGR00790] formate/nitrite transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCG AAGAGAAATT CACCCCATAC GCTGGGGTTG ACGAAACCGT TGACGCCGCG GCAAGCGCAG CGGTCGTAAA GGCCTTTCAG CCCCCGGGCC GCCTATTCCT GATGGGCATA ATGGCGGGTA TATTCATTGG GATAGGCTTC TGGCTGGCAG TCACTGTTTC CTCTGCGTTC TGGACAACGA AGGTTACGGG CTTCGACGCC GCCACTCATA AGCTTGTCAC TGAGCCTTTC AACGTAGCGT GGCCCCTCAA CCCCTCAGCA ATGATGAAGT TCCTGCTGGG CGCCGTGTTC CCTGTAGGCT TAATCGCTGT GTGCATCGGC GGGGCGGAGC TCTGGACCGG GTGCGCGAAC GTAATACCGC TGGGCTACAT GCAGAAGAAG CTCAAGCTGA AAGCGCTAAT CTACAACTGG GTTACCGCGT ACGGAGGCAA CTGGGTGGGT AGCGTATTCC TAGCGTTCCT CGCCACTTAC GGCTCGACGC TACTCCTGGC CTCACCGTTC CGCGACGAAC TGATCTCCGT AGTTTGGGCG AAAGTTAACC TCTCGCCCTG GGAGGCTTTC TGGCGCGGCG TGGGATGCAA CTTCCTAGTA AACCTTGCGA TATGGCTCTG GCTGAGGTCT AAAAAGGGAG ATTTCATGGG ACAAGCCTTC CTGATATGGT TCCCGATATT CACCTTCGTA ACCATAGGCT TCGAGCACAG CATTGCCAAC ATGTTCCTAA TACCTGCCGC TATATTCGCC TCCCCGCTCG CCCTCAAGCA GTACATAATA ACATACTATG ACTTCTTCTT CAACAACCTA CTCCCGGTGA CTTACGGTAA CCTTGTAGGA GGCTTTGTCT TCATAGCGCT GGTTTACTGG TACGTAGGAA TGGTTAAGGG CAGTAAGTAC GGCGAAGCTA CGCCCACCGA CGCGCTCAAG TACGCCGCCG AGATACTACT CTTGGCGGGC ATCGTGCACC ATGTACTCGA AGTCGCTGTC CCCGGAGCTA TCGCCGTAGC CGTAGAGAAA GCTTTAGGGC TTAGTGCAGG CATAAACCTG ACGAACGCGG GTATGGCTCT CATACCGGCA GTGATCACAG GTATTTACTA CGCTCTACTA CCCTTTATAG TATACAAGGC TCTGAAGCCA TTAAAGTAA
|
Protein sequence | MATEEKFTPY AGVDETVDAA ASAAVVKAFQ PPGRLFLMGI MAGIFIGIGF WLAVTVSSAF WTTKVTGFDA ATHKLVTEPF NVAWPLNPSA MMKFLLGAVF PVGLIAVCIG GAELWTGCAN VIPLGYMQKK LKLKALIYNW VTAYGGNWVG SVFLAFLATY GSTLLLASPF RDELISVVWA KVNLSPWEAF WRGVGCNFLV NLAIWLWLRS KKGDFMGQAF LIWFPIFTFV TIGFEHSIAN MFLIPAAIFA SPLALKQYII TYYDFFFNNL LPVTYGNLVG GFVFIALVYW YVGMVKGSKY GEATPTDALK YAAEILLLAG IVHHVLEVAV PGAIAVAVEK ALGLSAGINL TNAGMALIPA VITGIYYALL PFIVYKALKP LK
|
| |