Gene Tpen_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0191 
Symbol 
ID4600624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp168513 
End bp169661 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content54% 
IMG OID639772945 
Productformate/nitrite transporter 
Protein accessionYP_919604 
Protein GI119719109 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2116] Formate/nitrite family of transporters 
TIGRFAM ID[TIGR00790] formate/nitrite transporter 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCG AAGAGAAATT CACCCCATAC GCTGGGGTTG ACGAAACCGT TGACGCCGCG 
GCAAGCGCAG CGGTCGTAAA GGCCTTTCAG CCCCCGGGCC GCCTATTCCT GATGGGCATA
ATGGCGGGTA TATTCATTGG GATAGGCTTC TGGCTGGCAG TCACTGTTTC CTCTGCGTTC
TGGACAACGA AGGTTACGGG CTTCGACGCC GCCACTCATA AGCTTGTCAC TGAGCCTTTC
AACGTAGCGT GGCCCCTCAA CCCCTCAGCA ATGATGAAGT TCCTGCTGGG CGCCGTGTTC
CCTGTAGGCT TAATCGCTGT GTGCATCGGC GGGGCGGAGC TCTGGACCGG GTGCGCGAAC
GTAATACCGC TGGGCTACAT GCAGAAGAAG CTCAAGCTGA AAGCGCTAAT CTACAACTGG
GTTACCGCGT ACGGAGGCAA CTGGGTGGGT AGCGTATTCC TAGCGTTCCT CGCCACTTAC
GGCTCGACGC TACTCCTGGC CTCACCGTTC CGCGACGAAC TGATCTCCGT AGTTTGGGCG
AAAGTTAACC TCTCGCCCTG GGAGGCTTTC TGGCGCGGCG TGGGATGCAA CTTCCTAGTA
AACCTTGCGA TATGGCTCTG GCTGAGGTCT AAAAAGGGAG ATTTCATGGG ACAAGCCTTC
CTGATATGGT TCCCGATATT CACCTTCGTA ACCATAGGCT TCGAGCACAG CATTGCCAAC
ATGTTCCTAA TACCTGCCGC TATATTCGCC TCCCCGCTCG CCCTCAAGCA GTACATAATA
ACATACTATG ACTTCTTCTT CAACAACCTA CTCCCGGTGA CTTACGGTAA CCTTGTAGGA
GGCTTTGTCT TCATAGCGCT GGTTTACTGG TACGTAGGAA TGGTTAAGGG CAGTAAGTAC
GGCGAAGCTA CGCCCACCGA CGCGCTCAAG TACGCCGCCG AGATACTACT CTTGGCGGGC
ATCGTGCACC ATGTACTCGA AGTCGCTGTC CCCGGAGCTA TCGCCGTAGC CGTAGAGAAA
GCTTTAGGGC TTAGTGCAGG CATAAACCTG ACGAACGCGG GTATGGCTCT CATACCGGCA
GTGATCACAG GTATTTACTA CGCTCTACTA CCCTTTATAG TATACAAGGC TCTGAAGCCA
TTAAAGTAA
 
Protein sequence
MATEEKFTPY AGVDETVDAA ASAAVVKAFQ PPGRLFLMGI MAGIFIGIGF WLAVTVSSAF 
WTTKVTGFDA ATHKLVTEPF NVAWPLNPSA MMKFLLGAVF PVGLIAVCIG GAELWTGCAN
VIPLGYMQKK LKLKALIYNW VTAYGGNWVG SVFLAFLATY GSTLLLASPF RDELISVVWA
KVNLSPWEAF WRGVGCNFLV NLAIWLWLRS KKGDFMGQAF LIWFPIFTFV TIGFEHSIAN
MFLIPAAIFA SPLALKQYII TYYDFFFNNL LPVTYGNLVG GFVFIALVYW YVGMVKGSKY
GEATPTDALK YAAEILLLAG IVHHVLEVAV PGAIAVAVEK ALGLSAGINL TNAGMALIPA
VITGIYYALL PFIVYKALKP LK