Gene Tpen_1196 details


Gene Information

Locus tag: Tpen_1196
Symbol:
ID: 4600388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1134804
End bp: 1136171
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 62%
IMG OID: 639773972
Product: amino acid permease-associated region
Protein accession: YP_920597
Protein GI: 119720102
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
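
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1134804
END_BP = 1136171


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1368
print(protein_length(length_bp))  # 455
```

Both results match the Gene Length (1368 bp) and Protein Length (455 aa) reported in the record.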


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGAGC AGAGGCTACG TAGAAGGATC GGGTTGCTCG AGGCCTTCAG CTTCGGCTAC 
GCGGACGTAG GGGCAGGTAT CTACATGACC CTGGGGCTCG TCGCCGCCTA CGCCGGGCCA
GCGACACCCC TAGCATTCGC AGTCGCGTCG GTGTCGTACC TTTTCACGGC TCTCAGCTAC
GCGGAGCTCA GCGCAGCCTA CCCTGAGGCT GGGGGCGGGA TGGTATTCGC GGATAGGGCT
TTTGGAAGGC TGGCCGCTTT CATAGCCGGG TGGAGCCTCC TGCTGGACTA CGTCGTTACC
GGCTCTATAT TCGCTCTCTC GACTACGGGC TACCTGGGGC ACCTCTTCCC CTTGCTGAAG
CGGGACGAGT TCTTCGGGCC CGTAGCGGCT CTGCTCGTCT TCTTCCTCGT CGTGCTGAAC
ATTCTCGGCA TCAGGGAGTC CGCGGCCTTT AGCTCTGCGC TAGTCCTGCT CGACATCGCC
GGTCTAAGCG TGATAATGGG TATAGGCTAC CTGACGAGCT TCAAACCATT CTTCGACAAG
GTAAACCTGG GGGTGAACCC GGATTGGCAG AGCTTCATGT ACGGCTCGAC GCTCGCGATG
GCGTCGTACC TCGGGATAGA GGTAATCTCG CAGACGGCCG AGGAGACCAG GAGGGCGGGG
GCTACGATAC CGAGGGCTGT GAAGCTCGTG AGCGTAGTTG TCATCTTCTT CGCGCTTCTC
TTCTCGACGC TCGCTGTCGG CACTGTCGGG TGGGAGGTTC TCGCCGCCTC CCAGAAGGAC
CCGGCGGCTG TGGTGGCGGA GCACCTGCCT TACGGATCGG TGCTCGCACT GTGGGTCTCC
GTGATAGGTA TGACGGTCTG CTACGCCGCG ACGAACACCG GGATCGTGGG GGTGTCGAGG
ATGGTTTACG CGATGGGGAG GGAGGGGATG CTTCCCCGCT GGTTGACGGA GCTACACGGC
CGCTTCAAGA CCCCCTACAG GGCTATAGTG GTCTTCGCGG TAATCCAGCT ACTGCTAGCC
TACGTCGGGC ACCTCGGGTT AGCGGCAGAC CTCTACAACT TCGGCGCCCT GCTATCCTAC
ATGGTCGTCA ACCTCTCGGT CCTGGCGCTC CGCGTTAAGG ACCCGCACAG GTACAGGCCG
TACAAGGTTC CGGGCAACGT CCCGCTCAGA GTGGGCGGCA GGAAGGTGTA CGTGCCCCTG
GGGGCCGTCC TAGGCTTCCT GACGAACTTG GCGATGTGGC TCATGGTGGT ATCCACGCAC
AAGGAGGGCA GGCTCGTAGG GTTCGCCTGG CTCCTCGCAG GCCTCCTGGT CTACGCCGTT
TACTCTAGGA GGCGCCGAGC CGAGCCCCTC ACTCCCTCGT CGCCCTGA
 
Protein sequence
MGEQRLRRRI GLLEAFSFGY ADVGAGIYMT LGLVAAYAGP ATPLAFAVAS VSYLFTALSY 
AELSAAYPEA GGGMVFADRA FGRLAAFIAG WSLLLDYVVT GSIFALSTTG YLGHLFPLLK
RDEFFGPVAA LLVFFLVVLN ILGIRESAAF SSALVLLDIA GLSVIMGIGY LTSFKPFFDK
VNLGVNPDWQ SFMYGSTLAM ASYLGIEVIS QTAEETRRAG ATIPRAVKLV SVVVIFFALL
FSTLAVGTVG WEVLAASQKD PAAVVAEHLP YGSVLALWVS VIGMTVCYAA TNTGIVGVSR
MVYAMGREGM LPRWLTELHG RFKTPYRAIV VFAVIQLLLA YVGHLGLAAD LYNFGALLSY
MVVNLSVLAL RVKDPHRYRP YKVPGNVPLR VGGRKVYVPL GAVLGFLTNL AMWLMVVSTH
KEGRLVGFAW LLAGLLVYAV YSRRRRAEPL TPSSP
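
As a rough cross-check between the two sequences above, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and the codon table is built by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons).

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# assumed here to be the same as those of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA in frame 1; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
HEAD = "ATGGGGGAGC AGAGGCTACG TAGAAGGATC GGGTTGCTCG AGGCCTTCAG CTTCGGCTAC"
TAIL = "TACTCTAGGA GGCGCCGAGC CGAGCCCCTC ACTCCCTCGT CGCCCTGA"

print(translate(HEAD))  # MGEQRLRRRIGLLEAFSFGY
print(translate(TAIL))  # YSRRRRAEPLTPSSP
```

The two printed strings agree with the first 20 and last 15 residues of the protein sequence. The tail can be translated on its own because the 1320 bases preceding it are a multiple of three, so it starts in frame.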