Gene Tpen_1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1640 
Symbol 
ID4600919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1587755 
End bp1589266 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content64% 
IMG OID639774413 
Productamino acid permease-associated region 
Protein accessionYP_921038 
Protein GI119720543 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.469925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAGC TACGGCGCGA GCTGGGGTTA CTGGAGCTTG TAGGTATCAG CGTGGGCGGG 
ATAATAGGCT CGGGTATATT TATGATGCCT TCCCTAACCC TCTCGACGGC AGGTCTCAGC
GCGTTGCTCG CCTGGATCCT CGCCGGGGTA GCGATGACCG TCGTCGCCCT GGTTTTCGCC
GAGCTAGGGT CAGCCTTCGG GGATACAGGT GGGCCCTACG TCTACGCGCG GGCCGCTTTC
GGGCGGACCG TGGGCTTCCT GGTGGGCTGG GGCTACTACG TCTCCTGCGT TCTAACCGTC
TCAGCCGTTA CCGCGGCGTT TGTAAGCTAC CTTGGCTTCT TCGTGCCGGG GCTCGTTGAG
GGTCAGCGGT TAACGCCCGT AGGAGTTCTG GCGGGGCTAG CCTTCCTCTG GTTCCTCACT
TTGCTGAACT ACGTCGGCGT GAAGTACGGG GGGCTCTACG CCTCCGCGAC GACTCTGCTC
AAGGTGTTCG CACTGGCTAT CTTCGCGGCG GCGGGCCTCG CCTACCCGGA GCCCTCGCGG
TTCAACGTGC TGGAGGGGCT GGACGCCGTC TCGCTTGGGC TCGCCGTCTC CCTGGCTGTC
TGGCCGTACA TGGGCTTTGA GAGCGTAACG ATACCGGTCG AGGAGGTGAA GAAGCCTCAG
AGAGACGTAC CACTCTCGAT AATCCTCTCG ATGGGTATCG TCACGGCGGT CTACGTGTTG
ATAGTGCTGT CGTTCCTGAG CCTCCTGGAC TGGAAGTCGC TGGGGCTCGC GCAGGGGGAC
TGGGGGTCCC TGGCGAACCT CTCCTCGCCT TTATCCGACG TCGCGAGGTC GAGGGGGCTA
CTCGCGGTGG CAGTAGCGGT GATGCTGGGC GCGGTCTTCT CGACGGGCGG CGCCAGCGGG
GTGTGGCTTC TCGACAGCGG GCGCTTCCCC TACGCCTTCT CGCAGGCGGG GGACCTTCCC
AAGGTCCTCG GGAAGGTTCA CGACCGCTAC GGTACCCCGC ACGTAGGGCT CCTGCTCTCG
TCTATCGTGG CGAGCGCCGT GCTCGTACTC CTGCCGCTCT TCCCGGCGAT CGCGCTTCTC
GGCGTGATGA CCTCGATAGT CCCGTACGCC GTGTCGTCCC TCGGGCTCGC CGTGCTACGC
GGAAGGGGCG ACTACAGGCC AGCCTTCAGG AACCCCTTCG GGAAGCTCGT TGCCTACGTC
TCCTTCGTTT TCTCGACACT CGTGGCGTAC TGGTCCTGCT GGCCGTGGAC CCTCGTAGGA
GCCCTGCTGA CACTCGCCGT CGTCCCCGTC TTTGCGCGCG TAGCCGGGGT GGGGCTTAGG
AGGGAGGACC TCTGGTACTT CGCCTACCTC TCGGGCTTGA GCGTCGTCTC CCTCCTCGGG
GACCCGTACT TCGAGTACTA CAACTTCCTC CCGGTGCACC CCCTGGGAGT CTTCAAGACC
CCGGTGGACA TAGCCGTCCT GGTGGTCTTC GCGTCGGCGT TCTTCTTCTA CGTGGCGAGG
ACCCGGCGGT AG
 
Protein sequence
MSKLRRELGL LELVGISVGG IIGSGIFMMP SLTLSTAGLS ALLAWILAGV AMTVVALVFA 
ELGSAFGDTG GPYVYARAAF GRTVGFLVGW GYYVSCVLTV SAVTAAFVSY LGFFVPGLVE
GQRLTPVGVL AGLAFLWFLT LLNYVGVKYG GLYASATTLL KVFALAIFAA AGLAYPEPSR
FNVLEGLDAV SLGLAVSLAV WPYMGFESVT IPVEEVKKPQ RDVPLSIILS MGIVTAVYVL
IVLSFLSLLD WKSLGLAQGD WGSLANLSSP LSDVARSRGL LAVAVAVMLG AVFSTGGASG
VWLLDSGRFP YAFSQAGDLP KVLGKVHDRY GTPHVGLLLS SIVASAVLVL LPLFPAIALL
GVMTSIVPYA VSSLGLAVLR GRGDYRPAFR NPFGKLVAYV SFVFSTLVAY WSCWPWTLVG
ALLTLAVVPV FARVAGVGLR REDLWYFAYL SGLSVVSLLG DPYFEYYNFL PVHPLGVFKT
PVDIAVLVVF ASAFFFYVAR TRR