Gene Tpen_1385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1385 
Symbol 
ID4600668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1339670 
End bp1340887 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content68% 
IMG OID639774160 
Productmajor facilitator transporter 
Protein accessionYP_920785 
Protein GI119720290 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0632886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGG AGAGGAGGTT CAGGTTCGCG GTCTCCTACG TACCCGTCTT CGTTGCCAGG 
ATAGGGTCCG GCGCGAACAC GTTCCTGGTA GCCATGCTCG CGAGCGGCGG GGACGTCGCG
GCGGGCTTCG TTATGGCGGC GTACCCGCTC ATGGAGGCTA TAGGCGCCCT ACTCGCCGGC
TCGTGGTCCG ACTTCGCGGG GAGGAAGCGC ACGCTGATAG CAGGCTACGT GGCCAGGTCG
GCCGGGATGC TCTCCCTGGC GGCGGTATTC GCGCTTTCGA GCTCTTCCCC CGGGAGCCTT
GTGTTCGCCG CGCTGGAGGC CGCCTTGAAC GGCGTCGTCG GCTTTACAAC AGCCCTGATA
CTCGTCTCTT CGCTCTCGAT GGCCACCGAC CTCACCGAGA CGGGGAACAG GGGGCTTGGG
ATGGGGGGCT TCGAGTTCGT GAACCTCGCG AGCTACGGCG CCGGGTACCT CCTGGGCTCG
CTTCTCTACA GCGTGCTCCC CGGCTACCCC GCCTACCTAG CCGTGGCGCT CCTCACGGCC
CTGGCGACCC CGGTCTTCGC GGCGTTCCTC GAGGAGACGA GGCCCCCCGT CCCCCTGGTG
AGGAGGGGTC TTCTCTCCAG CCTACCGAGG TCGGCGCTCG TACTGCTACC CGTCTGGGTG
GCGCTCACGA CGATAATAGG CATCGGTATC TACGCGCCCA GGGTTATCCA CGACCACCTA
GGCGCGGTGC ACGGCCCAGC AATCGGTAAG GCCGGGATAG GCCTGCTGTT CCTCGGCGCC
CTCGTGCTCC TCGGCTCCGG CGCGGTGTTC TTCGGGAGGC TCTCGGACTC CTGGGGGCGC
GTCAAGGTAT TCAGGCTCGG CCTAGCCGGG GGCCTCGCGG CGCTTTCAGC CATGAGCGTC
CTCCTCGCCC TCGGCGTAGA CATCCTCAGG GCTGCCGCCG CCGTAGCCCC CCTGATGTTC
CTCACGTCCG CCGTAGGGCC CACGATACTC GCGCTGGTCG GGGACCAAGC GTCCACGGAT
GCCCGCGGCA GGGTGATGGG CGTGTACAGC GTGGTCCTAG GGCTCGGGAT GGGGCTCGGG
AGCATACTGG CCGGGCTCGC CTCCAGCGCC CTCCCGTACA ACAGGCCCCT TGCCCTCTCG
CTCGTAGCGC TCGCGGCTTA CTCGGTGGCC GCGCTCGCGC ACCTCTACCT GGAGAGGCGG
CTCGGCGGCT CCCTGTAG
 
Protein sequence
MSAERRFRFA VSYVPVFVAR IGSGANTFLV AMLASGGDVA AGFVMAAYPL MEAIGALLAG 
SWSDFAGRKR TLIAGYVARS AGMLSLAAVF ALSSSSPGSL VFAALEAALN GVVGFTTALI
LVSSLSMATD LTETGNRGLG MGGFEFVNLA SYGAGYLLGS LLYSVLPGYP AYLAVALLTA
LATPVFAAFL EETRPPVPLV RRGLLSSLPR SALVLLPVWV ALTTIIGIGI YAPRVIHDHL
GAVHGPAIGK AGIGLLFLGA LVLLGSGAVF FGRLSDSWGR VKVFRLGLAG GLAALSAMSV
LLALGVDILR AAAAVAPLMF LTSAVGPTIL ALVGDQASTD ARGRVMGVYS VVLGLGMGLG
SILAGLASSA LPYNRPLALS LVALAAYSVA ALAHLYLERR LGGSL