Gene Tpen_0053 details


Gene Information

Locus tag: Tpen_0053
Symbol:
ID: 4600533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 39606
End bp: 40943
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 65%
IMG OID: 639772806
Product: amino acid permease-associated region
Protein accession: YP_919466
Protein GI: 119718971
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGCGGC TTAAAAGAGA GCTTGGGCTG GGCTACGCGA CGCTGTTCGG CGTTGGGTTG 
ATACTCGGCG CCGGTATATA CGTCTTGATC GGGAGGGCTG CGGGCATCGT GGGCGACGCT
GTGTGGGCTA GTGTCGCGTT CTCCGGCGTC ATAGCCGTTG CGACGGCTTT TTCTTACGCG
GAGCTTTCCT CTATCTTCGC GAAGGCTGCG AGCACGTATA CCTACGTGAG GGAGGCTTTC
CCGGGGGCTG GGCTTCTGGC TTTCTTCGCC GCGTGGATGC TGTTCTTCGG CGGTGTCGCG
GGGGCGGCTA CGGCGGCGCT GGGCTTTGCC GGGTACTTCT CCAGGCTGGC GGGGCTGGGG
GAGGGCTGGG TGGTCCCGGT CACCGTGGCC CTGCTCGCCG TGCTGTCTGT GCTGAACTGG
TGGGGGATTA AGGAGTCGGC TTTCCTGAGC GCTGTCTTCA CGGTGATAGA GGCGGGCGGG
CTCGTGCTCG TGGTGTTGCT GGGCTTCCTC TTCCCGGCGA GGAGCCCCGG CTACCTCTCC
TTCAACCCCT CCGTCGACCC CGTGATGGCT GTACTCGTGG GCGCCGCCGT GTTCTACTTC
GCGTACACGG GCTTCGAGTA CCAGCCGACG CTCAGCGAGG AGACCGTGGA CCCCGAGAGG
GTTATCCCGA AGTCCATAGT CCTAGCGGTC TCCGTGACGA CGTTGCTGTA CCTGCTCGTC
TCCCTGTCTG TCGTCAGGCT TATGAGCTGG GAGGAGCTCG GCGCCAGTAA GGCCCCGATG
GCTGACGCGG CTTCGAGGGC GTGGCCCCCC GCGTTCCACC TCTTGATGTT CATAGCCCTC
TTCGCCACGA CTAACACCTC GCTCGGCTTC CTCGTCTCGG CCTCGCGGCT AGCCTACGGG
CTGGCGGAGG AGGGGGTTGC GTGGAGCGGC TTCGGGAGGG TTGACGGGTG GAGGAGGACT
CCTCACGTAG CCGTCGCCTT CACGGGTGTT CTCGCCGCCC TGCTCGTCTT CGCGACGGAC
TACCTGCCCA GCGTCACCGG GTGGAGGCTG AGCTTCGGGG GGCAGGAGTA CCAGCTGATA
GACCTCGTCG GGAAGACTGC CAGCCTGGCT GTTCTCCTCG CGTTCTTCGT CGTGAACGCC
GCCGTGGTCG CCTTGAGGAG GAGGGAGGGC CTCTCGAGGC GCTTCAGGGT TCCGCTCAAC
GTCGGCGACT TCCCCGTGTT GCCCGTGCTG GCGGACATAC TCATAGCCGT GTTCGTCGCT
CTGAGCTTCT GGGACTGGAT CGTCTGGCTG AGCACAGCCC TCGTAGCCGC GCTGGGGCTC
CTGCTCTACA AGCGCTGA
 
Protein sequence
MSRLKRELGL GYATLFGVGL ILGAGIYVLI GRAAGIVGDA VWASVAFSGV IAVATAFSYA 
ELSSIFAKAA STYTYVREAF PGAGLLAFFA AWMLFFGGVA GAATAALGFA GYFSRLAGLG
EGWVVPVTVA LLAVLSVLNW WGIKESAFLS AVFTVIEAGG LVLVVLLGFL FPARSPGYLS
FNPSVDPVMA VLVGAAVFYF AYTGFEYQPT LSEETVDPER VIPKSIVLAV SVTTLLYLLV
SLSVVRLMSW EELGASKAPM ADAASRAWPP AFHLLMFIAL FATTNTSLGF LVSASRLAYG
LAEEGVAWSG FGRVDGWRRT PHVAVAFTGV LAALLVFATD YLPSVTGWRL SFGGQEYQLI
DLVGKTASLA VLLAFFVVNA AVVALRRREG LSRRFRVPLN VGDFPVLPVL ADILIAVFVA
LSFWDWIVWL STALVAALGL LLYKR
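
The sequences above are printed in groups of ten residues, so they need to be joined before any analysis. The following is a minimal Python sketch, not part of the original record, of how the reported figures could be cross-checked: it strips the grouping whitespace, computes GC content as a percentage, and relates protein length to coding-sequence length (one codon per amino acid plus one stop codon). The function names clean_sequence, gc_percent, and expected_cds_length are hypothetical helpers introduced for illustration. Note that under translation table 11 a GTG start codon is read as methionine at initiation, which is consistent with the protein beginning with M while the gene begins with GTG.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def expected_cds_length(protein_length: int) -> int:
    """Coding length in bp: one codon per residue plus one stop codon.

    Assumes an unspliced CDS with a single terminal stop codon.
    """
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # The record reports a protein length of 445 aa and a gene length of 1338 bp.
    print(expected_cds_length(445))  # 1338
```

To check the record itself, paste the full gene sequence block into gc_percent and compare the result with the reported GC content, and compare len(clean_sequence(...)) with the reported gene length.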