Gene Tpen_0295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0295 
Symbol 
ID4601304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp262187 
End bp263464 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content56% 
IMG OID639773053 
Productextracellular ligand-binding receptor 
Protein accessionYP_919708 
Protein GI119719213 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00215274 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCCG GAAATACTGT AGCAGGTAAA AGCAGTTTAG CAAGGGTGCT TGCATACATC 
GTGATAGCTT TACTTATAGG CGCCTTCCTG GGCTACTTCC TGCGAGGATA CCCGGCACAG
CAACAGGCGG CTACCACGCA AACATCGGTA ACAACGATAC CTATAGGAGC GCTCGTTGAG
CTTTCTGGTG ATCTCTCGTC TTACGGTAAG AGAGACGAGC TAGCAATGCA GATAGCCATC
GAAGACGTTA ACAACTTCGC CGAGAAGATA GGCTCGCCGT ACAGGTTCAA GTTGCTCGTA
GAGGACTCCG GAACTAGCCC GGAGCAAGCC CTCTCGAGGA TTAAGACTCT GGCCGCGCAG
GGTGTACGCG CAGTTATAGG GCTAGAGGCC AGTAGCGAGG TAGCAGCAGT GAAGCAGTTC
GCTGACACGA ACCACGTAGT CGTTTTAAGC GTAGGATCGA CAGCTTTGTC GCTGGCAATC
CCCGGCGACT ACATACTAAG AGTTGTACCA CCAGACAGCG TGCAAAGCAA GGCTCTGGCC
CGCCTTATCT ACTCGCTGGG ATACCGGAAC GTGGCGGTGA TATACCGCAA CGACGCATGG
GGTGTAGGTC TCTTTGAAGG GTTCAGCGCG AGGTTCAAGG AGCTGGGAGG GAACGTGGCG
GGGGTTGCGT ACGACCCTGC GGCTAAGGAT CTCAGCGGTG AGGTGAACAG GCTCGCGGAC
ATAGCGGCTA GCATGGGGTC TAACACGGCG GTGCTCGCCA TAACGTTCGA GGACGACGGC
ATACAGATAG TGAAGCTAGC GGCCAGGAAC CCTGTCCTCT CCAAGCTCAA GTGGTTCGGC
ACAGACGGCG TGGCTCAGTC GACCAAGCTT GCAAGCGAAG CTGGGGAAGA GCTAATAGCT
CTGGGAGGCT TTCCGTGCAC GATATTCCAG CCTTCCGAGA ACCAGCGTCT AGCAGACTTC
GTGAACAGAT TCCGTAGTAG GAGCGGGGGC GAGGATCCAC ACGCGTACGC CATGAACGCC
TACGACGCAG TCTGGCTCGT AGCCCTATCG GTGATGCTTA CAGGCTCCTA CTCGGGAGAC
AAGCTACTAA GCACTATTCC ACTCGTCGCC CAGAACTTCA ACGGAATCAC GGGACCGCTC
ACGCTGGACG CAAACGGAGA CAGAGCTTCG GGAGACTACG CTATATGGCG CGTCGTTAAA
ACAGCAAACG GCTACGACTG GCAGATAATA GGATGGTACA GCGCGTCCTC CGATAGCGTT
ACAATCCAGG GAGGCTAA
 
Protein sequence
MSAGNTVAGK SSLARVLAYI VIALLIGAFL GYFLRGYPAQ QQAATTQTSV TTIPIGALVE 
LSGDLSSYGK RDELAMQIAI EDVNNFAEKI GSPYRFKLLV EDSGTSPEQA LSRIKTLAAQ
GVRAVIGLEA SSEVAAVKQF ADTNHVVVLS VGSTALSLAI PGDYILRVVP PDSVQSKALA
RLIYSLGYRN VAVIYRNDAW GVGLFEGFSA RFKELGGNVA GVAYDPAAKD LSGEVNRLAD
IAASMGSNTA VLAITFEDDG IQIVKLAARN PVLSKLKWFG TDGVAQSTKL ASEAGEELIA
LGGFPCTIFQ PSENQRLADF VNRFRSRSGG EDPHAYAMNA YDAVWLVALS VMLTGSYSGD
KLLSTIPLVA QNFNGITGPL TLDANGDRAS GDYAIWRVVK TANGYDWQII GWYSASSDSV
TIQGG