Gene Tpen_1576 details

Gene Information

Locus tag: Tpen_1576
Symbol:
ID: 4600562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1528285
End bp: 1529568
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 58%
IMG OID: 639774349
Product: extracellular ligand-binding receptor
Protein accession: YP_920974
Protein GI: 119720479
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
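
The start, end, and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # both endpoints are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon is not translated
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(1528285, 1529568))  # (1284, 427)
```

Under these assumptions the result agrees with the listed gene length of 1284 bp and protein length of 427 aa.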


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0597639
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCAA CGCAATCTCA GCAAAAGGGA AAATTCGCCT TTGACGTAAA GCTCGTAGCA 
GTCGCGATAA TCGCCCTCAT AGTCGGAGTA GCCATAGGCG CGGCGATCTT CGGAGGAGCG
GCGGGGCAGG CCCCCGGCGC GGCGCAACAG GCTCCCAAGA AAGTCTACAC TATAGGCTTC
ACGCTCCCGC TGACCGGCGA GCTCTCGTCC ATCGGGAAAA TTTGGGAAAA AGTGGTCTAC
CTGGCAATAG ACGACCTGAA CAAGGAGGCG CAGGCTTACG GGTTTAACGT AGAGTTCAAG
GCCGTGATAC TCGACGACGG GACTACGCCC GAGAAGGCCC TCCAAAACGT CCAGACGCTC
GCACAGCAAG GAATAAAGGT CATAATAGGA CCCGCCGCAA GCTCGCAGGT TAAGGCCGTG
AAAGGATTCG TCGATAGCAA CCAGATAGTC TTAATCTCGC CGTCCTCCAC TGCTCCCACG
CTCGCAATAC CCGGCGACTT CATCTTCAGA ACCGTCGGGT CCGACGCCGG GCAGGCAAGA
GTGCTCGCAA CGCTCGCCTA TCAGGAGGGC GCCAGGAAGG TCATAGTGTT CCACAGGAAC
GACGAGTACG GTAACGCGTT CGCGGACTTC TTCAAGAAGT ACTTCTCCGA GCTTGGCGGA
AGCTCCATTG ACGTTCCCTA CCAGACCGGC CTCTCCGACT ACGCCGCAGA GGTCGCCAGC
CTCGCCAGTA AGGTGCAGTC CGAGAAAGTA GACGCCGTCG TGCTGATATC GTTCGACACG
GACGGCGGCA ACATACTGTC GCACGCCGCC GAGTCGCCAG TCCTCTCCAG CGTGAGGTGG
TTCGTCTCCG AGGGTCCCCA CGGAGCCGCA GAGCTCAAAG CCCCGGCTGT CGGGGCTTTC
GCCGCTAAGA CGAAGTTGCT GGGGACACGC CCACTCTTCA TAGGCAACCC GCTCTACGAG
GACTTCAAGA AGAGACTTAA GGAGAAGTAC GGGGTGGACG CCTCCGTGTT CTGCGATACG
CTCTACGATG CAGTTATGCT CGCGGGCTGG GCTATGCTGA GGGCTGGTAG TAGCGATGGC
AACGCTATAC GGAGCGCGCT GATCGAAGTC GCGAAGCACT ACTACGGAGT GAGCGGCTGG
GCAATATTCG ACGAGGCGGG CGACAAAGCG TACCAGGACT ACGGAGTATG GGCGATAGTC
AAGACCGACG GGGGCTACGA CTTCAAAGAC GTAGGGGTAT ACGAGAGAGG CTCCATAGTG
TTTACGGCTA AACCGTACCC GTAA
 
Protein sequence
MSATQSQQKG KFAFDVKLVA VAIIALIVGV AIGAAIFGGA AGQAPGAAQQ APKKVYTIGF 
TLPLTGELSS IGKIWEKVVY LAIDDLNKEA QAYGFNVEFK AVILDDGTTP EKALQNVQTL
AQQGIKVIIG PAASSQVKAV KGFVDSNQIV LISPSSTAPT LAIPGDFIFR TVGSDAGQAR
VLATLAYQEG ARKVIVFHRN DEYGNAFADF FKKYFSELGG SSIDVPYQTG LSDYAAEVAS
LASKVQSEKV DAVVLISFDT DGGNILSHAA ESPVLSSVRW FVSEGPHGAA ELKAPAVGAF
AAKTKLLGTR PLFIGNPLYE DFKKRLKEKY GVDASVFCDT LYDAVMLAGW AMLRAGSSDG
NAIRSALIEV AKHYYGVSGW AIFDEAGDKA YQDYGVWAIV KTDGGYDFKD VGVYERGSIV
FTAKPYP
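
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the last line of the gene sequence, so its value is not the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence above (a fragment, for illustration only).
fragment = clean_sequence("TTTACGGCTA AACCGTACCC GTAA")
print(len(fragment), round(gc_fraction(fragment), 3))
```

Passing the full gene sequence through `clean_sequence` first would give a figure that can be compared with the GC content field.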