Gene Tpen_1165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1165 
Symbol 
ID4601179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1106370 
End bp1107461 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content58% 
IMG OID639773941 
ProductABC transporter related 
Protein accessionYP_920566 
Protein GI119720071 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.191612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCACGTG TAGCCGTCAA AGATCTTGTG AAAAGGTTTG GAAAAGTGGT TGCCGTCGAC 
AGGGTCTCCT TCGAGGCTAA GGACGGCGAG TTCCTCGTTC TCCTCGGCCC CAGCGGGTGC
GGGAAGACCA CTACCCTGAG GATGATAGCC GGGCTAGAGA CGCCAGACGA GGGAGAGATC
TACATCGGGG ACAGGCTCGT GAACGACCTG CCGCCCAAGG ATAGGGACGT GGCGATGGTG
TTCCAAAACT ACGCTCTCTA CCCGCACATG AAGGTATACG ATAACATCGC TTTCCCGCTC
AGGATAAGGA AGCTGCCGGC CGACGAGATA GACCGCAGAG TCAGAGAGGT GGCAAAGCTC
CTGAGGATAG AGGAGTTGCT GGACAGGTAC CCGAGGCAGC TGAGCGGCGG GCAACAGCAG
AGGGTCGCCC TGGGTAGGGC TCTCGTGAGG CAGCCACAGG TCTTCCTGAT GGACGAGCCT
CTCAGCAACC TCGACGCAAA GCTGAGGGTG TACATGAGGG CTGAGCTGAA GAGGCTTCAG
AGAGAGCTCG GCATAACAAC GATCTACGTT ACCCACGACC AAGCGGAGGC TATGACCATG
GCGGACAGGG TAGCTGTGAT GAACGAGGGG AAGATAATGC AGCTCGCAGA CCCCGCCGAG
CTCTACTTCA GGCCCGCGAA CACCTTCGTT GCGGGCTTCA TAGGAGCCCC GGCGATGAAC
TTCGTAGACG CCTCGGCGAA GGTTGAAGAC GACACGGTCG TGCTCGACAC GGGGATCTAC
CGCATCAGGC TCCCCAAGGA CGCCTCCGAG GTGCTGATAA AGCAGGGCGT GCCGAGCGAG
GTCATATTCG GTATAAGGCC TGAGCACATC ACCGTTAGCA AGCAGGAGTT CCCCGGGAGC
TTCGCCGCGG AGGTCTTCGT AACGGAGCCC CTAGGATCGG AGACGATAAT CGACTTCAAG
CATGGAGACG CTATACTCAA GGCGAAGTAC CCCGGGCACT TCGAGGCCTC TCCGGGAGAG
AAGATATACA TAGGCTTCCA GCTACAGTAC GCCCACGTGT TCGACAAGAA GACAGGAAAA
GCCCTAGTCT AG
 
Protein sequence
MARVAVKDLV KRFGKVVAVD RVSFEAKDGE FLVLLGPSGC GKTTTLRMIA GLETPDEGEI 
YIGDRLVNDL PPKDRDVAMV FQNYALYPHM KVYDNIAFPL RIRKLPADEI DRRVREVAKL
LRIEELLDRY PRQLSGGQQQ RVALGRALVR QPQVFLMDEP LSNLDAKLRV YMRAELKRLQ
RELGITTIYV THDQAEAMTM ADRVAVMNEG KIMQLADPAE LYFRPANTFV AGFIGAPAMN
FVDASAKVED DTVVLDTGIY RIRLPKDASE VLIKQGVPSE VIFGIRPEHI TVSKQEFPGS
FAAEVFVTEP LGSETIIDFK HGDAILKAKY PGHFEASPGE KIYIGFQLQY AHVFDKKTGK
ALV