Gene Tpen_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1547 
Symbol 
ID4600840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1495826 
End bp1496887 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID639774321 
ProductABC transporter related 
Protein accessionYP_920946 
Protein GI119720451 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGGGG TTAGGCTGGA GCACGTTACC AAGCGGTACG GGAAGGTAGT CGCGGTGGAC 
GACGTTAGCC TGGAGGTCAA GGAGGGAGAG TTCTTCGTAC TCCTGGGGCC CAGCGGTTGC
GGCAAGACAA CGACTCTCCG AATAATCGCG GGCCTAGAGG AGCCGGACGA GGGGAGGGTG
TTCTTCGGAG AGGAGGACGT GACGAGGCTA CCCCCCGGCA AGAGGAAGAT CTCGATGGTG
TTCCAGAGCT ACGCCGTGTG GCCGCACATG AAGGTGTACG ATAACATTGC GTTACCACTC
AAGGTGCAGG GCTACCCTCC GGAGGAGATC GAGCGCAGGG TTAGGGAGGC GGCGCGGCTC
GTGCAGATAG AGGATCTCCT CGACAGGTAC CCGCAGCAAC TCTCGGGTGG GCAGAGACAG
AGGGTCGCGG TTGCCAGGGC GCTGGCAGTG ACGCCGCGCG TACTGCTCAT GGACGAGCCT
CTGAGCAACC TCGATGCCCT CCTCAGGGTG CAGGCGAGGG CGGAGCTAAA AAGGCTCCAG
CGCGATACCC GCCTCACCAC GATCTACGTA ACGCACGACC AGGTGGAAGC AATGGTTCTA
GCCGACAGGG TAGCGGTAAT GAACAGAGGA AGGGTACTCC AAGTGGGACC CCCGGAGGAG
ATATACAGCA AGCCTTCGAG CAGGTTTGTC GCGCACTTCA TAGGGTCGCC CCCCATAAAC
CTCTTCGAAG GCACGGTTAC GCCTAGAGGT GTCGACGTCG GCTTCGTGGT TCTGCCTGTG
GGCGCGGGGA GTCTCGAGGA GTACAGCGGC AAGAGGGTCA TAGTGGGCAT AAGGCCCGAG
GACGTCCATT TAACGGCCTC CCCAGGCTTC ATAGAAGTCG AGGGAAGCGT CTGGGTCACG
GAGAACCTGG GAGGAGAGTA CGTAGTGCTG GTAAGCGTGG GCGATGTCAT CCTCAGGGCG
AGGAGTCGAG AGAAGCCCGA GTCGGAGAGA GTCAAGGTAT ACCTTGACCC CTCGAAGCTA
CACTTCTTCG ACAAAGAGAC AGAGGCCAGG ATCTTCAGCT AA
 
Protein sequence
MVGVRLEHVT KRYGKVVAVD DVSLEVKEGE FFVLLGPSGC GKTTTLRIIA GLEEPDEGRV 
FFGEEDVTRL PPGKRKISMV FQSYAVWPHM KVYDNIALPL KVQGYPPEEI ERRVREAARL
VQIEDLLDRY PQQLSGGQRQ RVAVARALAV TPRVLLMDEP LSNLDALLRV QARAELKRLQ
RDTRLTTIYV THDQVEAMVL ADRVAVMNRG RVLQVGPPEE IYSKPSSRFV AHFIGSPPIN
LFEGTVTPRG VDVGFVVLPV GAGSLEEYSG KRVIVGIRPE DVHLTASPGF IEVEGSVWVT
ENLGGEYVVL VSVGDVILRA RSREKPESER VKVYLDPSKL HFFDKETEAR IFS