Gene Tpen_1246 details

Gene Information

Locus tag: Tpen_1246
Symbol:
ID: 4600541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1181816
End bp: 1182874
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 60%
IMG OID: 639774022
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_920647
Protein GI: 119720152
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
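
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that matches the numbers in this record) and that the final codon of the CDS is a stop codon. The function names are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1181816
END_BP = 1182874


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1059 352
```

Both results agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.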


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCTAG CTAGCTATCT GGTGAGGAGG CTTGCAACGT TTCTCCCGAG CGTCCTCGGG 
GCGCTCCTCA TAACCTACCT CATAGCCTAC GTTATCCCCA CGGACCCCGT GAGGGCGTGG
GTAGGGGAGA AGCTCATGGA CCCCTCGACC CTAGAGAGGC TGAGGAAGGA GTACAAGTTC
GACGCGCCGT GGTACGAGCA GTTCGCCTTC CTGGTCGAGA AGCTCCTAAC GGGCACGCTC
GTAGACCCCA CCAGGGGTAT CCCCGTCGTC CAGCAGGTGG CGCAGAGGTT CCCGATAACC
GTCGAGCTGG CTATATTCGG CATGCTGTTC ACAGTGGCTA TAGGCATACC GCTGGGGATA
CTGGCAGCGG CGAAGAAGGA CAGCTTCGTG GACTTCTTCG TAAGGGTATT CGCGCTCTTC
GGGAGCTCCA TGCCGGCCTT CGTGCTCTAC TACTTCCTGA TACTGGCGTT CTACGTCTAC
GTGAGAGCCT CGCTACTAGC CGGGGTTCCC TCCCTGTCTC CAGCGTGCGC CGCCAGCCTG
GACTCCGTCA GGAACGCGGT TCCCCTCCTG GGCTACGTCG TGTGGGCGGT AGGCCAGGTC
CCGATGTTCG GCGGTCTCAT GTGCGGGGAT CTGGGGGTTG TCTCTGCGAC GTTCGTTAGG
ATGTGGCTTC CGGGGTTGGC GCTGGGGCTC CTCTCCGGCG GCTTCATAGC GAGGATAGTT
AGGAACAGCT TGCTCGACGC GCTGAGTTCG GACGCGATCC TCTTTGCAAG GGCAAGGGGC
CTTACGAGCG GCAGGATATG GCGGCACGCC TTGAAGAACG CGTTCGCGCC TATAGTCACG
ATTCTCGGCC TCAACTTCGC CGGCCTGCTC ACGGGCGCCG TGATAGCGGA GACTGTCTTC
AATATCCCCG GCATGGGGCT CTACATGTAC CAGGGGATCA CGAGGCTGAA CTTCCCGATA
ATAATCGCCG GGACGTTCAT ATTCTCGGTG ATATACATCG TGATGAACCT CCTGGTAGAC
CTCGTCTACG CGCTGATAGA CCCGCGCGTC AGGTACTAG
 
Protein sequence
MGLASYLVRR LATFLPSVLG ALLITYLIAY VIPTDPVRAW VGEKLMDPST LERLRKEYKF 
DAPWYEQFAF LVEKLLTGTL VDPTRGIPVV QQVAQRFPIT VELAIFGMLF TVAIGIPLGI
LAAAKKDSFV DFFVRVFALF GSSMPAFVLY YFLILAFYVY VRASLLAGVP SLSPACAASL
DSVRNAVPLL GYVVWAVGQV PMFGGLMCGD LGVVSATFVR MWLPGLALGL LSGGFIARIV
RNSLLDALSS DAILFARARG LTSGRIWRHA LKNAFAPIVT ILGLNFAGLL TGAVIAETVF
NIPGMGLYMY QGITRLNFPI IIAGTFIFSV IYIVMNLLVD LVYALIDPRV RY
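
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It uses the standard sense-codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listed above.
FIRST_ROW = "ATGGGTCTAG CTAGCTATCT GGTGAGGAGG CTTGCAACGT TTCTCCCGAG CGTCCTCGGG"
print(translate(FIRST_ROW))  # MGLASYLVRRLATFLPSVLG
```

The 20 residues produced match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.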