Gene Tpen_1554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1554 
Symbol 
ID4600903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1504467 
End bp1505531 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content54% 
IMG OID639774328 
Productextracellular solute-binding protein 
Protein accessionYP_920953 
Protein GI119720458 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.202841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAGC TCAAAGTCCT AGGAATACTT CTCGTAATCG TACTAGCAGT AGCCATCGGC 
GCATTGGTAT ACTTTGGCTC TAAACCTTCT CAGCCGGCTA CACCGGCAAA GGCTGTCGTC
GAGCTACCGA CCGGCGAAAG AATAGAGGTT CCGGAGAACG TCGCCGGCAA GGTGGTATTC
TACACAAGTA TACCAGACGT CATCGTAAAT TCTTGGAAGG GTAACTGGAG CAAGTACTTC
GGGTCTACGA TATCGCTGGA GGTCTGGAGG TCCGGGACGG GGAAAGTGGT CGCTAAGCTC
CTTGCGGAGA AAAAGGCCGG TAGCGTGGAG GCAGACGTTG TCTACATAGC ATCACCCTTC
GAGTTCGAGA CCTTGATAAA CGAGAGCATC ATAGAGAAGT TCCCCGACAT TCCCGAGCTG
AAATACATCC CCCAGGAGTA CAGGGATCCC AGAGGGTACT ACGTGTGGGG AAGAGTGCTT
GTAATGGTGA TAGTGTATAA CCCGAACATA GTGACTGACC CGCCGAAGTC GTGGCAGGAC
TTGGTGAAGC CTGAGTGGAA AGGTAAGGTA GTGATAGCCA ACCCACTCTA CTCGGGATCC
ACGCAGGTCG CAGTAGCGGC CCTCGCCTCA AAGTTCGGGT GGAGCTACTT CGAAAAGCTG
AAGGAAAACG ATGTACTCGT TGTACAAGAC GTGCCCGACG TCGCAAGGGT TGTTGCCACG
GGTGAGAGAC CCGTAGGCGT GACGCTTACA ATGTACCTCG GGGCGTACCC CACGCTAAAG
TTCGTAGCAC CGGAGGAGGG GGCCATAGCG ATACCCAGCC CCGTAGGCCT AGTAAAGAAC
GCCAAGCACC CGGAGGACGC TAAGGTGTTC TTGAGGTTCC TGCTCTCCAA GCTAGGGGCC
CAGGCCTTAA CCGATGCCTA CACCTACTCT ACCAGGATTG ATGCCCCTGC GCCGAAGGGT
CTTCCACCGC TCTCCCAGCT GAAGATCCTG AAGGTAAGCA TGGACGAGCT AAGGCCCATT
GTGAGCCAAA TAAGGGATAA GTGGACGCAG ATATTCGGTG GATAA
 
Protein sequence
MPKLKVLGIL LVIVLAVAIG ALVYFGSKPS QPATPAKAVV ELPTGERIEV PENVAGKVVF 
YTSIPDVIVN SWKGNWSKYF GSTISLEVWR SGTGKVVAKL LAEKKAGSVE ADVVYIASPF
EFETLINESI IEKFPDIPEL KYIPQEYRDP RGYYVWGRVL VMVIVYNPNI VTDPPKSWQD
LVKPEWKGKV VIANPLYSGS TQVAVAALAS KFGWSYFEKL KENDVLVVQD VPDVARVVAT
GERPVGVTLT MYLGAYPTLK FVAPEEGAIA IPSPVGLVKN AKHPEDAKVF LRFLLSKLGA
QALTDAYTYS TRIDAPAPKG LPPLSQLKIL KVSMDELRPI VSQIRDKWTQ IFGG