Gene Information |
Locus tag | Tpen_1347 |
Symbol | |
ID | 4600871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1297762 |
End bp | 1298874 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639774122 |
Product | hypothetical protein |
Protein accession | YP_920747 |
Protein GI | 119720252 |
COG category | [S] Function unknown |
COG ID | [COG3535] Uncharacterized conserved protein |
TIGRFAM ID | |
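The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the listed gene sequence ends in TGA, which is consistent with that). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1297762
end_bp = 1298874
gene_length_bp = 1113
protein_length_aa = 370

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is the stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
expected_aa = span // 3 - 1

print("span matches gene length:", span == gene_length_bp)
print("codon count matches protein length:", expected_aa == protein_length_aa)
```

Both checks hold for this record under the stated assumptions; a mismatch in another record would suggest a different coordinate convention or a partial gene.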
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAGAGG TTAGGACGGA GAGGCAGGCG AGGGCCCTGG TGCTCGGCGC TACAATCCTG GGGACGGGCG GTGGCGGGAG CCCCTCGCGG GGGCTGGCAG CGATCCTAGA CGCCCTGCGG AGGGGGTTGC CCGTGAGGAT CGTCGACGTC GGCGAGCTCC CGGAGGAGGG CTTCGCGGTC ACGGCGTACA ACGTCGGCTC GATAGCCCCG GGCGCCGCGG CGGCTAGGGA GCGCAGGATC GCCGACCCCT TGAGGAGGGC TGTCGAGGAG CTGGAGAAGG TTCTGGGCGG GAGGGTTGCG GCGATAGTCC CGAACGAGAT GGGCGGCGGG AACACCGCGG CAGCCCTCAG CCTAGCCGCC GAGCTCGGAG TCCCGGCCGT CGACGGTGAC CTCGTCGGGA GGGCGGCGCC CGAGGTGCAC CAGTGCTCCG CGATCGTCGC AGGGGTCCCG CTGTGCCCCT CGGCCGCCGT GACCGCTAGC GGAGACGTGG TCGTGGTTAA GGAGTACGCG AGCATCGACG ACTACGAGGC GGTAGTGAGG CACCTCTCGG TGCTCGGGGG AGGGCGCGCC GCCGTGGCGG ATACGCCGAT GAGGCGCGGC GAGGCGGCTA GGGCCGTGGT TAGGGGGACG GTGTCGAGGG CGATGAGGGT CGGCGAGGAG GTGCTGAGGG CCAGGGAGGA GGGGAGGGGC CCCGTGGCTG CGGCTACGCG CGCCCTGGGC GGGTGGAGGG TTTTCGAGGG CGTGGTCGAG AGGTACGAGT GGAGGGACGA GGGCGGCTTC CTGCTGGGGG AGGCCGTTGT GAGGGGGACG GGGGAGTACA GGGGGAGGAC CCTAAGGACC TGGATAAAGA ACGAGCACAT AATGGTCTGG GTGGACGGCG AGCCCGCGGT AATGCCTCCC GACCTCTTCT CCCTGCTCAG GGACGACGGC GAGCCCGTCA CGAACACGGA GCTGAAGGTC GGCGACAAGG TACACGGGGT GGCCGCGAAG GCGCCGGAGA TCTGGAGGAC GCCCGAGGGG CTCAGGTACT TCGGACCCAG GCACTTCGGC TTCGACTACG ACTACGTCCC CGTCGAGGAG CTAGTCGAAA GAGCACTCCG AGCCACGCGG TGA
|
Protein sequence | MIEVRTERQA RALVLGATIL GTGGGGSPSR GLAAILDALR RGLPVRIVDV GELPEEGFAV TAYNVGSIAP GAAAARERRI ADPLRRAVEE LEKVLGGRVA AIVPNEMGGG NTAAALSLAA ELGVPAVDGD LVGRAAPEVH QCSAIVAGVP LCPSAAVTAS GDVVVVKEYA SIDDYEAVVR HLSVLGGGRA AVADTPMRRG EAARAVVRGT VSRAMRVGEE VLRAREEGRG PVAAATRALG GWRVFEGVVE RYEWRDEGGF LLGEAVVRGT GEYRGRTLRT WIKNEHIMVW VDGEPAVMPP DLFSLLRDDG EPVTNTELKV GDKVHGVAAK APEIWRTPEG LRYFGPRHFG FDYDYVPVEE LVERALRATR
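The sequences above are printed in space-separated groups of ten characters, so the spaces need to be stripped before any computation. As a rough sketch, the hypothetical helper below (`gc_fraction` is not part of the record or of any library) computes the GC fraction of a sequence in that layout. It is shown here on the first two groups of the gene sequence only; the 70% figure in the record refers to the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base groups of the gene sequence, copied from the record.
prefix = "GTGATAGAGG TTAGGACGGA"
print(f"GC fraction of the first 20 bases: {gc_fraction(prefix):.2f}")
```

To check the reported GC content, the full gene sequence string can be passed to the same function in place of `prefix`.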