Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0451 |
Symbol | |
ID | 4601854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 409958 |
End bp | 411334 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773218 |
Product | hypothetical protein |
Protein accession | YP_919863 |
Protein GI | 119719368 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTGA ACAAGAAAGC CCTAGTCTTG GTAGCAGTTG CAGTGCTCAC AGCCCTGGCA GCCTACATCG CGCTTCTAGC AACGCCGCCC GCACAGCCAC CCGTGCAACC ACCTACACAG ACGCCAGCCC AGGGGCCGGC GCCGGCCGAG AAGCCTAGGA ACGCCACCCT AACGCTCAAG GTCTACGGCC CCGGCTCCCT GCTCGTCAAC GGCACGGGCT ACGTGAACGC CACGCTGACG CTCAGGGCGC CCGCCGTCTT GGCTATCAAT GCCTCGCCCC AGCCAGGCTG GAGGCTGAAG GCTTTACTCG TGAACGGGTC GCCCGTTTTA CCAGGCGTAG CGAGGGTATC GGGCGATACG ACGGTTGAGG CGGTTTTCGC GTGGAGTGGC CCCGTGGTCA CGCTGAGGGT TTACGGCTCC GGCTACCTGC TCGTCAACGG CTCGAGCTAC GGCAACGACA CGCTGTTGCT CAGGGGCGGG GCACTGCTCT CGATCAACGC GTCCACGCCT AGAAGCTGGA GGCTGAAAGC ATTGCTGGTC AACGGCTCCC CCACCTCGCC CGGCGATGTC AGAGTCTACG GCAACACGAC CATCGAGGCC TTCTTCGAGC AGGTGAAGGT AAAGGTCAAG GTGGTGCCGG GCGAGCACCC CGTAACAATC AACGGTTCCT GGGTGAACTC GACCACTGTG CTGGAGGTCC CGGCCTACTC TGTCCTCGTG CTGGGCCCTG CGAGTGTGGA GCTCAACGAG ACCTGTGAGG CTGTGCACTA CTGGAACGCC AGTGTGGCTG GGCGCTGGAC GCTCCTACAC GGCGACGCCT CGCTCGAAGT CGCGAACGAC ACGGTACTGG TGGCGGGCTG GAGCCTCAAG TGCCACCCAC CACGCTCGAC GCTCGGAGGA GTACTCTACG CCGGCAGAGA AGTCAAGGCC AGGATGGTGC TCACAGTGAA GGAGGCCCAG TCGGGGTCCT GGAGGTACAA GGGCAACGGA GTCTGGGAAA TAGAGGCACC CGGCTTCCTC ATAGTACTCC TAGAAACCCC GAAGAACTGG AGCAAAGTAG TAGTGAAGGG CAAGCCGCTC GCGCGTAGCG GCATCATAGA GATCCTCGTG ATAGTCGAGA ACGGCCCCTC GATGTACCGC TCGAAGGGCG CCGGCCTAAT ATTCGAGGAC ATATCCTACT TCGAGTTCGT ACTGCCGAGG TGCATAATGC AGGGAACGTG CGACGCAACG GTAAACGCCT ACGGAAGCTT CGTGAACGAG GGCTACCGCG AGCACTGGGG CCCGAGGCTG GAGCCACCAC CGGTAGAGCC CGGCTGGCTC GAAATACAGG TCTACCCCGG GACGCACGTA GAGATACAGG TATTCGTCGA GCCCTAA
|
Protein sequence | MTVNKKALVL VAVAVLTALA AYIALLATPP AQPPVQPPTQ TPAQGPAPAE KPRNATLTLK VYGPGSLLVN GTGYVNATLT LRAPAVLAIN ASPQPGWRLK ALLVNGSPVL PGVARVSGDT TVEAVFAWSG PVVTLRVYGS GYLLVNGSSY GNDTLLLRGG ALLSINASTP RSWRLKALLV NGSPTSPGDV RVYGNTTIEA FFEQVKVKVK VVPGEHPVTI NGSWVNSTTV LEVPAYSVLV LGPASVELNE TCEAVHYWNA SVAGRWTLLH GDASLEVAND TVLVAGWSLK CHPPRSTLGG VLYAGREVKA RMVLTVKEAQ SGSWRYKGNG VWEIEAPGFL IVLLETPKNW SKVVVKGKPL ARSGIIEILV IVENGPSMYR SKGAGLIFED ISYFEFVLPR CIMQGTCDAT VNAYGSFVNE GYREHWGPRL EPPPVEPGWL EIQVYPGTHV EIQVFVEP
|
| |