Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1403 |
Symbol | |
ID | 4601817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1356467 |
End bp | 1357648 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639774178 |
Product | hypothetical protein |
Protein accession | YP_920803 |
Protein GI | 119720308 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTCT CCGACTACCT CTACCTGGCC TTCACCAGCC TTAAGGAGAA GAAGGGCAGG GCGGCCGGGG CGGCGCTCGG CGTCATGATA GCGGTGCTCG CCTTGAGCCT CGCGCTCGGC GTCGGCGAGA GCTTCCAGAA GGCCTTCGTC GAGCAACTCC AGTCGACGAT AGCCGCGGAC AGTGTCTTCG TGATAGGCGG GTACGCCGGG CTGACGGATG CGGACATAGC CTACTTCAAG AGCATCCCGG GAGTCAAGGA CGCGCAGGGC GTGCTCATGG CGAGCGGCGT CGTCTACACG GAGTCCGGGG AGAAGCCCGT GAACATAGTG GCTGTCGACC CCTCCTTCCT GCCGAGGTAC CTCGGAGTCT CGGACATGAG GAAGGCTGTG GCGGAGGGGG AGACGGAGCC GAGGGGGCTC GGCGTACTCG TGTCGTACAG CCTGTGGAGG GACCAGGAGA CGGGGAGGAA GCTCCTCGAC GTCGGGTCCG TCCTCAACGT GAGGGTGGGC GGCAGGAACG TGCAGGTCTT CGTAGTTGGT CTACTCGAGC AGACGAGCTC CAGCATGTCG ATGGGGCACG ACTTCCAGGT GGCGACGATC TACATGGACC CCGACGCTTT CTTCACGTAC CTTGGGAGGA CTAGGAACTA CCCGGTGGCG ATAGTGCTCG TCGAGAACCT CGACGCGCTG GACTCGATAA CCGAGAACAT CAGGGCCCTC GCGCCGCCGG GCTCCAGGAT AATATCGGCG GCCGCGATGG TCAAGCAGTT CACCTCCCTC GTCGGCGCTC TCCAGCTCTT CATAGCGCTT ATATCAGCCG TCGGGATGGG CGTGACGGCT CTCTGGATCT TCGACAGCAC GACAATCAGC GTCACCCAGC GCACGAAGGA GATAGGGATA CTCAAGGCGC TCGGCTACAC GAGCGGGGAC ATACTGGCGG TCTTCCTCCT CGAAACCGTG ATCGTGTCGC TGGTGGGGGC GGCGGCGGGC CTCCTGGCCG CCTTGGCCTC GTCGAGCGTC GTGAAGATAT CGGCTTTCGG CATACAGATA GGCGTCGCCC TAACCCCCCA CACCGCCGGG CTCGCAGTCC TGCTACCCCT AGCCGCCAAC GTGCTGGCAG CGTTCATACC CGCGAGGCGC GGGGCATCCC TCAACCCCGT GGAGGCGCTC AGGTATGAGT AG
|
Protein sequence | MNLSDYLYLA FTSLKEKKGR AAGAALGVMI AVLALSLALG VGESFQKAFV EQLQSTIAAD SVFVIGGYAG LTDADIAYFK SIPGVKDAQG VLMASGVVYT ESGEKPVNIV AVDPSFLPRY LGVSDMRKAV AEGETEPRGL GVLVSYSLWR DQETGRKLLD VGSVLNVRVG GRNVQVFVVG LLEQTSSSMS MGHDFQVATI YMDPDAFFTY LGRTRNYPVA IVLVENLDAL DSITENIRAL APPGSRIISA AAMVKQFTSL VGALQLFIAL ISAVGMGVTA LWIFDSTTIS VTQRTKEIGI LKALGYTSGD ILAVFLLETV IVSLVGAAAG LLAALASSSV VKISAFGIQI GVALTPHTAG LAVLLPLAAN VLAAFIPARR GASLNPVEAL RYE
|
| |