Gene Information |
Locus tag | Tneu_0471 |
Symbol | |
ID | 6166097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 426882 |
End bp | 428483 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641667628 |
Product | hypothetical protein |
Protein accession | YP_001793864 |
Protein GI | 171184945 |
COG category | [S] Function unknown |
COG ID | [COG4938] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
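The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tneu_0471.
length = gene_length(426882, 428483)
print(length, protein_length(length))
```

For this record the sketch gives 1602 bp and 533 aa, which matches the listed Gene Length and Protein Length.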
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0692558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000000612882 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGCAGACGG CGGAGCCCAG GCCCATTGTA AACGCTCTGC GGGCAATCTA TGCGAAGCTG TCGCCTCTTG ACAGGTACGC CGCTGGGCGT GTGGAGAGGA GGCTGTCGGC TAGGATTGCG GAGTATCTAG AGCGCGAGTT CCGGGCGGAG GCGTCGTCGA GCGTCCGGAG GTTGGTTTCT GGGGTGGTTG AGCTGGTTGC GGCTTTGAGG TCGGGTGGGG TTGAAGAGGC TCGTACCGTT TTGGGGCGGA TGGGCGAGGT GGGTCTTAAG GTGGCTGAGG CCGGGGGCTC TGTTGTCGTT AGAGGTCCGC CGATGGAGTG GGCGGTGGAC GTGGGTGTCT TGAGGCAGAT GGCTGTGGAC GCGTTTTACG GCTTTATGGC CGAGCTTGTG CCTGTGAGGG GGGTGGACGC GGTGCGGCTT GAGCCGCTCG AGCCGCTTGA TGTTGTGGGC GCGCAGAGAC TCCGTGTTGA GGCGGAGCGC CGGTATCCGT TGGGGTGGTT TGGGCAGGCA GGTGAAGTAC TTAGGAGGTT GGGTTGTGGG GGTGATGAGT TGGAGCGTCT GCTGTTCTCG TCCTCGGTGA GTCTTCACTT TGATGTTGGG GTGGGTGGGA GGTCGAGTGT GTGGCTTGAG TTCCGCATAG ATACGCGGGC TTTCTCGCCG GGGGGCGCCG GCCGCGTGGA GAAAACCGAT GGGGGTTTGT CCAAGGCGTT GGACGAGTAT GTGGAGGAGT TGGTGTCTAA GGTTGTGCGC AGGTCGGCTG CCTATCTTTC TAGTGCTGTG GTGGGGGGCG TCCGCGGCGC TTTGAGGAGC CGCCTTGGTT TTGAGGGGCT GCGTTTTGTT CCTTTTGGCA GAAGTGTGCT CGTCTTGGCG CTGGAGAGCG CCTCTAGGGA GCCGTATGCC AGGCCTGTGT ACCTGCGTAG GTTGGTTGAG GAGTTCTATC CAGGCGTGTT GGCGAGCTAT GTCTACTGGG CGAGCGAGGG GCGGAGGCGT CTGTTGGAGT GGCTCGACGA GGCGGGGGCG AGGGTTCTCG ACGCCGCCGC GCCTCTGCTG GAGGGCAGGC TGGTGCCGGG CGCCGCAGGG AGGGTGATGT ACAGGGACTG GCGTGGGTCG CTTGTCGAGA TTCGGCTGTC TTCCGCTCTT GTGGGCGAGG TCGCCGGCCT TCTCTTCTCC CTGCTGAGCG TGGGCGGTAG ATCTCTGGTG CTTGTCGAGG AGCCCGAGGC CCAGTTGCAC CCCGGGGCGC AGATAGCGAT GGCGCTCTTC TTAGTCTCGC TACCGGCTCT GTGCGGGTGT AGGGTCGTGG CTACCACACA CAGCGATCTG TTGGCCATCA CCATGAGCCA GCTGGCGGTG CAGAGGCCGG ACAGGCAATG GGTTGTGGAG CTGCTGGCGA GGGTTCTGCC CCATGTGAAG GAGGGCGTTG ACGTGTTGGC TGGGGCCGTG GCGGAGGCCG CCGTAGACCT GAGGATCTAC GAATTCACAA GAGAGGGCAG GGTGGCGGCT GTGAGGCCGG AGGACGTGCT CGGCAAGGAG GTACCTGGGA TAAGCAGGGT AATCGACGAG CTCACCGATT GGGCCTTTCG CCTTGCGAGC CGCCGGAGGT GA
|
Protein sequence | MQTAEPRPIV NALRAIYAKL SPLDRYAAGR VERRLSARIA EYLEREFRAE ASSSVRRLVS GVVELVAALR SGGVEEARTV LGRMGEVGLK VAEAGGSVVV RGPPMEWAVD VGVLRQMAVD AFYGFMAELV PVRGVDAVRL EPLEPLDVVG AQRLRVEAER RYPLGWFGQA GEVLRRLGCG GDELERLLFS SSVSLHFDVG VGGRSSVWLE FRIDTRAFSP GGAGRVEKTD GGLSKALDEY VEELVSKVVR RSAAYLSSAV VGGVRGALRS RLGFEGLRFV PFGRSVLVLA LESASREPYA RPVYLRRLVE EFYPGVLASY VYWASEGRRR LLEWLDEAGA RVLDAAAPLL EGRLVPGAAG RVMYRDWRGS LVEIRLSSAL VGEVAGLLFS LLSVGGRSLV LVEEPEAQLH PGAQIAMALF LVSLPALCGC RVVATTHSDL LAITMSQLAV QRPDRQWVVE LLARVLPHVK EGVDVLAGAV AEAAVDLRIY EFTREGRVAA VRPEDVLGKE VPGISRVIDE LTDWAFRLAS RRR
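The sequences above are printed in 10-character groups, so they need cleaning before any analysis. The sketch below, with hypothetical helper names, strips the spacing, computes a GC fraction, and checks whether the first codon is an initiation codon under translation table 11. The start codon set is the one listed for NCBI translation table 11. Only the first three groups of the gene sequence are used here as a short sample.

```python
# Initiation codons listed for NCBI translation table 11; each is read as Met
# when it starts a CDS.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_table_11_start(seq: str) -> bool:
    """True if the first codon is one of the table 11 initiation codons."""
    return seq[:3] in TABLE_11_STARTS


# First three groups of the gene sequence from the record, used as a short sample.
sample = clean("TTGCAGACGG CGGAGCCCAG GCCCATTGTA")
print(len(sample), has_table_11_start(sample))
```

The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with TTG acting as an alternative start codon under table 11. Applying `gc_fraction` to the full cleaned gene sequence would allow a comparison with the listed GC content of 63%.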