Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0905 |
Symbol | |
ID | 6164361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 801462 |
End bp | 803192 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641668061 |
Product | DNA polymerase B region |
Protein accession | YP_001794288 |
Protein GI | 171185369 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.408153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.581013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTACTT TCGGCCGACC CACCCCCCGG CGCTCCTTAA GCGGTGCGGA TATTGCCCAG CTCCGCAGAG GCGCCGTGTA CATCATAGGG CCTCGCCCCT CCGCAGACGC CGAGCGCGTC GACCTAGTGC CCTACGTCAA AGAGGGCCTC TTCTACAGGG AAGCTCCCCT GCCCGTGTGG AGGATCGAGG GACGCGCCGC CGCGGCTAAG CTGGCGGCTA GAGGCTACAG GCTCTCCCAG TCCTACATTC CGCCCTCGGT CCGGCCCTAC CTAGAGGGCC GCAGAAGCCC CCGGCTGGGC GGCAGAGTCG CCTCCTTCTA CCTAGACCGA GGCGTCGTCC ACTGGCGGAT GGGCGGCGAG GCGTGGAAGG GGGGATGTCC ACAGGCCGAC TACGTGGCGT CTTGGGGCCA GCCGCCCCCC TGTCCAGGCC TCTGGGTGGA CTTGAGACGT ATCTACCGCC GGCACCTCTG GCTGGAGGCG GCCGAGAGGA TGGGGGACGC CTGGCTTCTA AAGGCGGCGC GGAGGAGGCC GGAGGACGCC GCCGATGCTG TGCACGCCCT CGCCGCCGAG GCGGTGGCCT TCCTCGAGGC CCTCGCCGCG GCCACGGGGA TAGGGGTAGA CGCCATATTG GACGTTCTGG AGCGCAACTC GGTGGCCGCC GTGGCGGAGG CCGTGTTCCA CTGGGAGGCG GAGAGGCGGG GGTACGTGCT GGAGGACTCG CGTAGGCTGT TCGACATGCC CTACTTCGAC ATGGCGCGCG GCGGGAAGCC CGGGGTGTAC AGGGGGGTGG CCGAGTTCGA CTTCAGATCC CTCTTCCCAT CGCTGGCCGT TAAACTCGGC GTAGATCCCA CCACGGCGAG GCCCTGCGCC GGGGGGTACT GCTTCGACGG AGGGCCTATA GCCGACGTGA TAAAACGCTG GCTGGAGGCG AGGCTGAGGA CCGGGGACAA GCGCCTGTCG GAGGCCCTCA AGTGGCTTAT GAACGCCGGC ATAGGGGCGT GGGGCAAGGC GGGGTGGGGG ATAGTCTGCG AGGTCTGCCT CCTAGCCGTC AGAGGCGCCG CGGCATCTCT CTTCGACGAG GCTTGGAGGA GGCTCAAGCC CATCTACGGC GATACCGACT CCATCTACGT GGAGAGTAGC AGGGCGGGGG ACGTGGCCCA GTGGGCCTCC GCCCTCGACG GGCTGACCCT GGAGCTACGG GGGGTTTGGG ACGTCTTTAT CCTGGCCCCC GCCAGAGGCG GGGGCGCGGC GGAGAAGAAC TACATCAAGG CGGGGGGCGG CGTCTTCGAG GCGAAGGGAG GCCTCCTCAG GCCCCACGAC CTGCCCCTAG CGGTGAGGCT GAGGTACCGG GAGGTGGTGG AGGCCGCCCT GGCGGGGAGG GACCCCGCCG ACGCCGCGGC CGAGATCCTC AGAGCTGCGC CGCCTGAGCA GCTCTTCGTA TATAGGGCCG TCGAGGTGCG GAGGCTCGCC GGCCTTAAGC GGCGCGGCGC CCTACACACA GCCGCGCTGT ATCTCGCCAG GAGGTGCGGC GGGTGCGAGG TGGTCGACGT CTTCTACGTG CCAGACGGCG ACCTCGTCTA CGTCCCCTGG GCCGCCGTAG ATAGGCGCCT CGCGGCGAGG CCTGCGGACG AGCGGGAGGT GAGGGCGGCC GCGGTTAGCT ACATCCGCAG GCTCTGGAAG CTCAAGGCGT TGCGGCTCGC CTCGGGGGGC CAGATGAAAA TAGTTATATA G
|
Protein sequence | MFTFGRPTPR RSLSGADIAQ LRRGAVYIIG PRPSADAERV DLVPYVKEGL FYREAPLPVW RIEGRAAAAK LAARGYRLSQ SYIPPSVRPY LEGRRSPRLG GRVASFYLDR GVVHWRMGGE AWKGGCPQAD YVASWGQPPP CPGLWVDLRR IYRRHLWLEA AERMGDAWLL KAARRRPEDA ADAVHALAAE AVAFLEALAA ATGIGVDAIL DVLERNSVAA VAEAVFHWEA ERRGYVLEDS RRLFDMPYFD MARGGKPGVY RGVAEFDFRS LFPSLAVKLG VDPTTARPCA GGYCFDGGPI ADVIKRWLEA RLRTGDKRLS EALKWLMNAG IGAWGKAGWG IVCEVCLLAV RGAAASLFDE AWRRLKPIYG DTDSIYVESS RAGDVAQWAS ALDGLTLELR GVWDVFILAP ARGGGAAEKN YIKAGGGVFE AKGGLLRPHD LPLAVRLRYR EVVEAALAGR DPADAAAEIL RAAPPEQLFV YRAVEVRRLA GLKRRGALHT AALYLARRCG GCEVVDVFYV PDGDLVYVPW AAVDRRLAAR PADEREVRAA AVSYIRRLWK LKALRLASGG QMKIVI
|
| |