Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1006 |
Symbol | |
ID | 6164939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 897878 |
End bp | 899155 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641668159 |
Product | hypothetical protein |
Protein accession | YP_001794384 |
Protein GI | 171185465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.260042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0010639 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCCAA AGGTAGCCGC GTTCCTCCTA GCGGCGGCCC TCGCCGCCGC CCAACAGCTG TACGTCCCAG TTTTGGTAGA CGTGTCACAC GGCGAAGCCA CGAAGGGGCT CGACCTCTGG GTAAACTCCA CGGCAAACCC CCTGGCGATT ACGGACTTCG CTAGGCTGTA CGTCCTCGTC CCGCCAGACG CCAAGCTCGA CCCCACCCTG ACTAAGCTAA ACGCCACCAA GGCGGCCGTC ATAATACGGG GAGACCTCTC GACGGTCGAC CTGTCCCAGT TTAAGGTAAT CGTCCTCGGG CAGCCGCCTA AGCCCCTCAG CGAGGCGGAG CTAGCCGCCT TAAAGAAGTG GTTCGACTCC GGCGGGAGAG TGCTGTGGTG CGCCGCCGAC TCCGACTACC CAGCCCAGGG CTCGGAGGAG TCGCAAGCCG CCTGCAACGA CGTAGCTGAG TACCTCGGCG CCCACATCAG AGCCGACTAC GTCTCGGTGG AGGACCCCCA GCAGAACGCG GGGGCTGCCT ACAGGGTGGT AGGCGTCGTT GACCCGCCGC CTCAGCTGGC CTTCCTCGGC TTCCTCGCCC AGCGCGTCCT CCTCCACGGC CCCGGGGCAA TCGCGGTGGT GCTGCCCGAC GGGAGGTGGG TCCCCGCCAC AAGCCCAGAG GCGCAGAAGG CCTACGGCAA CATATATGTG ATAGTGAGAA CCACGGAGAA GGGGGCCATA GTTGAGCACA GGACCTCCGC CGACGGCAAA GGCAGAGACG GGAAGGCCCA CAAGGCGGGC GACCGCGGCG TCTTTGCCCT GATGGCCGCC GAGATTATGC CAAGCGGAAG CGTGCTCATC CTCTCCGGCG AGACGCCATA CGGCGGCTAC GAGCCCATGG TCGCCCCGGT CTACTACAGA GTCCAGCTAG ACGGGCCTAG GTTCCTCAGA AACATCCTCC TGTGGGCCAC CGGCAACTAC AGAGAGCTCA CCACGATGGT CTACCAAGCC CGCCAGATGG CACAGATCGC GTCCGACGCC GCCGCGCTGA AGAACACCGC CGCCTCTTTG CAAAACGAGG TCTCCGCCGT TAAAACAGCC GTCTCGCAGG TCTCCGCCAA GGTAGACGCC GTGGGCGGCC AGGTGGCTGA GCTCAGCCAG AAGGTGGACC AGCTCACTCA GCAGCTCAAC GCCGCCGTGG CCGAGGCCAA CAACGCCAAG ACCACCGCCT TCGTCGGCAC AGCCCTAGCC TTGATCTTCG CCATAGCCGC AGCCGCCCTC GCCATACGCA GGAGATGA
|
Protein sequence | MNPKVAAFLL AAALAAAQQL YVPVLVDVSH GEATKGLDLW VNSTANPLAI TDFARLYVLV PPDAKLDPTL TKLNATKAAV IIRGDLSTVD LSQFKVIVLG QPPKPLSEAE LAALKKWFDS GGRVLWCAAD SDYPAQGSEE SQAACNDVAE YLGAHIRADY VSVEDPQQNA GAAYRVVGVV DPPPQLAFLG FLAQRVLLHG PGAIAVVLPD GRWVPATSPE AQKAYGNIYV IVRTTEKGAI VEHRTSADGK GRDGKAHKAG DRGVFALMAA EIMPSGSVLI LSGETPYGGY EPMVAPVYYR VQLDGPRFLR NILLWATGNY RELTTMVYQA RQMAQIASDA AALKNTAASL QNEVSAVKTA VSQVSAKVDA VGGQVAELSQ KVDQLTQQLN AAVAEANNAK TTAFVGTALA LIFAIAAAAL AIRRR
|
| |