Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1888 |
Symbol | |
ID | 6164941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1663334 |
End bp | 1664314 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641669050 |
Product | periplasmic solute binding protein |
Protein accession | YP_001795249 |
Protein GI | 171186330 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.447196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGA AAACTCTCTA CCTGGCGGCC GCCCTGCTGG CCATCGCGGC GGTTGCCCAG ACGACGACCA AGATAGTGGT ATCGTTTCCC GCATACAACG TCGTGCTCAG CGAGGCTTTT CCAAACGCAG ACGTCGTCCT CATCACAAAG GGAGCATCGG ACCCACATGA GTACCAGCTG ACGGCTGAAG ATCTGCAGAT GCTCAGTAGC CTCACCCCAC GCGACGTCGT CGTCCTCTCC ATGCACGCCC CCTTTGAGCA GAAGATAGCC GAGATGGCTA GAGACGGCCA GATAAAGGCC AAGGTCATAG ACCTCACAAA AATCCAGCAG TATCTAACAT ACGACAACGG AGCGGTGAAC CCCCACGACC ACGGCATATA CCCGCCCAAC GTCCTCAGGC TGGTAGCCGC CGTGGCAAAC GCCACCGGGC TAAGGCCGGA CCTGGCCTTC CTACAGAAGC TACGGGAGCT CAACTCGACA TACTGCTGTA GATTCAGTGG AAAAGCCGTG GCCCTGACGC CAACCGCGCA GTATATACTC TACTGGCTTG GATACAGAGA CATAGCCGTC CTCATCAAAG AGCCAGGCGT GCCACCCACC CCGCAAGACC TCCAGAAGGC CCTCCAATAC GCAAAAGAGG GCGCCCCAGT CCTAGCCGCC GTAGTGCGCG GAGAAGCTCT ACGCATAGTA GACCAGTTTA GACAGAAGGC CCAGGAAGCC GGAATAAACC CAAACATAAT CACGGCAGAC TTCTCCAAAA ACTACATCCA GACCCTAGAA GCCGCGGTCA GACAAATAGC GGCCGCCCAA ACGCCCACGG CCACCGAAAC CACAGCCAAA CAAACAACGC AACAGAACAC AGAGGCTACC CAGACGGCGC ACACAGCCGC CGGCCCAGAG CCGACTATCC CAATCGCAGT AGCCGCCACA GCCATCGTGC TACTCGCAGT TCTACTCCTC CTCAGGTGGA AAAAACAATA G
|
Protein sequence | MAAKTLYLAA ALLAIAAVAQ TTTKIVVSFP AYNVVLSEAF PNADVVLITK GASDPHEYQL TAEDLQMLSS LTPRDVVVLS MHAPFEQKIA EMARDGQIKA KVIDLTKIQQ YLTYDNGAVN PHDHGIYPPN VLRLVAAVAN ATGLRPDLAF LQKLRELNST YCCRFSGKAV ALTPTAQYIL YWLGYRDIAV LIKEPGVPPT PQDLQKALQY AKEGAPVLAA VVRGEALRIV DQFRQKAQEA GINPNIITAD FSKNYIQTLE AAVRQIAAAQ TPTATETTAK QTTQQNTEAT QTAHTAAGPE PTIPIAVAAT AIVLLAVLLL LRWKKQ
|
| |