Gene Information |
Locus tag | Tneu_0139 |
Symbol | |
ID | 6164664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 127460 |
End bp | 128584 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641667305 |
Product | extracellular solute-binding protein |
Protein accession | YP_001793542 |
Protein GI | 171184623 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
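The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Values taken from the record above.
length = gene_length_bp(127460, 128584)
print(length, "bp")                      # compare with the "Gene Length" field
print(protein_length_aa(length), "aa")   # compare with the "Protein Length" field
```

With the values from this record, the two results agree with the listed Gene Length (1125 bp) and Protein Length (374 aa).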
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0201839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAACTA GGAGGAAGCT CCTAGCCGGC GCGGTAGCCG CGGCGGTTGT AGCCGCCTTG GGAGGTAGCG TTCTGCTTTC GAGGGGGCCG GAGAAGAGGG CGGCAACACT CGGCGGACAG CTAGCCGTCT ACAACTACTC CTACTACATA GACAAGGACC TGCTCACGGA GTTCGAGAGG GAGACCGGCG TTAAGGTGAT CTACCAGGAG TTCGAGAGCG GGGAGGAGGC CTACGCCGCC CTACTCAGAG GCGGGGGCGG GTACGACCTA GTAGTGGTGC CGGATATGTA CCTAAAGGAG GTGATCAAGG GGGGCTACGT GAGGAAGATG GACCACGGCA GACTAGCCAA CATCAACAAC ATAGACCAGG CCTTCTTCGA CAACCCGAAC GACCCAGGCC TCCAGCACTC TATTCCATAC GCCTTCGGCA CCACGGGCTT CGCCGTCAAC TACCACGCGA TGGCCGTAGA GGCCGGCAGA AAACTCGAGA GCTGGGGCGA CCTCTTCGAC TTCGGTCTCC TGGAGAAGAT GAGAAACAAG GTGGCTATGT TGGAGGAGTT CGTGGAGCCC GTCATGGCGG CTAAATACGC CCTGGGAATA GACCCAAACG ACTGGAGCCA AGCGGCTGTG GACAAGGTCG CAGAGCTTCT GAAGAAGCAG AAGGGCTACA TAAGAGGCTA CATGGGGGTC AGCCAGATCG TTCCGGCTAT AGCCGCCGGC GAGCTGTGGG TTTCACAGAT CTGGAGCGGA GACGCAGCCA CGGCACGCGA CGAGTTTATC AAACACGCCG GTGAGAAAAA CGCCGATAAG TTCGAGTACG TGTTGCCAAA GCCAATGACG CACAGATGGG TCGACTTCAT GGTGATCCCC CGCGACGCGA AAAACATCGA CGCGGCATAC GCCTTCATTG ACTTCCTGCT TAGGCCTGAG AACTCTGCTA GAATCACCAA GGCGTCTTAC TACCCAACAG CGCTGAAGAG ACAGCTACTC GAAAAGCACC TCAGCCCCGA CATATTACAG GACCCCACGG TCTTCCCGCC TGAGGGAGCC AAACTCATCT ATCTCAACTA CACAGACGAG ATGATTAAGG CCGTGGAGAA GATCAGCTAC GCCGTCAAAG GCTAG
|
Protein sequence | MVTRRKLLAG AVAAAVVAAL GGSVLLSRGP EKRAATLGGQ LAVYNYSYYI DKDLLTEFER ETGVKVIYQE FESGEEAYAA LLRGGGGYDL VVVPDMYLKE VIKGGYVRKM DHGRLANINN IDQAFFDNPN DPGLQHSIPY AFGTTGFAVN YHAMAVEAGR KLESWGDLFD FGLLEKMRNK VAMLEEFVEP VMAAKYALGI DPNDWSQAAV DKVAELLKKQ KGYIRGYMGV SQIVPAIAAG ELWVSQIWSG DAATARDEFI KHAGEKNADK FEYVLPKPMT HRWVDFMVIP RDAKNIDAAY AFIDFLLRPE NSARITKASY YPTALKRQLL EKHLSPDILQ DPTVFPPEGA KLIYLNYTDE MIKAVEKISY AVKG
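The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments used by translation table 11, ignores the alternative start codons that table permits, and strips the spaces that separate the 10-character blocks. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
prefix = "ATGGTAACTA GGAGGAAGCT CCTAGCCGGC"
print(translate(prefix))    # compare with the first block of the protein sequence
print(gc_fraction(prefix))  # GC fraction of this prefix only, not of the whole gene
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields; only a short prefix is used here to keep the example readable.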