Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2201 |
Symbol | |
ID | 6314873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2337209 |
End bp | 2338912 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642644589 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001918355 |
Protein GI | 188586810 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAATG CAGGACAATT TTTTAAGAAA TTTTCAGTAC TCATGGCAGT TTTAATAGTT ACTACTGCGT TAGTAATCGG ATGTGGAGAC GGTGGCGATC CAGACGCATC TGATGATTTG GACCCAGATA GAAAGGTTCC AGAAGTAAGA TTTGTTTCAT CCACTGCTGA TGATAACCAG ATAAGAAATG AAGCAGTACA GCTAGTTGCT GACTGGTGGG AAGAAATCGG TTTAGAAGTA GATATTCAAA CCAGAGAGTT TAACTCTCTA GTCAACAGAG TATTGGCTGC CCCCGAAGAT AAGGATTTCG AAGCATATAT TCTAGGTTGG AGTGGTAGAG TATCAAGATC AGACCCTGAT ATGTTCCTTT ATTCCTTATA TCATTCCAAT CAAGCTGTAG ATGGCGGTAA CAACAGTAGT GTTTTCAAAA ATGAAAAGTA TGATGAACTT GTTTCGAAAC AAAGAGCGGA AATGGATCTA GAAAAGCGAC AAGAACTCGT CTTTGAGGCC CAGGAAGTCC TGGCAGAAGA AGTTCCTGAT ATCACCCTAT ATTACAGAGA CGAAATTCAA GGTTATAACA ATGAGCGATG GGAAGATCTA CCAAGTATGG CGGGAGAAGG TATCTTTAAC GAACAGTTCC CCTATGAGGC TACTCCCAAA ACAGATGATG CAGAGTTTGT AATAGCTAAC TCCGCAAACT TGGATACATT TAACCCCTTT GCGGCAGAGA CAGTTTACGA ATGGAAATTT TTACGACTTG TTTACGACAA ATTAGTTAGA TTAGATGAGA ATTTTGAGCC CCAACCCTGG GCAGCAGAAG AAATCAACGT AGTAGAAGGC GAAGACGGTG AAGTAATAGA TGTAACCTTG AGAGATGATT TACAGTTCCA CGATGGAGAA CCACTTGGGC CAGAAGATGT TGTCTTTACT TTCGACTACA TGTTTGAAGA AGGCATACCT TATTTCCAAG CCTTTTTAGA CCCAATTGAA ACAGTAGACC TCATGGACGA TGATGAAACT ATCAGATTTA CTTTAGAAGA AGCTTATGCA CCCTTTATAA CCAATACTTT AGGTCAAATT CCTATCTTAC CAGAACACAT CTGGGCTGAT GTTATGGAAG AAGAAAATTT AGATCATCCT TCTCAGTTCG ACAATGCCGA AGCTATCGGA AGCGGTCCTT TCAAATTTGA CAATTGGGAA AGAGGCGAGT ACATCAGAAT TGTTAAAAAC GATGATTACT TCAAAGCTGA TGATATCGAT GTGGAAGCTA TCAGATACGA CAAGTACAGC CATTCTGAAG GAGTTTTCGG TGCCCTGGAG AACCAACAGG CCGATGTAAA CGAAAACACT TTTGACCCTG AATACGTTCA ACAAGCAGAA GATCTTGATC ATTTAACAGT AGTTAGAGAA CCCGATATCG GCTTTGACTT CATTGGCCTC AATAACTACA AAGAGCCTTT TAACGATAAA GCCGTAAGAC AAGCCGCTGC TCATGCCATC GATTTAGATG AGCTTGTAGA CGTTCTCTTG TACGGTTATG GAGATCCAGC AGGTGCAGGT CAGACTATTT CAACAGGTAA TGAAATGTGG AAAAATGACG ATGTAAAAGA ATATCCTTTT GATATCGATA AAGCAAGAGA AATCTTAAAA GATGCCGGCT ATGAATGGGA TAGTGACGGA AGATTATATT TCCCTGAAGA GTAA
|
Protein sequence | MENAGQFFKK FSVLMAVLIV TTALVIGCGD GGDPDASDDL DPDRKVPEVR FVSSTADDNQ IRNEAVQLVA DWWEEIGLEV DIQTREFNSL VNRVLAAPED KDFEAYILGW SGRVSRSDPD MFLYSLYHSN QAVDGGNNSS VFKNEKYDEL VSKQRAEMDL EKRQELVFEA QEVLAEEVPD ITLYYRDEIQ GYNNERWEDL PSMAGEGIFN EQFPYEATPK TDDAEFVIAN SANLDTFNPF AAETVYEWKF LRLVYDKLVR LDENFEPQPW AAEEINVVEG EDGEVIDVTL RDDLQFHDGE PLGPEDVVFT FDYMFEEGIP YFQAFLDPIE TVDLMDDDET IRFTLEEAYA PFITNTLGQI PILPEHIWAD VMEEENLDHP SQFDNAEAIG SGPFKFDNWE RGEYIRIVKN DDYFKADDID VEAIRYDKYS HSEGVFGALE NQQADVNENT FDPEYVQQAE DLDHLTVVRE PDIGFDFIGL NNYKEPFNDK AVRQAAAHAI DLDELVDVLL YGYGDPAGAG QTISTGNEMW KNDDVKEYPF DIDKAREILK DAGYEWDSDG RLYFPEE
|
| |