Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2405 |
Symbol | |
ID | 6314596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2570043 |
End bp | 2571641 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642644793 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001918558 |
Protein GI | 188587013 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000184044 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAAAT TTAACAAAAA GCCATTTTGG CTAGCAATTG TGCTATCTGG TCTTTTGGTG TTCACCTTGA CAGGCTGTGG AGAGCCTGAT GTTGAGGACG AAGCCGAAGC ACCAGAAGGA GAAGAAGAGG AAGAAATGGA AGAACGACTT GAAGAAGATC ACTTGGTAGT TGCCCAGGGG GCAGATGCAC CAACACTAGA TCCTATTGGT GAAAACGATC AACCTTCTGC AAGAATTACA GAGCAAATTT TTGACACTCT TGTGGAGCAG GACGAGAATA TGGAGGTTCA GCCAGGACTG GCTGAAGACT GGGAACAGAT CGATGATACT ACTTACGAAT TCTATCTTCG CGAAGGCGTT AAATTTCACA ATGGTGAAGA GCTAACAGCT GAAGATGTTA AATATTCATA TCAAAGATTG TTAGATGAAG ACGAAGCTTC CCCGGGAGCT TTTATTTTAG AAATGGTTGA TGTTGATAAT ATTGAAATTG TAGATGATTA TACCGTGCAA ATTCCTCTAG AAGAACCTTT TGCACCAATT CTTTATCATC TTGGTCACTC AGTAACTGCC ATTGTCAATG AAGATGCAGT TGAAGAACAT GGCGATGACT TTGGACAGAA TCCTGTAGGA ACTGGGCCAT TTAAATTTGA CGATTGGGAT ATTGGAAACA GAATTGACCT AGTTAGTTTT GATGACCACT GGAGAGGAGA AGCTGGTGTA GAGCAGCTTT CTTTCAGAAA CATTGAAGAG GACACAAACA GAACTATTGA ATTAGAAACA GGTGGTGCTG ATATTATTTA CGATGTAGCA CCAACAGATC TTGAAAGAGT AGAAGATCAT GAAGAATTAA CTCTTTTAAG AGAACCTAAC TTGTCAACTG AGTATATTGG CTTTAATATT GACAAAGAAC CCTTTGATGA TGAAAGGGTA CGACAAGCAA TTAACTATGC ATTAGATATG GAACCAATTG TCGAAGGAGT ATATTATGGA CTGGGAGAAC CTGCTAGAAG TCCCCTTGCT CCAGCAGTAG TGCACAATAA TCAAGATGTA AAATCTTATG AGCAGGATAT GGAGAGAGCT GAAGAACTAC TGGCTGAAGC TGGCTATGAA GACGGATTTG AAGCAGAAAT CTGGACTAAT GACCAGCAGC AGCGTCAGGA CATCGCTGAA ATGGTACAAG GCCAGTTGTC TCAACTGGGT ATTGACCTAA ACATCAGCAT TCGCGAGTGG GGAACTTATC TTGAAGAAAC TGCCCAAGGA GAGCATGATA TGTTCATACT TGGCTGGGTT TCAGTAACCG GTGATGCCGA TTACGGATTA TACTCCCTAT TCCATGGTGA TGAGCATGGA GCAGCCGGAA ACAGAACTTT CTATGATAAT GACAGAGTAG ACGAACTTCT AGACGAAGGT CGTAGAACTT TCGATGAAGA TGAGCGTGCT GAGATCTATG CGGAAATTCA AGAAATAGTT ACTGAAGAAG CTCCTTGGAT CTTTACTCAA GTGGGTGAAG AAGCAGTTGG TACTAGAGAT TTTGTAGAGA ACTTTACTAT TAATCCTGCA GGCCACCATG ACCTGTTTGA GGTAACTATT GCAGATTAA
|
Protein sequence | MKKFNKKPFW LAIVLSGLLV FTLTGCGEPD VEDEAEAPEG EEEEEMEERL EEDHLVVAQG ADAPTLDPIG ENDQPSARIT EQIFDTLVEQ DENMEVQPGL AEDWEQIDDT TYEFYLREGV KFHNGEELTA EDVKYSYQRL LDEDEASPGA FILEMVDVDN IEIVDDYTVQ IPLEEPFAPI LYHLGHSVTA IVNEDAVEEH GDDFGQNPVG TGPFKFDDWD IGNRIDLVSF DDHWRGEAGV EQLSFRNIEE DTNRTIELET GGADIIYDVA PTDLERVEDH EELTLLREPN LSTEYIGFNI DKEPFDDERV RQAINYALDM EPIVEGVYYG LGEPARSPLA PAVVHNNQDV KSYEQDMERA EELLAEAGYE DGFEAEIWTN DQQQRQDIAE MVQGQLSQLG IDLNISIREW GTYLEETAQG EHDMFILGWV SVTGDADYGL YSLFHGDEHG AAGNRTFYDN DRVDELLDEG RRTFDEDERA EIYAEIQEIV TEEAPWIFTQ VGEEAVGTRD FVENFTINPA GHHDLFEVTI AD
|
| |