Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1596 |
Symbol | |
ID | 6315775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1671840 |
End bp | 1673558 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642643967 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001917758 |
Protein GI | 188586213 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0368881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 7.93881e-23 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCAAGTC GAAAAATATT TGTTTTGTTT ATTGCTTTTC TCCTTACAAT GGGAGTGGCT TTGAGTGCTT GTGCACCTCC AGAAGAGGCT AAAGATGAAG AAGTGGAAGA AAAAGAAGAC GAAGAACCCG TAGTTGAAAA CCCTGCAGTT GAACGTCCTA ATGAATTAAT TATCGGAGGA ACTGATTTAG ATGGAATTTT TAATCCAGTT CTTTACTCTA CGGCATATGA TGCTTGGGTT ATAGGAATGA TGTATGATAC ATTGCTTACA GTAGACGAAA ATGGTGAATT AACGACGGAT CAGAGATCCA TAGCTAAAGA CTATGAAATT TCCGACGATG GTTTAGAATA CACTTTTTAT CTTAGAGAAG GATGGAAATT TCACGATGGA GTAGAAGTAA CTGCGGAAGA TGTTGCATTT ACCCTTGAAG TAACTGCTCA TCCAGATTAC GATGGGCCAC GTGCCAGCTG GTCAGATAAT ATTGTAGGTG TTGATGAATA CAGAGCAGGA GAAACTGATG AATTAGAAGG TGTTATTGTT GAAGATGATT ATACTCTCAC TGTAAAATCC CAAGAACCCG ATGCCGGTGA TATATTTGAT TACTCAACTT ATGCTCTTCC AAAGCATTAT TATGAATTTG ATGACTACGA AGAAATTCAC GATTTAACAA ATGATCCTAT GGGAAGTGGA CCTTTTCAAT TAGTAGAGTA TAGCCCGGAT GAACACGCTA TTTTAGAACC TTTCGAGGAT TATTATCATG GTGAACCCGA GTTAGATCGT ATAATTTACG AAGAAATTGA AACGGAACAG CAGATCCCTA TGGTTGAAAC GGCAGAAGCA GATATTGTGG CAGTATCATC AACTCCCGAA AATTATGAGA TGCTACAAGA AGAAGATCAT CAAGAAACAA TTACTTTCTT AGATAACTCT TATTCCTATA TCGGACTTAA CCATCAAAAT GAACATTTAC AACATCAAGA AGTCAGGCAA GCCTTGGCTT ATGCCATAGA TATTGAATCA TTCATTGAAG GAATGTACGG GGAAGAACTG GCAAGACCTA TGGCAACGCC ATTTTCACCT GTATCTTGGG CTTATCCTAA TGAAGACCAA TTAAATTTTT ATGAATATGA TCCGGATAAA GCCAATGAAT TGCTTGAGGA AGCAGGATAT GAATGGGATG AAAATGAAGA ATATCGCTAC AATGAAGATG GTGAACGCTT AAGTATTGTG TGGGAAACCT TAGCAGATAA TGAATGGTCT GAGCATTTAA CAACCCTTGC CTTAGAACAG TGGCCACAAA TAGGTGTGGA TTTGGAACTA GAGTCGTATG AATTTAATAC TTTAGTGGAT AGAGTAAATG TAGAAAAACG TGGAGAAGTA GATATGTGGA ATATGGCATG GAGTTTAGCT ACTGACCCAG ATCCTAGCAA TATATTCAGT GTTCAATATG CAGATGACGG ATGGAATATG GGCTACTATC ACAATGAAGA GGCAGAAGAG TTAATGGAGG AAGGGATTAG AACCTTTGAT CAGGATGATC GAGCAGAAGT ATATAATGAA TTAGCTTTAT TATTAAATAA AGATTTACCA TATATCTTCG TTTACAGCTC CAAAGATTTA TGGTCTGTTA ACAATAGAGT AGAAAACTTC GAACCTTCAG CTTGGCAAGC ATTTTCGTGG AACATTCATG AATGGGAGAT TACCGAATAT CAAGAATAG
|
Protein sequence | MSSRKIFVLF IAFLLTMGVA LSACAPPEEA KDEEVEEKED EEPVVENPAV ERPNELIIGG TDLDGIFNPV LYSTAYDAWV IGMMYDTLLT VDENGELTTD QRSIAKDYEI SDDGLEYTFY LREGWKFHDG VEVTAEDVAF TLEVTAHPDY DGPRASWSDN IVGVDEYRAG ETDELEGVIV EDDYTLTVKS QEPDAGDIFD YSTYALPKHY YEFDDYEEIH DLTNDPMGSG PFQLVEYSPD EHAILEPFED YYHGEPELDR IIYEEIETEQ QIPMVETAEA DIVAVSSTPE NYEMLQEEDH QETITFLDNS YSYIGLNHQN EHLQHQEVRQ ALAYAIDIES FIEGMYGEEL ARPMATPFSP VSWAYPNEDQ LNFYEYDPDK ANELLEEAGY EWDENEEYRY NEDGERLSIV WETLADNEWS EHLTTLALEQ WPQIGVDLEL ESYEFNTLVD RVNVEKRGEV DMWNMAWSLA TDPDPSNIFS VQYADDGWNM GYYHNEEAEE LMEEGIRTFD QDDRAEVYNE LALLLNKDLP YIFVYSSKDL WSVNNRVENF EPSAWQAFSW NIHEWEITEY QE
|
| |