Gene Information |
Locus tag | Nther_2798 |
Symbol | |
ID | 6314465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 3022756 |
End bp | 3024429 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642645170 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001918934 |
Protein GI | 188587389 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
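The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Nther_2798.
length_bp = gene_length_bp(3022756, 3024429)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1674 557
```

Both results agree with the Gene Length (1674 bp) and Protein Length (557 aa) fields.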
| |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGAAAGC TGTTAATTTT ACTGCTCGTA GTTTTTGTGG CGTCTTTTGG ATTTATGGGT TGTGATCCGG CTGATCCTGA AGAAATTGCT GAAGAAGAAG AAAAAGAGCC TGAAGAAGAT GTTGACGAAG AAGAGGAAGA GCTAGAACAG GCTATTACTC ATGCTATATG GAGTGAACCA GACGGACTAT TTAATTATTG TGAATATGAA AGTACTTATG ATCTTGATGC TTTTAATCAA GTATTTGATG GCATGATGGA AGCAGATCCC CATGCAGATT TGGAACTAAA TCCAAATTTA GCAGAAGAAC ATGAAATAAG TGATGATGGA AAAACTTTTT ATTACAAACT AAGAGATGAT ATTGAATTCC ATGATGGTGA GCCACTAACT GCAGAAGATG TTAAATTCAC TTTCGAATGG ATGTGTCATG AAGACTATAT AGGTCCCAGA GCTAGTTACT GGCAGCACCT GGAAGGGTTT GACGAATATC GAGCTGGAGA AGCTGACGAA GTAGAGGGAA TTGAAATTAT TAGCGACCAT GAAATAGAAT TCCACTTTGA ACAAGTTGAT GCTTCTGCCG TATATTATGT TAGTACCTGG GGCATTTCAC CTAAACATGT ATGGGAAGAT ATCCCTGTAG GTGAAAGAAG AGAAGCCCCT GAAATGACAG AACCAATTGG AACAGGACCT TTTGAATTTG AGGAATACGT TGAAGGTCAA TACGTAGAAT TAGTTGCTAA TGAAGATTAT CACAGGGGAG AACCAGAATT AGAGAGAATT ACTGTAGAAG TAAAAAGTCC CGATGTAGTC AGGGCAGACT TGGAGACTGG TGAAGTTGAT ATCGCTGAAG TTCCACCTGA TGAGAGCGAG TGGGATGATT ATGAAGCTCA CGATGAGTTA GCCCTTGAAT ATTATCCCAC TAATGGTTAT CAATACATGG GTATGAACTT AAGGGATGAA AGTATCTTTA GTGATCATGC TGTAAGAGAG GCAGCTACTT ACGCTATCAA TAGGGAAGGT ATGGTTGATG GTATCCTAGA TGGTTTAGGT AAAGTCCAAA ATTCCCACTT TTCACCTAAC CAATGGGCTT ATGACGAAGA TTTAGATACT TATCCCCATG ATCCTGATAA AGCCGAAGAA ATATTAGAAG ATGCTGGCTA TACTAAAAAC GATGATGGAA TCTGGGAACA AAATGGTGAG CCCTTAGAAT TCACTCTATT ATATCCTACC GGCGACGAAC CCAGAGAACA AGCAGCTTTA ATCATTGAAC AGGACTTACA GGGAATTGGT ATTGACATTA CTTTAGAATC TTTAGAATTT GCCACTTTGA GTGACAGGGT TTTTGACGAA ATTGATTTCG ATGCTTATCT TATGGGTTGG TCTTTAGGTG CAGACCCAGA TCCAACGGGT ATCTGGGGAC CTGATGAAAG ATTCAATGCT GTTGGATTTG TACATCCGGA AAGCGACGAA CTAATGGAGC AGGGTCTTCG TACTACAGAT AGGGATGAAC GAAGAGAACA TTATGTAGAA TGGCAGCGCT TACTTCAAGA AGAGATGCCT TATGTATTCT TGTACGCTGA CTATGAAGGT TATGCCTACA ACAATGATAT CCAGGTATTT GAACCAAATC CTTGGAATAT TTGGTATGAC GTACATGAAT GGTACCTTGA ATAA
Protein sequence | MRKLLILLLV VFVASFGFMG CDPADPEEIA EEEEKEPEED VDEEEEELEQ AITHAIWSEP DGLFNYCEYE STYDLDAFNQ VFDGMMEADP HADLELNPNL AEEHEISDDG KTFYYKLRDD IEFHDGEPLT AEDVKFTFEW MCHEDYIGPR ASYWQHLEGF DEYRAGEADE VEGIEIISDH EIEFHFEQVD ASAVYYVSTW GISPKHVWED IPVGERREAP EMTEPIGTGP FEFEEYVEGQ YVELVANEDY HRGEPELERI TVEVKSPDVV RADLETGEVD IAEVPPDESE WDDYEAHDEL ALEYYPTNGY QYMGMNLRDE SIFSDHAVRE AATYAINREG MVDGILDGLG KVQNSHFSPN QWAYDEDLDT YPHDPDKAEE ILEDAGYTKN DDGIWEQNGE PLEFTLLYPT GDEPREQAAL IIEQDLQGIG IDITLESLEF ATLSDRVFDE IDFDAYLMGW SLGADPDPTG IWGPDERFNA VGFVHPESDE LMEQGLRTTD RDERREHYVE WQRLLQEEMP YVFLYADYEG YAYNNDIQVF EPNPWNIWYD VHEWYLE
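The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch of that step and of a GC-fraction calculation. It uses only the first three blocks of the gene sequence as sample input, and the helper names are hypothetical. The set of start codons lists only the three most common initiators of translation table 11, not the full set.

```python
# Common initiator codons under translation table 11 (not the full list).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence, as printed in the record.
sample = "TTGCGAAAGC TGTTAATTTT ACTGCTCGTA"
sample_seq = clean_sequence(sample)

print(len(sample_seq))                        # number of bases after cleaning
print(round(gc_fraction(sample_seq), 3))      # GC fraction of the sample only
print(sample_seq[:3] in COMMON_START_CODONS)  # gene begins with TTG
```

Passing the full gene sequence through the same functions should give a value comparable to the GC content field. The gene begins with TTG rather than ATG. Under translation table 11, TTG is a permitted start codon and is read as methionine at the initiation position, which is consistent with the protein sequence beginning with M.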