Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0477 |
Symbol | |
ID | 6315540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 507157 |
End bp | 508878 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642642861 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001916661 |
Protein GI | 188585116 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGCA GAAAGTTATT TGTATTATTT CTGGCATTGA TGTTTGCTAC TGCTGTAGCC ATCGGTGGCT GTGCACCTCC AGAAGAAGCT AAAGAGGAGG ATCCAGAAGC AGAGGAAGAA GAAGAGGCTC CAGCAGAAGT AGATAATCCT GCTGTAGAAA GACCTAATGA AATTGTAATT GGAGGTCCAG ATCTAGAAGG AATTTTCAAT CCAGTTTTAT ACAGTGGTGT TTATGACTCC TGGGTTTTAG GTATGATTTT TGATTCACTA CTTACAGTAG ATGAAAATGG ACAGTTAACA ACTGATCAGA GATCAGTTGC CAAAGATTAT GAAATCTCAG AAGATGGCAC TGTATACACT TTCCACTTAC GTGAAGGATG GGAATTTCAC GATGGAGAAG AGATAACTGC CGAAGATGTG GCCTTTTCTT TAGAAGTTAC TGCTCACCCT GATTACGACG GACCACGCTC CAGCTTTTCC GACGATATAG TGGGTGTAGA TGAATTTCGA GAAGGAGAAA CTGACGAGCT AGAAGGAATT ACAGTAGAAG ATGATTATAC CTTGGTAGTT GAAGCTCAAG AACCTAGTGC TGATAACATA TTTGATTTTG CAGTAATGAT AATGCCAAAA CATTATTATG ATTTTGAAAA CTATGAAGAC TTCCAGGATT TAACAGACGA CCCAATGGGA AGTGGTCCCT TCGAACTTGT AGAATACAGT CCGGATCAAC ATGCTATCTT AGAATCCTTT GAAGACTATT ACCATGGTGC ACCAAATGTG GATCGAATAA TCTACGAAGA AACTGAAACT GAACAACAAA TTCCAATGGT TGAAACAGGA GAAGCTGATA TTGTCCAGGT AAGTTCAACT CCAGAAAATA TGGAAATGTT AGAAGATATA GACCATCAGG AAGCATTAAC TTTCTTTGAT AATGCCTATA CTTATATTGG TTTGAACCAC GAAGATGAAC ACTTGCAGCA TCAAGAGGTT AGGCAGGCAT TAGCTTATGC CCTTGATATC GAAGCTTTCC TTGATGGAAT GTTTGGAGAT GAATTGGCTC AACCGATATA CACTCCGTTT TCACCCGTAT CATGGGCATA TCCCGATGAA GACGATATGA ACGATTATGC TTATGACCCT GATAAGGCCA ATGAACTACT TGATGAAGCG GGTTATGAGT GGGATGACAA TGAAGAGTAT CGAGTAAATG AAGACGGAGA GAGACTATCC TTTACATGGG AAACCATTGC TGACAACGAA TGGTCCGAAC ATTTAACTAC TCTAGCCTTA GAACAATGGC CACAAATAGG AGTAGAACTA GAAATCGAAA ACTACGATTT CAATACCTTG ACCGACAGAA TTGATCAGGA CCGCGGAGAT GTAGATATGT GGAATATGGG TTGGTCACTA TCAGCCGAAC CCGATCCATC TAATATCTTT AGTGTAGAAT ATGCCGATGC TGGCTTAAAC TACGGAATGT ATCACAACGA AGAAGCAGAA GAATTGATGC AAGAAGGTCT TGAAACCTTT GATCAAGATG AAAGAGCAGA AGTTTACAAT GAACTAGGTG TACTATTTAA TGAAGATCTA CCCTATATCT TTGTCTACAG TAACAAAGAG ATTTGGTCCA CAAATGACCG AGTTGAAAAC TATGAACCAA CTGCATTCCA ACACTTGACA TGGAATATCC ACGAGTGGGA AGTAACAGAC TACGAAGAAT AA
|
Protein sequence | MSSRKLFVLF LALMFATAVA IGGCAPPEEA KEEDPEAEEE EEAPAEVDNP AVERPNEIVI GGPDLEGIFN PVLYSGVYDS WVLGMIFDSL LTVDENGQLT TDQRSVAKDY EISEDGTVYT FHLREGWEFH DGEEITAEDV AFSLEVTAHP DYDGPRSSFS DDIVGVDEFR EGETDELEGI TVEDDYTLVV EAQEPSADNI FDFAVMIMPK HYYDFENYED FQDLTDDPMG SGPFELVEYS PDQHAILESF EDYYHGAPNV DRIIYEETET EQQIPMVETG EADIVQVSST PENMEMLEDI DHQEALTFFD NAYTYIGLNH EDEHLQHQEV RQALAYALDI EAFLDGMFGD ELAQPIYTPF SPVSWAYPDE DDMNDYAYDP DKANELLDEA GYEWDDNEEY RVNEDGERLS FTWETIADNE WSEHLTTLAL EQWPQIGVEL EIENYDFNTL TDRIDQDRGD VDMWNMGWSL SAEPDPSNIF SVEYADAGLN YGMYHNEEAE ELMQEGLETF DQDERAEVYN ELGVLFNEDL PYIFVYSNKE IWSTNDRVEN YEPTAFQHLT WNIHEWEVTD YEE
|
| |