Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0728 |
Symbol | |
ID | 6315699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 753660 |
End bp | 756380 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642643106 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001916906 |
Protein GI | 188585361 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATAA AAATCAGCAC AGTTTTATTA GTATTTGTTT TGCTTTTATT ATTTCCAGGG TTGTCATCGG AAGTTTTAAC TGAAATTTCT TTCAATGGCA GTAATGAGCA AACTAAAGAA AAAAGCCAAG AGCAAGTAAA GGTGAGAGGT TTCTCAGACT ATTACCAATT GGACACCGCT ACCAAAACCG AAAATGGCAT CCGTTACATT CCTATGAGAA CATTGATGTC AGAGATGGAT ATGTATATTA CATGGCAGGA TGACGATCAA AGTATCTTTG CAGAAAGAGA AGACCTGGAA CTTGAGACCA AGATAGGAGA ACAACATATT AGAGTTAATA ATCAACTTAT CACTCTTCAA AATGAAATTC GCTTAATTGA TAATAAAAGT TATCTTCCCA TGGAGTTTCT AGAGCAAATA CATACAGAAG TTTCCTTGAA CGAAGAAGAT CAAATTATTG ATTATACGCC CATGAGTGCC TTTCCAGTAG TTCAAAATGA TAGATACGGA TTGATTGACG AAGAGGGGAA TAAATTATTA GATAAAGAAT TCAGCAATAT TCGAAAGCAA CGAGCCACTG GCGATTCAGA CTATCGTGCC AACTTTTTCA TGGTAACAGA TCAAGACGAA AATAAAGGCA TCTGGCACCC AATTACCCAG GACTTTTTAA TTGAACCCCA GTACGATCGA ATTCATCCAT TATTAGAGGG TCTGTTCCTT GTAAGAGAGG AGGACAAGAA TGGTTTAGTT AATTTAAAAG GGGAGACCGT TCTTGATACG AATTACGATT CGCTCTATTC CTTTGAAGAA GGCTTAGCTT TAGTGGAAAA GGATGATAAA TACGGTTATG TAGATCAATC CGGAAAAGAG GTTATCGATC TGCAATTTTC TGAAGTTAGT AGATTTGAAG GTGGAAAAGC TCCCGTTGTA AAGGATGGGA ACTACGGCGT TATAGATAAA TCAGGGGAAT GGTTAATAGA ACCGGAATAT TCTAAGGAAA AATATGAGCT ATCTCATCTT ACAGATGATC TATTGAGAAT TGCCAAAAAG GTGACAAACG ATGAAACTGT TAGGGAAATG AAGTACGGTA TCATGGATCT TTCCGGAGAA ATAATCATAG AACCTCAGTT TGATAGAATA GGCGATTTTT ATAATGGCCT TGCTTTTGTG GTCGATGATG AGGACTTTGG TTATATCGAT AAAAATGGTG AAATAGTTAT CGAACCTCAA TTTGAAAATG CTTATAATTT CAATACTGAT ATCGATGGAG AAGCTATTAG TGTTGTTATC GAAGACGGAC AGAGAGGTGT TATCAACCAA GAAGGAGAAT TCATAATTGA ACCTGTATTT GACCGGATAG CAAATGTTGA AGACGCCGAT GTGTTTATTA TCGAACAGGA TGATGAATTT GGTGTTATTG ATAAAAAAGG AAACAAGATA ATAGAAGCCA AGTACCATCG AATTCATGAC CTATATTCTT ATAGCTACGA TTTAGATGAA CATTATTTTG GAGTCCAAGT CGATGATAAA ATTGGCCTTT ATCATAAATC AGGTGAGCAA ATCACTGAGT TGATTTTTAA TGATATCAAC GATCCCAGGG ATGGATATAT TTCTGTTGAA CAGAATGATA ATTGGGGAGT TTTTGACCTG GAAGCTGGAG ATTTTGTAAT AGAACCTAAG TATGATAGTA TAGGGGAGTT TTCTCAAGGC CTTGCCTCGG TTGAACTTGA TAGACAATGG GGCTTTATTG ACAAAGGAGG TAATGTGGTC ATTGAACCAC AATTTGATTC AGTCCGTGAC TTTCGAGATG GCCTGGCCAG GATTAAAATA AACAGTGAAG TAGGTTATAT TGATACCGAA GGTGAATTTG TGTACGAACC TCCAGAGCCC CCAGAACCTG AAAAAACAAT TAAAATTGGG ACTACTCCCT GGCTTGATAC CAAATTGATG AGTCATATTT TCCATGAAGC CCTTAAGGTA AAAGGATATG ATAGTGAAGT CGTAGTAAGT GATATAGGAG TAATATTTGG TGATATTGCT CAGGACGATT CTCAAAGAAT GGACTTCACC TTTTCCATTT ACTTGCCTCA CAACCATGAA CATTATTACG AAGAGTATCA AGATGATATA GATAAAGTGG GAGAGGCTTT AACTGGTATA GATCAAGGAA TAGCAGTACC CGAGTATGTT TCCATTGATT CCATAGATGA TCTAAAGGGA AAAGAAGAGA AGTTTAATAG TAAGATATAC GGCATTGATG CTGGTGCCGG AATAAGCACT GAGACAGAAG AAGTTTTAAA AAAATACAAC TTGGATATGG AGCTAATTAC TAGAGGAGAT TATAAAATTG ATGATGACGA ATACGAAGAG TATGATTTAC ATGATTTAAT GCTAAAAAAT TTAGATGAAA AGTATCAAGC CGAAGACCCA GTAGCCATTA CCGGTTGGAA ACCCTATTAT AAATGGGCGA AATGGGACTT GAAGATGCTG GAAGATTCCA AGGAAATATA TACAACAGCT GATGCCTCTT ATACATTTAG TAGAAAGGGA TTGGAAGAAG ACCTTCCTGA AATTTATGAA GCCGTTACTA ACTTTGAGAT GAGCATGGAA GAAATCAATT CATTGCTTCT AAAAAAGGAG AAAGAAGATT TGGATTTTGA GGAAATAGCC AAAAAGTATA TTGACGAAAA TGAAGAGATG GTGGAAAGCT TGTTTGAGTA G
|
Protein sequence | MGIKISTVLL VFVLLLLFPG LSSEVLTEIS FNGSNEQTKE KSQEQVKVRG FSDYYQLDTA TKTENGIRYI PMRTLMSEMD MYITWQDDDQ SIFAEREDLE LETKIGEQHI RVNNQLITLQ NEIRLIDNKS YLPMEFLEQI HTEVSLNEED QIIDYTPMSA FPVVQNDRYG LIDEEGNKLL DKEFSNIRKQ RATGDSDYRA NFFMVTDQDE NKGIWHPITQ DFLIEPQYDR IHPLLEGLFL VREEDKNGLV NLKGETVLDT NYDSLYSFEE GLALVEKDDK YGYVDQSGKE VIDLQFSEVS RFEGGKAPVV KDGNYGVIDK SGEWLIEPEY SKEKYELSHL TDDLLRIAKK VTNDETVREM KYGIMDLSGE IIIEPQFDRI GDFYNGLAFV VDDEDFGYID KNGEIVIEPQ FENAYNFNTD IDGEAISVVI EDGQRGVINQ EGEFIIEPVF DRIANVEDAD VFIIEQDDEF GVIDKKGNKI IEAKYHRIHD LYSYSYDLDE HYFGVQVDDK IGLYHKSGEQ ITELIFNDIN DPRDGYISVE QNDNWGVFDL EAGDFVIEPK YDSIGEFSQG LASVELDRQW GFIDKGGNVV IEPQFDSVRD FRDGLARIKI NSEVGYIDTE GEFVYEPPEP PEPEKTIKIG TTPWLDTKLM SHIFHEALKV KGYDSEVVVS DIGVIFGDIA QDDSQRMDFT FSIYLPHNHE HYYEEYQDDI DKVGEALTGI DQGIAVPEYV SIDSIDDLKG KEEKFNSKIY GIDAGAGIST ETEEVLKKYN LDMELITRGD YKIDDDEYEE YDLHDLMLKN LDEKYQAEDP VAITGWKPYY KWAKWDLKML EDSKEIYTTA DASYTFSRKG LEEDLPEIYE AVTNFEMSME EINSLLLKKE KEDLDFEEIA KKYIDENEEM VESLFE
|
| |