Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2053 |
Symbol | |
ID | 6315571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2169997 |
End bp | 2170941 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642644441 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001918208 |
Protein GI | 188586663 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | [TIGR03414] choline ABC transporter, periplasmic binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000296796 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00000000325695 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGCTC ATGGAAGCGG AAAACGTAAA GTTTTAATAG TAGCAATGCT TATTACAGGA ATTATTTTTG CAGGGATTGC TACCGGATGT GAAGAAGAAA GAGATGTAGA ACTTGCTATG GTCGAATGGA CATGCTCTAC TCAGAAGAGT CATATCAACG AAGCTGTATT AGAGACATTA GGTTACGATG TTAACGTTAA GACTTACAAT CTCCCTGTAA TCCTTGAAGG AATGGCAGAT GGGCAAATTG ATGCCTTTAC AGATGCATGG TTTCAAACTT GGGGAACCCC CCTTGAAAAT GCTTTGGAAG AAGGGGATGT AGTTCATTTA GAAACTCATT TAGATGAAAC TAATTACGCG CCAGCTGTTC CCACTTATGT ATATGAGGAA GGGGTAACCT CCCTAGAAGA TTTAGCCGAT CACTCGGAAA AATTTGAGTA TACTTATTAT GGCTTGGAAC CAGGGAATGA CGGTAATGAG ATTATGATCG AAGCTTTTGA AAATGATACC TACGGTCTAG GTGAATGGGA TATCATGGAA AGTAATGAAG CTGCTATGAT CGCTGATGTT GAGCAAAAGA TAGAAAATGA AGAATGGGTA GTCTTTAGCG GTTGGGAACC CCATTACATG AATGTAATAT TCGATATGGA ATATTTGGAT GATCCCAAAG GAATTTGGGG TGAAGGTGAG CAAGTTGGTA CCATTGCAAG ACCTGGCTTA GAAGATGACA ATCCACAACT AGCTCAATAT TTGAAACAAT TTGACGTAGA TGTAGACACT GTTGACGAAT GGGTTTACGA ATACGGTTAT GAAGACCGTG ACCCAGACGA AGTTGCGGAT GAATGGATTA GCGAGAACTT AGATAAGGTA TTAGAGTGGG TTGATGGATT AGAAACTGTC GATGGACAAG ATGCTCAAGA AGCATTGCGT GAAGCTTACG AATAA
|
Protein sequence | MNAHGSGKRK VLIVAMLITG IIFAGIATGC EEERDVELAM VEWTCSTQKS HINEAVLETL GYDVNVKTYN LPVILEGMAD GQIDAFTDAW FQTWGTPLEN ALEEGDVVHL ETHLDETNYA PAVPTYVYEE GVTSLEDLAD HSEKFEYTYY GLEPGNDGNE IMIEAFENDT YGLGEWDIME SNEAAMIADV EQKIENEEWV VFSGWEPHYM NVIFDMEYLD DPKGIWGEGE QVGTIARPGL EDDNPQLAQY LKQFDVDVDT VDEWVYEYGY EDRDPDEVAD EWISENLDKV LEWVDGLETV DGQDAQEALR EAYE
|
| |