Gene Information |
Locus tag | Nther_2048 |
Symbol | |
ID | 6315566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2163651 |
End bp | 2164676 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642644436 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001918203 |
Protein GI | 188586658 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
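The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the stated gene length includes one stop codon. The record does not say either of these explicitly. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2163651
END_BP = 2164676
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming the length includes exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 2164676 - 2163651 + 1 = 1026 bp, and 1026 / 3 - 1 = 341 aa.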
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0715987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00000000000901879 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGAGTTA TCAATAGTAT TATCAAAAAA TGTGTTTTTG CTATACTTTT TGGCATGTTA GTTTTGGGAG TAACCGGTTG TGCAGACAGC GAAGCTCAGG ATGAGGTTGA ACTAACTTTT GCAGATGCAG GATGGGAAAG TATTAGGGTA CATAACTATA TAGCAGGGAT TATTTTGGAA GAAGGATATG GTGGATATAG ACAAGATATA ATGTCTGGAT CTACACCGGT AACCTTTACT GATTTACGCG GTGGCGGAAT TGACATTTAC ATGGAAGTGT GGAAAGAAAA TATCCAAGAG GAGTACAATG AAGCCCTAGA AAAAGGTGAA ATCCAGGTCC TGTCGATTAA TTTTGATGAC AACTTTCAAG GATTATATGT ACCCACCTAT GTTATAGAAG GTGATGAGGA CCGGGGAATC GATCCTATTG CGCCTAATTT AGAGTCTGTG TTTGATTTAC CTGATTATTG GGAAGAGTTT CAGGATCCGG AAGACCCGGA TAAAGGCCGG ATAATAGGGG CTCCCTCCGA ATGGGCAGTA GATGAAATCC TTGAAATTAA GGTAGAAACT TACGGATTAG ATGAACACTT TAATTATGTG AGTCCTGGTT CAGAATCTAC GTTGAATGCA ACTATAATGG ATGCCTATGA AAGTGGCGAA CCAGTTGTGG CGTATAACTG GGAACCTACA TGGATTATGG GTAAATATGA CATGACTTTA TTAGAAGAAC CAGAATTTGA TGAGGAAAAA TATTACGAAG AAGGATATGG AACTGAAATT CCTTCTATGG ACGTTACAGT AGCAGTTAAT TCTGATTTAG CAGAGGAACA TCCCGAGGTA GTTGAGTTTT TGGAGAATTA TGAGACTAGT AGTGAAGTAA CAAGTGAAGC TTTGGCTTAT ACGGAAGAAG CTGATGCAGA TGAACGAGGA GCAGCAAAAT GGTTTTTAAG AGAATATGAA GAGATTTGGA CTGAATGGGT TAATGAAGAA GTGGCTGAAA ATGTTAGAGA ATATTTACAG CAATAA
|
Protein sequence | MRVINSIIKK CVFAILFGML VLGVTGCADS EAQDEVELTF ADAGWESIRV HNYIAGIILE EGYGGYRQDI MSGSTPVTFT DLRGGGIDIY MEVWKENIQE EYNEALEKGE IQVLSINFDD NFQGLYVPTY VIEGDEDRGI DPIAPNLESV FDLPDYWEEF QDPEDPDKGR IIGAPSEWAV DEILEIKVET YGLDEHFNYV SPGSESTLNA TIMDAYESGE PVVAYNWEPT WIMGKYDMTL LEEPEFDEEK YYEEGYGTEI PSMDVTVAVN SDLAEEHPEV VEFLENYETS SEVTSEALAY TEEADADERG AAKWFLREYE EIWTEWVNEE VAENVREYLQ Q
|
| |
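The gene and protein sequences above can be related by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares; table 11 differs in which start codons it permits, and that does not matter here because this gene begins with ATG. It also assumes the gene sequence is already given in the coding orientation, even though the gene lies on the minus strand. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the three bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGAGAGTTA TCAATAGTAT TATCAAAAAA"))  # MRVINSIIKK
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence field (after `clean`) provides a check on the record, and `gc_percent` can be compared with the GC content field in the same way.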