Gene Information |
Locus tag | Nther_1620 |
Symbol | |
ID | 6315383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1698903 |
End bp | 1699784 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642643996 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001917782 |
Protein GI | 188586237 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
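The length fields above are consistent with 1-based, inclusive coordinates and a CDS that includes its stop codon. The record does not state these conventions explicitly, so the following is only a minimal sketch of the arithmetic under those assumptions. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1698903
END_BP = 1699784


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 882
print(protein_length(length_bp))  # 293
```

Both results agree with the Gene Length and Protein Length fields listed in the record.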
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000349679 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00000000178449 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAGA AGCTGATTTT AGTTTTAGCA ACAGTATTTT TAGTAGGAAC TATTGGTTTA GCTGGTTGTG ATGAGCCAGC TCAAGATGGG GACCCAGAAG GAACCCAAGA GGAGATTGAA TTAGCGTATG TTAACTGGGC ATGTGCTGAA GCCCAAACAC ATGTAGCCCA AGAAGTAATT GAAAGCGAAT TGGGATACGA TGTGGAGATT ACAATGGCAG ATGCTGGTCC TATTTGGGCT GACGTAGCTG CAGGAAACCA AGACGGAATG GTTTGTGCTT GGTTACCAGT AACGCAAGGT GAGTACGATG ATGAATATGA TGGCGATGTA GACAACTTAG GACCAGTTTA TGAAGGAGCT AGAATTGGCT TAGTTGTTCC TGAATACGTA GATATCGATA GTATTGAAGA GATGGATGAA ATCGCTGATG AATTAGATAA TGAAATAGTT GGAATCGAAC CCGGTGCAGG AATTATGATT AATACTGATG AAGCACTTGA AGAGTACGAC AGTCTAGCCG AGTTTGAATT GATTGATAGT TCCGATGCAG GAATGACTAC ATCTTTAAGT GATGCTGTAG ACAATGAAGA GCCAATTGTT GTTACAGGAT GGACACCACA TTGGAAATTT GCCGAGTGGG ATCTTGAATT CTTAGAAGAT CCATTGAATG TATACGGAGA AGAAGAACAT ATAGCTGCAG TTGCAAGACA AGGGCTAGAA GATGATGCCC CAGATGTTTA CGAATTCCTT GATAACTACA TCATGGATGA CGATCAGATT GGTGAGGTAA TGGGTATGAT TGAAGAAACT GATGACCCTG AAGCATCTGC CGAAGAATGG GTTGAAGAGA ATCAAGATGT TGTTCAAGAA TGGTTAGATT AA
|
Protein sequence | MSKKLILVLA TVFLVGTIGL AGCDEPAQDG DPEGTQEEIE LAYVNWACAE AQTHVAQEVI ESELGYDVEI TMADAGPIWA DVAAGNQDGM VCAWLPVTQG EYDDEYDGDV DNLGPVYEGA RIGLVVPEYV DIDSIEEMDE IADELDNEIV GIEPGAGIMI NTDEALEEYD SLAEFELIDS SDAGMTTSLS DAVDNEEPIV VTGWTPHWKF AEWDLEFLED PLNVYGEEEH IAAVARQGLE DDAPDVYEFL DNYIMDDDQI GEVMGMIEET DDPEASAEEW VEENQDVVQE WLD
|
| |
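As a rough consistency check between the two sequences, the gene sequence can be translated codon by codon. The sketch below is an illustration under stated assumptions, not the database's own pipeline. It uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). It strips the spaces used for display grouping, and it is shown only on the first 30 bases of the record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in the record.
prefix = "ATGAGTAAGA AGCTGATTTT AGTTTTAGCA"
print(translate(prefix))  # MSKKLILVLA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows comparison against the Protein sequence and GC content fields.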