Gene Information |
Locus tag | Nther_1827 |
Symbol | |
ID | 6314376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1901228 |
End bp | 1902103 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644205 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001917987 |
Protein GI | 188586442 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
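The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. Both are common conventions, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1901228
end_bp = 1902103

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 876 bp and protein length of 291 aa.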
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00135653 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAGC TGAAATTAGT ACTAGTGACA ATGATAACAA TTGTTTTATT AACTGCCTGT ACCAATGGAG GAGGCGGTGT TACAGCTGGA GAAGAATCAA AAGAAACCAT AAAATTTGGA ATGACAGATT GGACAAGTAC AGCAGTTCCA ACAGAAATTG CCAGACAAAT TTTAGAAGAA GCAGGTTATG AAACAGAAAC AACTAATGCA GATCAGCCAG TAATTTTTGT AGGATTAGTG GATGAAGAAA TTGATTTCTT TATGGATGCA TGGTTACCCT ATACAGAAGA GGCCCTTTGG GACGAGCATG GAGAAGATCT ACAGAAAGTA TCTGCAAGTT ATAAAGAAGC TCCCTTAGGT TGGGTTGTTC CCGAATATGT GGAGGAAGAC ACTCTAGATG AGTTTTTGGC TAATCTAGAT AAGTATAATA ATGAAATTGT AGGTATTGAC TCTGGAGCAG GCATATCAGA AATCTCTAAA GAAATGATAG AAGATCTTGG AATAGGTGAT GAACTTGAAT TTGTCCCTTC AAGTGAAGCT GCTATGATGG GTGAAGCCAT GAGTGCCATG GACAATGAAG AACCAATAGC TTTCCTTGGA TGGAGACCTC ATTCTATGTT CACTCAATAT GATATCAAAT TTTTAGAGGG TCAAGAGGAA TATTTTAAGG CAGATAATGT ATATGTAATT TCTTATGAAG GTATTGAAGA CAAACATCCG GAAGCTTATG AAATATTATC AGACTGGAGT ATGCCCATTG AAGATTTAGA GGAAATGATG TACGAGCATG AAGAAAATGA TGTAGAATAT GAAGTACTTG CTGAACAGTG GATTAAAGAA AACCGTGACA AAGTTGATGA AATGTTGGGC AATTAA
|
Protein sequence | MRKLKLVLVT MITIVLLTAC TNGGGGVTAG EESKETIKFG MTDWTSTAVP TEIARQILEE AGYETETTNA DQPVIFVGLV DEEIDFFMDA WLPYTEEALW DEHGEDLQKV SASYKEAPLG WVVPEYVEED TLDEFLANLD KYNNEIVGID SGAGISEISK EMIEDLGIGD ELEFVPSSEA AMMGEAMSAM DNEEPIAFLG WRPHSMFTQY DIKFLEGQEE YFKADNVYVI SYEGIEDKHP EAYEILSDWS MPIEDLEEMM YEHEENDVEY EVLAEQWIKE NRDKVDEMLG N
|
| |
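The relationship between the gene and protein sequences above can be illustrated with a short sketch. It assumes the displayed gene sequence is already in the coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. Only the first 30 bases are used here; the function names are hypothetical.

```python
# First 30 bases of the gene sequence from the record (spaces removed).
gene_prefix = "ATGCGAAAGCTGAAATTAGTACTAGTGACA"

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard genetic code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


prefix_protein = translate(gene_prefix)
prefix_gc = gc_fraction(gene_prefix)
print(prefix_protein)
```

The translated prefix matches the first ten residues of the protein sequence in the record (MRKLKLVLVT). Applying `gc_fraction` to the full gene sequence, with the spaces removed, would be one way to check the listed GC content.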