Gene Information |
Locus tag | Nther_1016 |
Symbol | |
ID | 6315142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1084328 |
End bp | 1085224 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642643388 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001917188 |
Protein GI | 188585643 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
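The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption here; the record does not state it. A minimal Python sketch of the check follows, using the hypothetical helper names `gene_length` and `protein_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1084328, 1085224)
print(length_bp, protein_length(length_bp))
```

With the values from this record, 1085224 - 1084328 + 1 gives 897 bp, and 897 / 3 - 1 gives 298 aa, which matches the Gene Length and Protein Length fields.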
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000028546 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGTAAAA AGTTACTTGT TACACTAACT ATTCTTTTAC TAACAGCAAG TTTTGCGGTT GGTTGTGATG GGGGTTCAGA AGAAGACGGA GATGCGTCAG TTCCTGAACG ATTTGATAAT GAAATTGTAG GTATAGACTC TGGTGCTGAG ATTATGGGCA CAGTAGAAAA TGAAGTCATG GTGGAATACG GCCTGGATGA GTACGATCTT GTTGAATCTA GTGAAGCTGG TATGATTACT GAAATTGATT CTAGGGTAGA TGACGAGGAA TGGGTAGTGG GGATCGGTTG GACACCTCAC TGGAAAATCC CAGAATATGA TATGAAATTT TTAGAAGATC CTAAAGGTAT ATTTGGTGAA GCGGAAAATA TCAAAGCATT AAGTAGAGCA GGTTTTACTG ATGACATGCC TGAAGTTAGT CAAACTTTAC AAAATTTTTA TTTGACAGAA GATCAACTTG GTGAGTTGAT GGCATTAGTA GAAGACACAG ATAGTAACGA GAGGGAAGTT GTTAAAGAGT GGGCTGAAGA CAATCAAGAT GTAATTTCTG AATGGGTGCC AGAAGATCCC GATGGGGAAG GCGAGACAGT AGAGCTTTTA TACAATAACT GGACTGATGC TATTGCTTCC ACTAATCTAA TAGCTTATGT CTTGGAAGAG GAAATGAATT ATGAAGTAGA AATGGAAATG GTGGATGTGG CTTTTGTTTT TGAAGGCTTA GCTAGTGGAG ACTACGATGC CATGGTTTGT GCCTGGTTAC CATTAACTCA AGCAAATTAT TGGGAAGAAT ACGGAGATGA CTTGGAAGAC TTGGGTCCTA TCTTTGAAGG AGCAAAATTA GGGTTGGTAG TTCCCGATTA TGTTGAAATA GACTCCATTG AAGAAATGGC TGAATAA
Protein sequence | MCKKLLVTLT ILLLTASFAV GCDGGSEEDG DASVPERFDN EIVGIDSGAE IMGTVENEVM VEYGLDEYDL VESSEAGMIT EIDSRVDDEE WVVGIGWTPH WKIPEYDMKF LEDPKGIFGE AENIKALSRA GFTDDMPEVS QTLQNFYLTE DQLGELMALV EDTDSNEREV VKEWAEDNQD VISEWVPEDP DGEGETVELL YNNWTDAIAS TNLIAYVLEE EMNYEVEMEM VDVAFVFEGL ASGDYDAMVC AWLPLTQANY WEEYGDDLED LGPIFEGAKL GLVVPDYVEI DSIEEMAE
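The Sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way such a field could be cleaned, its GC fraction computed, and its codons translated. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The sketch relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), so one 64-entry lookup is used. It does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(field: str) -> str:
    """Remove the block spacing used in the Sequence fields."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
first_blocks = clean_sequence("ATGTGTAAAA AGTTACTTGT TACACTAACT")
print(translate(first_blocks))  # MCKKLLVTLT, the first block of the Protein sequence field
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared against the GC content and Protein sequence fields of the record.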