Gene Information |
Locus tag | Nther_0474 |
Symbol | |
ID | 6315537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 502191 |
End bp | 503498 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 642642858 |
Product | selenoprotein B, glycine/betaine/sarcosine/D-proline reductase family |
Protein accession | YP_001916658 |
Protein GI | 188585113 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01917] glycine reductase, selenoprotein B; [TIGR01918] selenoprotein B, glycine/betaine/sarcosine/D-proline reductase family |
|
|
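The coordinates and lengths listed above agree with one another, and the check is plain arithmetic. The sketch below assumes 1-based, inclusive coordinates (which the listed values imply) and a complete CDS whose final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields for Nther_0474.
length = gene_length_bp(502191, 503498)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both results match the Gene Length and Protein Length fields.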
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTAAAA AAATAGTCCA TTATATAAAC CAGTTCTTTG GCCAGATCGG TGGTGAGGAA AAGGCTGATA CGGCGCCATT TGTGAAGGAA GAACCTGTTG GACCTGGAAC CGCTTTAAAT GGTCAATTAG GAGATGAAGG TGAAATTGTT GCCACTATCA TTTGTGGTGA TGATTATTTC TCTTCTAATA CCGAAGAGGC TAAAGAAGAA ATTCGTAAAG TCTTACGGGA TTATGACCCT GATCTGGTAA TTACTGGTCC AGCCTTTAAT GCTGGGAGAT ATGGAACAGC AGCTGGCGGC GTTGCTGAAC TGGCTGCAAC TGAGTTTGAA CTACCTGTAG TTTCAGGTAT GTATCCGGAG AACCCCGGTG TTGATATGTA CAAAAAGTAT GCCTATATAA TTGAAACATC AGATTCCGCT GCTGGCATGA GAAAAGCTGC TCCAGCCATA GCCGAATTAT CTAAGAAAAT TCTTAGAGGC GAAGAACTGG GAACTCCTAA GGAAGAAGGT TATATCCCCC GAGGAATTCG CAAAAATGTC TTTTTTGAGG AGCGCGGTTC CAAACGGGGT GTGGATATGC TGGTTAAAAA ACTTAATCAA GAGAAGTTCG ACACAGAATA TCCCATGCCA GACTTTGATC GGGTTGATCC TCAACCTGCT ATAAAAGATA TGGCAAATGC TAAAATTGCC TTGGTAACTT CAGGTGGAAT TGTACCTAAA GGAAATCCCG ATGGAATTGA GTCATCCAGT GCTTCTAAAT ATGGAAAATA TGAATTAAAA GGCATGGATA CTTTGACAGC AGAAAGCCAT GAAACTGCAC ATGGTGGTTA TGATCCAACT TATGCCAATG AAAATCCCAA TAGAGTTCTT CCCTTGGATG CGGCTAGGAA ACTTGAACAA GAGGGTCGAA TTGCTGAGTT ACATGAATAC TTCTATTCTA CAGTAGGAAA TGGAACTTCT GTTGGAAATG CTCAACAATA TGCGGCTGAA ATAGCAGAGG ATCTAAAGAA TCACGGTGTG GATGCTGTTA TACAGACTTC CACCTGAGGC ACATGTACTC GTTGCGGTGC AACGATGGTT AAAGAATACG AGCGTGCTGG TTTACCAGCG GTTCATGTGG CCTCAATAGT GCCGATTTCA AAGACTGTGG GAGCTAATAG AATAGTTCCA GCTGTAGCTA TTCCACATCC ACTGGGCAAT CCAAAACTAG ACGAAGAGGA AGAATTTAAG GTGAGAAAGG ATCTGGTTGA TAAAGCACTC AAAGCTTTAG AAACCGAGCT AGAAACTCAA ACTGTTTTTG AGGACTAG
|
Protein sequence | MSKKIVHYIN QFFGQIGGEE KADTAPFVKE EPVGPGTALN GQLGDEGEIV ATIICGDDYF SSNTEEAKEE IRKVLRDYDP DLVITGPAFN AGRYGTAAGG VAELAATEFE LPVVSGMYPE NPGVDMYKKY AYIIETSDSA AGMRKAAPAI AELSKKILRG EELGTPKEEG YIPRGIRKNV FFEERGSKRG VDMLVKKLNQ EKFDTEYPMP DFDRVDPQPA IKDMANAKIA LVTSGGIVPK GNPDGIESSS ASKYGKYELK GMDTLTAESH ETAHGGYDPT YANENPNRVL PLDAARKLEQ EGRIAELHEY FYSTVGNGTS VGNAQQYAAE IAEDLKNHGV DAVIQTSTUG TCTRCGATMV KEYERAGLPA VHVASIVPIS KTVGANRIVP AVAIPHPLGN PKLDEEEEFK VRKDLVDKAL KALETELETQ TVFED
|
| |
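The protein sequence contains a U (selenocysteine) where the gene sequence has an in-frame TGA codon, and its first residue is M although the gene starts with GTG. The sketch below shows one way to reproduce that relationship on short excerpts of the gene sequence. It is an illustration under stated assumptions, not the method used to produce this record: it builds the amino acid assignments of translation table 11 by hand, reads a leading ATG, GTG or TTG as methionine, and treats every internal in-frame TGA as selenocysteine. Real selenocysteine recoding depends on sequence context, so the `recode_internal_tga` flag is only appropriate for a known selenoprotein such as this one. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# assignments as the standard code; it differs in the permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these three initiator codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces that split the sequence into 10-base groups."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str, recode_internal_tga: bool = True) -> str:
    """Translate a CDS, optionally reading internal TGA as selenocysteine (U)."""
    seq = clean(dna)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        is_last = index == len(codons) - 1
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
        elif codon == "TGA" and recode_internal_tga and not is_last:
            protein.append("U")  # assumed selenocysteine insertion
        elif CODON_TABLE[codon] == "*":
            break  # stop codon ends translation
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Excerpts copied from the gene sequence above.
print(translate_cds("GTGTCTAAAA AAATAGTCCA TTATATAAAC"))  # MSKKIVHYIN
print(translate_cds("ACTTCCACCTGAGGCACATGT"))             # TSTUGTC
print(translate_cds("ACTGTTTTTGAGGACTAG"))                # TVFED
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared with the protein sequence above, and `gc_fraction` on the same input can be compared with the GC content field.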