Gene Nther_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0474 
Symbol 
ID6315537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp502191 
End bp503498 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content42% 
IMG OID642642858 
Productselenoprotein B, glycine/betaine/sarcosine/D-proline reductase family 
Protein accessionYP_001916658 
Protein GI188585113 
COG category 
COG ID 
TIGRFAM ID[TIGR01917] glycine reductase, selenoprotein B
[TIGR01918] selenoprotein B, glycine/betaine/sarcosine/D-proline reductase family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTAAAA AAATAGTCCA TTATATAAAC CAGTTCTTTG GCCAGATCGG TGGTGAGGAA 
AAGGCTGATA CGGCGCCATT TGTGAAGGAA GAACCTGTTG GACCTGGAAC CGCTTTAAAT
GGTCAATTAG GAGATGAAGG TGAAATTGTT GCCACTATCA TTTGTGGTGA TGATTATTTC
TCTTCTAATA CCGAAGAGGC TAAAGAAGAA ATTCGTAAAG TCTTACGGGA TTATGACCCT
GATCTGGTAA TTACTGGTCC AGCCTTTAAT GCTGGGAGAT ATGGAACAGC AGCTGGCGGC
GTTGCTGAAC TGGCTGCAAC TGAGTTTGAA CTACCTGTAG TTTCAGGTAT GTATCCGGAG
AACCCCGGTG TTGATATGTA CAAAAAGTAT GCCTATATAA TTGAAACATC AGATTCCGCT
GCTGGCATGA GAAAAGCTGC TCCAGCCATA GCCGAATTAT CTAAGAAAAT TCTTAGAGGC
GAAGAACTGG GAACTCCTAA GGAAGAAGGT TATATCCCCC GAGGAATTCG CAAAAATGTC
TTTTTTGAGG AGCGCGGTTC CAAACGGGGT GTGGATATGC TGGTTAAAAA ACTTAATCAA
GAGAAGTTCG ACACAGAATA TCCCATGCCA GACTTTGATC GGGTTGATCC TCAACCTGCT
ATAAAAGATA TGGCAAATGC TAAAATTGCC TTGGTAACTT CAGGTGGAAT TGTACCTAAA
GGAAATCCCG ATGGAATTGA GTCATCCAGT GCTTCTAAAT ATGGAAAATA TGAATTAAAA
GGCATGGATA CTTTGACAGC AGAAAGCCAT GAAACTGCAC ATGGTGGTTA TGATCCAACT
TATGCCAATG AAAATCCCAA TAGAGTTCTT CCCTTGGATG CGGCTAGGAA ACTTGAACAA
GAGGGTCGAA TTGCTGAGTT ACATGAATAC TTCTATTCTA CAGTAGGAAA TGGAACTTCT
GTTGGAAATG CTCAACAATA TGCGGCTGAA ATAGCAGAGG ATCTAAAGAA TCACGGTGTG
GATGCTGTTA TACAGACTTC CACCTGAGGC ACATGTACTC GTTGCGGTGC AACGATGGTT
AAAGAATACG AGCGTGCTGG TTTACCAGCG GTTCATGTGG CCTCAATAGT GCCGATTTCA
AAGACTGTGG GAGCTAATAG AATAGTTCCA GCTGTAGCTA TTCCACATCC ACTGGGCAAT
CCAAAACTAG ACGAAGAGGA AGAATTTAAG GTGAGAAAGG ATCTGGTTGA TAAAGCACTC
AAAGCTTTAG AAACCGAGCT AGAAACTCAA ACTGTTTTTG AGGACTAG
 
Protein sequence
MSKKIVHYIN QFFGQIGGEE KADTAPFVKE EPVGPGTALN GQLGDEGEIV ATIICGDDYF 
SSNTEEAKEE IRKVLRDYDP DLVITGPAFN AGRYGTAAGG VAELAATEFE LPVVSGMYPE
NPGVDMYKKY AYIIETSDSA AGMRKAAPAI AELSKKILRG EELGTPKEEG YIPRGIRKNV
FFEERGSKRG VDMLVKKLNQ EKFDTEYPMP DFDRVDPQPA IKDMANAKIA LVTSGGIVPK
GNPDGIESSS ASKYGKYELK GMDTLTAESH ETAHGGYDPT YANENPNRVL PLDAARKLEQ
EGRIAELHEY FYSTVGNGTS VGNAQQYAAE IAEDLKNHGV DAVIQTSTUG TCTRCGATMV
KEYERAGLPA VHVASIVPIS KTVGANRIVP AVAIPHPLGN PKLDEEEEFK VRKDLVDKAL
KALETELETQ TVFED