Gene Nther_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1954 
Symbol 
ID6315861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2063061 
End bp2064068 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content35% 
IMG OID642644344 
Producthypothetical protein 
Protein accessionYP_001918112 
Protein GI188586567 
COG category[R] General function prediction only 
COG ID[COG5401] Spore germination protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.508711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.643926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAGG TAAAATTGAT AGTTTTAATA ATAGTGATTA GTTCCATGGC ACTGTTATTA 
AGCGGATGTT TTTTTCTAGA TTTTATTTTT GGGCCAGATC CGGATGAAAC CACTAAAGAA
GAACAAGAAG AATTGGAACC AAAAGAAGAT GAACCCACAG AAATACGAGA TTACGATGAG
GATGAAAAGA GAGAAACAGT ATTTTATTTG CTTGATAACA ACGATAACTT AGTACCAGTG
GTTAAACCCA TTGAATGGAC TGAAGGAATA GCTACTAAAA CCTTGAACAA AATGTCTCAA
ACACCGGCAA ATGAAGAGTT TTGGATGGAT ACAAACTTAA CTCCTACTTT ACCTAATGGA
ACCAAAGTCA AAGGAATGGC TATTGACGAT GGTAGAGCCC GGGTGAACTT TACCGAAGAA
TTTTTGGACA TGGAACCAGA ATCGGAACAA GAAATTAAAA ATTCTATTGT TTATACTTTG
TCTGAATTTG AAACTATTGA TGAGGTAGAA ATCATGGTTG AAGGTGAATT TATTGAAAGT
TTACCAGGAG GGACTGATGT TTCAGGACCT TTAACACGAG AAGGATTGAA TGTAGAAATA
ACTGCTGAAG CAGAAAGTGC AGAGAATGAG ACGGGAGTAA ATTTATATTT CTTATCACAA
GATGGAGAGT ATGTAGTTCC TGTTACTAGG TATATCCCAG ATACTGAAGA GTTAGAGGGT
AATGCCATTT ATGAATTAAT GAAAGGTGCA GACCCAGATA GTGGACTTAT CTCTTATGTT
TCAACAGATC TTGATATTAA TGACGTCAAG ATAGAAGGCA ATACTATGCA AATGGATGTT
TCTAACTTAG CAGATGACCC AGAACATCAA GAACTTGCAC TAAAGCAGTT GAAATTCACA
TTAACAGACT TTGATTATAT CGATAGTATG GAGATATCTA TAGATGGAAC ACCTATAGAT
ATTGACGAAA GAGTGATGAA CTTAGAGGAA GTTAATTTTA GATATTAA
 
Protein sequence
MGKVKLIVLI IVISSMALLL SGCFFLDFIF GPDPDETTKE EQEELEPKED EPTEIRDYDE 
DEKRETVFYL LDNNDNLVPV VKPIEWTEGI ATKTLNKMSQ TPANEEFWMD TNLTPTLPNG
TKVKGMAIDD GRARVNFTEE FLDMEPESEQ EIKNSIVYTL SEFETIDEVE IMVEGEFIES
LPGGTDVSGP LTREGLNVEI TAEAESAENE TGVNLYFLSQ DGEYVVPVTR YIPDTEELEG
NAIYELMKGA DPDSGLISYV STDLDINDVK IEGNTMQMDV SNLADDPEHQ ELALKQLKFT
LTDFDYIDSM EISIDGTPID IDERVMNLEE VNFRY