Gene Nther_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1044 
Symbol 
ID6314225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1108163 
End bp1109977 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content33% 
IMG OID642643416 
Producthypothetical protein 
Protein accessionYP_001917216 
Protein GI188585671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0661829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGGAA GAAAATTAGT TTATTTAGTA ATAGTAATAA GCTTAATGGT TTTAACAGTT 
TTTGTGACAG GCTGTACTCA AGAGAAAAAA GCAGGAGATG AACAGGCTAA ACAGGATGAA
GATGAACCAA ATGAATACAA AGTAGAGGTA CAAGCTAATC CTGAAGAAGC TGGTGAGATA
ACAGGAGAAG GTACTTATGA AGAGGGGGAA AAGGTTGAAC TAGAAGCAGT ATCAAAAGAA
GGTTATGAAT TTAAGAAATG GCATTTAGAA GGAGAAGATA TTAGTGAGGA TAAAAAATAT
AATTTTACAA TGGAAAAAGA TAAAAAGTTC CAGGCAGTAT TTGAGACAGA ATATAATTCG
GATAAGTTTG AACAGATAAC CACAGAAACA GATGATGGTA CTATTAAAGA AACTATCATG
GGAGCAAATT TAATAGATGA ATTAGACCCT GTAAAATCAA ATGCATTAAC TGACGAGCCA
ATTTATATAG GAGAAAATTA TATTGCTCAA CCTGATGAAG AAAAAGGTGA CTTGGTCATT
TACCAAAAAG ATAATTTAGA ACATTTTGAT ACCGTAGAAG GAATAGTTAC AAAAGAAGGG
ACTAATGAGA AATGTGTAAT TATTGAAGAT CAGTTGATTT ATATACCAAA TACCGGTCAA
AGAGCGTATT TTTATAGTCT CGAAGAGGAA CAAGCTTCAA AAGAAAAAAA ATTAAATATA
CAAGATTTAG AATGGGACAT ACCAGAGGGT TTAGAGAAGG ATGGGAGAGG AAAATTCTTT
ATAAACATAC ATAACAATCT ACACTTTCAA ACTAGTGTTA TTGGCAATCA CCTATTTCTT
TACGCAGATC ATTCTTGGGT AGTAAGGCAT GAAAAACCAC CCAGAGTTGA AAAACCTTTT
TTAAAAGTAT TCGAGTTATC TGAAGATGGA TTTAAACAAA AAGAGATAAA TTTTGAAGAG
GAGTTTGAAG AGTCCCCGTT AATTACGGAT ATAGCAATAT ACGATGATAA TAGCGCTATT
ATAGCTAGTG GTGATAAAGG ATTTAGTTTA ATAGATTTAA ACAATTTCTC TATAGAACAT
TTGGAGGTAG GCGGTACTAG TCATGAGGGG GAAGTCCCTG GCGAAGGTGA TTCAAATTTT
GTGTCCCATA AAATTTTAGG AGTCAATGAA TATGGCATTT TACTTACTAA AAGTGAAGTG
CCAATACATG CCTGTTCTGC TAGAAGTATA GAAATATGGG CTCCGGATAG CAATAATTCC
CTCGATTTAA CAGCAAGCAC TGTACAGCAC CTTCCCGGTG GTTTTGAAGA GGAAAAAGAT
GATGTAATGT TGGTAGGTGC TATCCCAGAT GAAAAAGGGA TTAAAAACAA TTTATGGCCT
ACTAATATGA CTGAATCTGA GCAAATTGAG GAAGGCAAAA GAAAATCTTC ATTACAGATA
TCTGCTTTTG TAAGTTTAGA TAATAATAAA GTGATGAAAG ATGTTGGTGA AAATAATTCT
AAATTGAGTG AGTACCTACA CAAAGAATTA GATTTTGCTC AAACATTAGA TAATGGAGAG
ATAGACCAGC TTCCGGTTAT TGTAAAAGAT ACAGATAAAG ATCATTTACA CCATACATTT
GACGATCCGC GTCCTCCTAT TGAAGCTCCT GTTTTTGCGT CAACTAATCT AAACGCGAAT
AGTTTAGATA TAGAATTGGT TTATCTTAAA GATAATATTA AGCTTATCCA AATTGATCCT
GAAGATCAGG ATATTTACTT AAAGAAAGAT GATTCAGTTT ATCAGATAGA GTATGATTCT
TTGTTTAGTC ATTAA
 
Protein sequence
MSGRKLVYLV IVISLMVLTV FVTGCTQEKK AGDEQAKQDE DEPNEYKVEV QANPEEAGEI 
TGEGTYEEGE KVELEAVSKE GYEFKKWHLE GEDISEDKKY NFTMEKDKKF QAVFETEYNS
DKFEQITTET DDGTIKETIM GANLIDELDP VKSNALTDEP IYIGENYIAQ PDEEKGDLVI
YQKDNLEHFD TVEGIVTKEG TNEKCVIIED QLIYIPNTGQ RAYFYSLEEE QASKEKKLNI
QDLEWDIPEG LEKDGRGKFF INIHNNLHFQ TSVIGNHLFL YADHSWVVRH EKPPRVEKPF
LKVFELSEDG FKQKEINFEE EFEESPLITD IAIYDDNSAI IASGDKGFSL IDLNNFSIEH
LEVGGTSHEG EVPGEGDSNF VSHKILGVNE YGILLTKSEV PIHACSARSI EIWAPDSNNS
LDLTASTVQH LPGGFEEEKD DVMLVGAIPD EKGIKNNLWP TNMTESEQIE EGKRKSSLQI
SAFVSLDNNK VMKDVGENNS KLSEYLHKEL DFAQTLDNGE IDQLPVIVKD TDKDHLHHTF
DDPRPPIEAP VFASTNLNAN SLDIELVYLK DNIKLIQIDP EDQDIYLKKD DSVYQIEYDS
LFSH