Gene Nther_0839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0839 
Symbol 
ID6315650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp879499 
End bp881208 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content31% 
IMG OID642643212 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001917012 
Protein GI188585467 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00192903 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTTA TCAGTTTTTC AGAAGAAGAT TGCGTAAATT GCTATAGATG TATCAGAGCA 
TGTAATACTA AAGCTATATC TGTTTTTGGA GATCCCCGGG AAAACGATGA AAAGCTTTGT
ATATCTTGTG GAGAGTGTTA TGTTGCTTGT GAACGAAATG GTTTGACTAT AAAGGATAAA
GTTAAAGAAG TTAAGGATGC TTTATCTTCC AATAAAAAAG TAATAGCCAG TTTAGCACCT
TCCTTTCCTG GTGCATTTTC TATAAAAGAA GGTGGAAAAA TTGTTACTGC TTTAAAAGAA
TTAGGATTCT ATGCAGTAGA AGAAACAGCA GTTGGAGCTG ATGTTGTAAT AAATGCTTAT
GAAAACTTCA CTAATGATGC AAAGCAAAAA AATTTGATTT CAACAAGTTG TCCTTCTACA
GTCTATTTAG TCGAAAAATA CTATCCTGAG TTAGTAAATT ATCTAATACC TGTAGTGTCA
CCTATGATTG CCCATGGAAA AATGATTCGT GAAAAATATG GTGAAGATGC ATATATAGTT
TTTATAGGGC CTTGTATTGC TAAAAAAGTA GAAGCTCACG AAAGTCAATA TAATGGAATA
ATTAATTCTG TGCTCACTTT TGTTGAATTA CAAAAATGGT TTAAAGATAA TAATTTAGAT
CTCAAAAACC AATCACCAGA GCCTTTTGAT GAAAAAGCTA ATGATAAGGG AAAGTTAATA
CCTTGCAATA TCAAGTTATC TAATGAAAAA AATGAAAACT ATGAAAAGAT AGTTGTCACT
GGGGTAGAGC GTTCTAAGGA AATTTTAAAT AGTTTAAAGA AAGGTGAGCT CAGCGGTATA
TTTTTAGAAA TCTTATCTTG TCCTGGAGGA TGTATAGATG GCACAGGTAT GCTTAAAGAG
GCACCAAGCT ATTATGTTAG AAAAAAGTAT GTTAAAGATT ATATTGCTAA AAAGTGTAGT
GATGATGTCA AAATAGAAAA TAAGTTTGCA CAAAAAATTG ATATCGAAAG ACATGTATCT
TCAAAAAAAG TTAAAAAATA TCGCCACCCT AAAGAACAAA TTGACGAAGT CTTAAATAGT
ATAGGTAAAA ATAAAAAAGA AGATGAGCTA AATTGTAATG CTTGTGGTTA TGTTACATGT
AGAGAATTTG CAGAGTCAAT TTTAAGAAAA AACGCCCATG TTAACATGTG TCATCCTTTT
ATGCGGTCAA AAGCAGAAAG CTTGAAAAAT GTGATTTTTG ATCATAGTCC CAATGCAATA
TTTATGGTTG ATTTTAATTT GGTAGTAAGA GAATTCAATC CTTCTTCGGA ACAAATATTT
AACATAGAAG CTAATAAAAT AAAAGGCAAG AGAATTGGTG AGATAATTGA AGAAGACACA
TTTAAAAAAG TTTTAGAAAC CAAAACTAAT CTTATAGGAA AAAAAGTCGA GTATTCTAAT
TATGGTGTTA TATTAATAGC TAATATTATA TACCTCAAAG AAGAAAAAGT ATTAATGGCT
ATCATGACTG ATGTAACTTC AGCAGAAAAA AATAAAGAGG AGTTAGTAAG AGTTAAAAAA
AACACCATTG ATGTCGCTCA AAGTGTTATA GATAAGCAAA TGAGGGTTGC ACAAGAAATT
GCTAGTCTTT TAGGAGAAAC TACAGCAGAA ACTAAAGTTG CATTAACGGA TTTAAAAGAT
ATTGTCATTG AGGAACGGAG GGATGATTAA
 
Protein sequence
MKFISFSEED CVNCYRCIRA CNTKAISVFG DPRENDEKLC ISCGECYVAC ERNGLTIKDK 
VKEVKDALSS NKKVIASLAP SFPGAFSIKE GGKIVTALKE LGFYAVEETA VGADVVINAY
ENFTNDAKQK NLISTSCPST VYLVEKYYPE LVNYLIPVVS PMIAHGKMIR EKYGEDAYIV
FIGPCIAKKV EAHESQYNGI INSVLTFVEL QKWFKDNNLD LKNQSPEPFD EKANDKGKLI
PCNIKLSNEK NENYEKIVVT GVERSKEILN SLKKGELSGI FLEILSCPGG CIDGTGMLKE
APSYYVRKKY VKDYIAKKCS DDVKIENKFA QKIDIERHVS SKKVKKYRHP KEQIDEVLNS
IGKNKKEDEL NCNACGYVTC REFAESILRK NAHVNMCHPF MRSKAESLKN VIFDHSPNAI
FMVDFNLVVR EFNPSSEQIF NIEANKIKGK RIGEIIEEDT FKKVLETKTN LIGKKVEYSN
YGVILIANII YLKEEKVLMA IMTDVTSAEK NKEELVRVKK NTIDVAQSVI DKQMRVAQEI
ASLLGETTAE TKVALTDLKD IVIEERRDD