Gene Nther_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1841 
Symbol 
ID6315668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1917341 
End bp1918861 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content32% 
IMG OID642644219 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001918001 
Protein GI188586456 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.212286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGTAG AGAAAAATGG AACTTTTTGT TACTTAAAAA ACAATAAGAA GCATCAAAAT 
TTAACTGGAA TATCTTTAGA GGAACTTCGA GGAAAAACTC CTTATCAACT TTTTGAAAAA
GAACAGGCAG ACTATATAGT AAGTAAGTTT TTAACATGTT GTCAGCTTAA AGATACAATT
AACTTCCAAG AGAGTTTAGA ATTCCCCACA GGTAAGGGTA CTTTTCAGAC AATCTTATCG
CCAATATTAA AAGACGGAGA AGTAGTTAGA ATTGTGGGAA TTGGACGTGA CATTACAAAT
CAAAGACAAA GTCAGGAAGG TTTATGGTCC AGTGAGAAAA AACTCAGTAC ATACCTTGAT
AAAGCACCTA TTGGAGTATT TATTATCAAT AGTCTAGGGA ATTTTATTGA AATAAATCAA
AAAATTTGTA CAATGACGGG ATATTCAAAG AAAGATTTAC TGGAACTATC TCTTATAGAT
ATTACGACAT CAGATAATTA TAATTTTAAT ACAAAAGTAT TTAACAATCT AATTAATGAT
GGTCAAGTAG ATGAATTTTT TGAAATTAAC CATAAAAATG GAAATAAATT ATGGTTACAT
TTTCAGGCTT CCAAAATTGA TAATGACCAA TATATCTGCT TTGTTGGTGA TATCACAAAA
TTAAAAGAAT CAGAAAAGAG CTTGAAGAAA ACTCGTAAAA TCTTTAATCA TTTCCTGGAT
GCTATTGAAG ATGGGTTTTG GTATTGGGAT ATACAGACTG GAGAATTTTT TTTAAGTGAC
AATTTTTATT CTATGTTGGG ATACGAGCCC AATGAATTGA AATTAAACTA TAATGACTGG
CATAAGTTAA TCCATCCTGA TGATTTAGAA TCGGTAAAGA AAGAGATTTC ACAGAAACTT
CTAAGGGAAA AAGAAGGGGC TAATATTGAA TTTCGAGCAA AAACAAAATC GGGTAGCTAT
AAGTGGATTA ATGGTAAAGG AAATGTAGTA GAGTTTGATA ATAATAGAGT CCCTATAAAA
GCAGCAGGTA TTCAGATTGA TATTAATGCA AGAAAAGAAT ACGAGGAATT ATACAAATCA
ATTGTAAATG CAGCCAAAAC AGTTTCCTTG ATAGTTACTG ATCTTAATGG GATTATCAGA
GAGTTCAGTC CCGGAGCTGA AGCTATCTTT GGTTATACAA GGGATGAAAT GATTGGTAAG
CATGTTAGAA AAATAATAGT GTCTTCTGAT TTAAAGAAAA TTCCAGACTT TTTAAACAAA
TTAAAGCGGG AACGAGAAGG ATTTACCTTT GAAACTGAAC GGATAAGAAA ATCCGGTGAA
AGATTTCCTG TTCGCCTGAC ACTGCAACCT TTATTCGATA ATGAAGGTAA TTTTCATGGT
ACTTTGGCAA TCATTGTAGA CATTTCAGAT TTAAAAGAAA CGGAGAAAAG ACTTCGAAAG
AGTGAGCAGC GCTTTATACT CTTATTTCTC AAACTCCAGC AGTTATTTAT TCATTTAAAT
TGGTTGATAA TAGACCAATA G
 
Protein sequence
MSVEKNGTFC YLKNNKKHQN LTGISLEELR GKTPYQLFEK EQADYIVSKF LTCCQLKDTI 
NFQESLEFPT GKGTFQTILS PILKDGEVVR IVGIGRDITN QRQSQEGLWS SEKKLSTYLD
KAPIGVFIIN SLGNFIEINQ KICTMTGYSK KDLLELSLID ITTSDNYNFN TKVFNNLIND
GQVDEFFEIN HKNGNKLWLH FQASKIDNDQ YICFVGDITK LKESEKSLKK TRKIFNHFLD
AIEDGFWYWD IQTGEFFLSD NFYSMLGYEP NELKLNYNDW HKLIHPDDLE SVKKEISQKL
LREKEGANIE FRAKTKSGSY KWINGKGNVV EFDNNRVPIK AAGIQIDINA RKEYEELYKS
IVNAAKTVSL IVTDLNGIIR EFSPGAEAIF GYTRDEMIGK HVRKIIVSSD LKKIPDFLNK
LKREREGFTF ETERIRKSGE RFPVRLTLQP LFDNEGNFHG TLAIIVDISD LKETEKRLRK
SEQRFILLFL KLQQLFIHLN WLIIDQ