Gene Nther_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2236 
Symbol 
ID6315237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2373308 
End bp2375014 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content43% 
IMG OID642644624 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_001918390 
Protein GI188586845 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00215386 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGATCCA CTTTCCACGG TATGGAAACG GCTAAAAGGG CACTTTTTGC TCAGAGAAAT 
GCACTTGATA CTACTGGTCA TAATATAGCA AATGCCGAGC GCGATGGTTA TACCCGCCAG
AAAGCACATA TGAGTGCCAC ATCTCCATAT ACCATGCCTG CCCAGAATAT GCCCGGAACT
GCCGGACAGA TCGGAACAGG TGTGGAAGTT GAGAGTGTTG AAAGGATGCG GGATGATTTC
ATAGATTCAC AAATTAGAGA AGAATCCCGG TTTAAAGGGA ATTGGGAAAA CAAAAATGAT
ACCCTGAAAA AAATAGAAAC CTTATATACA GAACCATCGG ATAGTGGTTT GAGATCTGTA
TTCGACCAGT TCTGGGAGTC TCTACAGGAT CTTTCCAAGA ATCCGGAAGA CAGGACCGCT
CGTACCCAGG TCATGGAACG GGGTGTTTCC CTGGCGGATA CTGTTAACCA CATGTACGGT
CAGTTTGTAA ACTTAAAAGA AGATTTAGAT GAAAGTATAG ATATCAAAGT TGATGAGATT
AATTCCATTG GTCATCAAAT TGCCGATTTG AACGAGCAGA TTCAAAATAT CGAGCTTCGA
ACTAACCAAA ACGCAAATGA TCTCCGCGAT AAGCGAGATA TGCTTTTAGA CGAGCTTTCC
GAGCTTACAA ATTATAATTT AAGTGAAGAT GATAAGGGAA ATGTCAGAGT TAGTATAGGC
GGTACCTCCC TGGTGGAAGG GAATCGGGTC AATGAGCTTG AATTCGAAGC GGAAGCTGCT
GATGCAGATA ATGACAACGG AAACAACGAG GGCAATGGGA ACATCTATTT AGACGATCAA
GAACCTAGAG GTGGTAAAGT TAAGTGGAGC CATACCGGAG AGGAAGTTAA CTTCCGGGAT
GGTGCCATGG GCGGTCTGAT GGATTCCCGT GATAAAATCG TCCAGGACCA CATAGACGAA
CTCCGAAGTG TCATGGCCCA GTTCCAAGAA AGCTTCAATG TTGTGCACAG TGATGACAGG
GCTTTTAACC ACTATCAAGC CCAGGCAGCC CAGGATGGGG AGGACCCTTT TTCAGACGAT
GTTAATCAGG TGGATAATTT CTTCGAATGG AAGAATGATG GCAATGAGCT ATTGACAGTT
AATGAAGAAA TTCAAGAGGA CGTTTATTTG ATTAATGCGG GATACATAGA CAATGACACC
CTTTCAGAAT ACCTGGATGG AGACTATGAT GACGATGATC TATCTGATAT GGGATTAGAA
GATATATTTG GCGACAATTA CGAAACACAA GAGCCCGAAA ACATGGATAT TAATCCTAAC
AAATTGATGT ACCGTGGAAA TGGTGAAAAC GGTAAACGCC TGGCTAATTT AAAAGATGAA
GACATTATCA AAGAAGTCCC AAGTGAAGAT AATTTTGATA ATGATGAAAA AATTTCTGGA
GACACAACTT TTAATGACTA TATCGATGCT GTGGTTTCAG ACCTGGGTGT AGAAGCCCAG
GAAGCTGAAC GAATGGTGGA AAACCAGGAA TTACTAGTAG ATCAGCTAGA AAACAGGCGA
GAGGCCGTAA GTGGTGTATC CATGGACGAG GAAATGACCA AAATGGTCCA ACAACAACAC
GCCTACAACG CTGCTTCTCG AGTAGTAACC ACCATAGATG AATCCCTGGA TACCATAATA
AATCAAATGG GTATTGTTGG GCGCTAA
 
Protein sequence
MRSTFHGMET AKRALFAQRN ALDTTGHNIA NAERDGYTRQ KAHMSATSPY TMPAQNMPGT 
AGQIGTGVEV ESVERMRDDF IDSQIREESR FKGNWENKND TLKKIETLYT EPSDSGLRSV
FDQFWESLQD LSKNPEDRTA RTQVMERGVS LADTVNHMYG QFVNLKEDLD ESIDIKVDEI
NSIGHQIADL NEQIQNIELR TNQNANDLRD KRDMLLDELS ELTNYNLSED DKGNVRVSIG
GTSLVEGNRV NELEFEAEAA DADNDNGNNE GNGNIYLDDQ EPRGGKVKWS HTGEEVNFRD
GAMGGLMDSR DKIVQDHIDE LRSVMAQFQE SFNVVHSDDR AFNHYQAQAA QDGEDPFSDD
VNQVDNFFEW KNDGNELLTV NEEIQEDVYL INAGYIDNDT LSEYLDGDYD DDDLSDMGLE
DIFGDNYETQ EPENMDINPN KLMYRGNGEN GKRLANLKDE DIIKEVPSED NFDNDEKISG
DTTFNDYIDA VVSDLGVEAQ EAERMVENQE LLVDQLENRR EAVSGVSMDE EMTKMVQQQH
AYNAASRVVT TIDESLDTII NQMGIVGR