Gene Nther_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2229 
Symbol 
ID6315230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2367254 
End bp2368849 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content41% 
IMG OID642644617 
Productflagellar hook-associated 2 domain protein 
Protein accessionYP_001918383 
Protein GI188586838 
COG category[N] Cell motility 
COG ID[COG1345] Flagellar capping protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000000532748 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCGACTTG GAGGATTAGC AACAGGTTTT GATACAGAAA ACCTGGTAAA ACAGCTAGCA 
GAATTAGAAA GACAACCTAT TCAGCGCCAC GAACAGGATA TCCAAGAGAT AGAAGGAGTT
AAAGACGCAT GGCGAAATGT CAATAAACTG TTATCGGGAG TTAGTGATGA ACTCTCTGGA
CTTCAGAGTG AAGATACCTT TCAAGGTATG GAAGCCAGCT CTACTAATGA AAATATTGAA
GTTTCCGCTG ACAATCAAGC AGTAGAAGCA GATTATGCAT TTGAAATACA GCAAAAAGCC
CAAAATCAAA GACTGGCTTC ACTAGACACT ATAAACAATG TGGAAAATGA AGATGGTCAC
AAAGGTACTC TGACCTTTGA TTTTGGTGAT GAAGAATTTG ATATTAAAGT GGACGAATCA
GACACCATTC AGGACATCGC TGATAAAATC AATAACAATG AGATAGAAAA TAATGACGAT
AAAACCATTC CCATGGGTGC TACCGTCATA GCTGGTGAAT ACTTAGAGTT TAACATCGAT
GAAGGATATG AACTAGAAAA TATTGAGGCA GACAATGAGG AAAAGGCAGA CAATGATGGA
GACAGTGACT TATTAGGCTG GCTGCAACTA GATGCTGATA TAAAGAACGA TGATGAAGAA
AATAATGATG TTAATGAAGA TGATAGAGAA GTACTGCAAG TGCAAGAAAG CCAAAATGCC
AAAGTCATAG TAAACGGCAT TCAAGGTATT GAATCTTCCA CTAATGAAAT AGAGATCGCC
GATGGTGTGG AAGTAGATAT TTCTGCAGCT ACTGAGAGTG ATATGGAGAT TCAAGAAACC
ATTTCCATAT CTCAAGATAC CGAGACTCTT ATCAATAATG TAGAAAGCTT TGAAGAAAAC
TACAACGAAG CTATCAGTGC AATTGAAGAC AAAACTGATG TTACAGTACT TGAAGATGAT
ACCCAAGAGG GTGAAGAAGA AGTTCAGTCC GGTGTTTTAC AGGGAGATAG TACCCTTAAC
AATATTACCA ATAATCTCAG AAGAGCTATT ACAGATCCGG TAGATGTCAG TGATCAAGAT
ATAGAAGTTG ATGGTAATGA AGTAGATGAA TTAGTTCCTG CACACTTCGG TATTGAAATT
GGCGGTGACA CTGTAGCTGG TGGCTCTTAC AGCGGAGCTG ATGCCAACGA GCTCACTATC
GATTACGATA GATTAGAAGA ACAGCTCCAG GAAAATCCCG AAGCAGTACA AGCTCTATTC
GCCAATGACG GCGAAGGCGG CGGAGAAGGA ATAGCCGTTC GCATGGAAGA GTACTTGGAT
GGCGTGGTTT ACGACGGTGA ACTGGACTCT ACTGGACTTC GAAATCAAAC CCAAGACCAG
AATCAAAGTG TTATCGAGAA CCGGCTTACC ACCGAAGCCA ATGAAATTGA ACGCACTGAA
AGCAGAATAG ATACTATTGA AGACCGGGCG GAAATGCGTG AAGAACAACT TTGGTCACAA
TTCACAGCTA TGGAAGAAGG AATACAAAGC GCTCAGGCTG AATCACAGTC TCTAATGCAG
GCTATGGGAG GCGGTATGGG CGGAATGATG ATGTAA
 
Protein sequence
MRLGGLATGF DTENLVKQLA ELERQPIQRH EQDIQEIEGV KDAWRNVNKL LSGVSDELSG 
LQSEDTFQGM EASSTNENIE VSADNQAVEA DYAFEIQQKA QNQRLASLDT INNVENEDGH
KGTLTFDFGD EEFDIKVDES DTIQDIADKI NNNEIENNDD KTIPMGATVI AGEYLEFNID
EGYELENIEA DNEEKADNDG DSDLLGWLQL DADIKNDDEE NNDVNEDDRE VLQVQESQNA
KVIVNGIQGI ESSTNEIEIA DGVEVDISAA TESDMEIQET ISISQDTETL INNVESFEEN
YNEAISAIED KTDVTVLEDD TQEGEEEVQS GVLQGDSTLN NITNNLRRAI TDPVDVSDQD
IEVDGNEVDE LVPAHFGIEI GGDTVAGGSY SGADANELTI DYDRLEEQLQ ENPEAVQALF
ANDGEGGGEG IAVRMEEYLD GVVYDGELDS TGLRNQTQDQ NQSVIENRLT TEANEIERTE
SRIDTIEDRA EMREEQLWSQ FTAMEEGIQS AQAESQSLMQ AMGGGMGGMM M