Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1400 |
Symbol | |
ID | 6314617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1471495 |
End bp | 1473021 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642643780 |
Product | protein of unknown function DUF1078 domain protein |
Protein accession | YP_001917571 |
Protein GI | 188586026 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.153228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.872762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAGGT CAATGTATTC TGGTGTAAGC GGATTGAGAA ATCATCAAAC CCGTTTGGAT ACAATCGGAA ATAATGTGGC AAATGTAAAT ACTGCAGGCT TTAAAGGGGA TCGAGTTACC TTTCAAGATG CTTTTAATCA GACTTTAGAA GCGGCATCTG CTCCTAATGA GCGAGGCGGA ACCAATCCCC AACAGGTTGG TCTGGGTATG AATATCGGAA GTATGGATAC AATCCACACC CAGGGTAGTT TGGAAAACAC TGGTCGAGAT ACTGATCTGG CCATAGAAGG GGATGGTTTT TTCGTAGTCA ATGACGGACA AAGCGATATG TTTACAAGAG TGGGTAATTT TGGAGTAGAT AGTGAGGGGA ACTTGGTTTC TCAAGGCACA GGTTACAAAG TTCAGGGTTA TGCTTACGAT GAAGATGCCG ATGAAATTAA TACTGACGAA ATCGTAGATA TGGAAATTCC CTTAGGAGAT GTGGTGGACC CTGAAGCTAC TGAAAATGTA AAGTATAATG GAAATTTGAA CTCAAATACT GAACCAGGAG AAACCGTTTC TACCACTGTT AATGTGTTTG ATAATTTAGG AAGTGAGCAT ACATTGAATT TGGAATTTGA AGAGTCAGGT GAAAATGAAT GGCAGATGAC TGTACGAAAA GATGGGATTT TGATCGAAGA CGAAATCCAA GTCTCCTTTT CAGATGATGG GGAGCTAACA AGTGTTGGCG ATGAAGGTAC TTCTTATTCA ATCAACGATT TTGAAGCTAC GGTTCAGGAC TTTGGTGGAC TGTCAGATAG TGCAAAAGAA GACCTGGGTT TTGATATAGA TGACGATAAT GGACATGACG AAGATAACAT CCAAGGCATG TTAATGAGAA ACGATGTTCA AGCTGGAAAT GCTGAGGCGG AAGATGTGGC GGCTTTAGCC ATTGAAGTAG AAGGCAGATA CGAATGGTAT CATCCCGATG ATATAAACTT TGATAGTTTA GAGGAAGGCG ATTACAGTCC CGATGTTGAC CCTATTTTCA CCCAAGATCA GGAACCTCAA AATGATGAAG AAGTTGATCT GCTAAACGCT CCCGGTGGAA TTGAATTCGA CCTTGGCGAT GTAAGACAGG TTGCCGGAGA ATCTACAGTT GAAGGTGTAG CTGAAGATGG ACAGGCCCAG GGTGAACTTG AAACCTATGA AGTTGATTCA GCAGGAATTA TTACAGGAAC TTACACCAAT GGTGAAACTC GTCAACTAGG CCAAGTAGTC CTATCAGATT TCACCAATCC CGGTGGACTT GAAAAGCTAG GAAGCGGTCT TTATCAACCT ACCAGGAACT CGGGTGAGCC AAGTTACGGT ATAGCAGGAA CCGGTGGACG AGGTGAAATC GCCCCTGGTG CCCTGGAAAT GTCCAATGTA GATATGGGAG AGCAGTTTAC TGATATGATC GTAACTCAAA GAGGTTTTCA GGCTAATTCC AGGTCAATAA CCACTGGGGA CGAAGTTCTT CAGGAACTCG TTAATCTGAA AACCTAG
|
Protein sequence | MLRSMYSGVS GLRNHQTRLD TIGNNVANVN TAGFKGDRVT FQDAFNQTLE AASAPNERGG TNPQQVGLGM NIGSMDTIHT QGSLENTGRD TDLAIEGDGF FVVNDGQSDM FTRVGNFGVD SEGNLVSQGT GYKVQGYAYD EDADEINTDE IVDMEIPLGD VVDPEATENV KYNGNLNSNT EPGETVSTTV NVFDNLGSEH TLNLEFEESG ENEWQMTVRK DGILIEDEIQ VSFSDDGELT SVGDEGTSYS INDFEATVQD FGGLSDSAKE DLGFDIDDDN GHDEDNIQGM LMRNDVQAGN AEAEDVAALA IEVEGRYEWY HPDDINFDSL EEGDYSPDVD PIFTQDQEPQ NDEEVDLLNA PGGIEFDLGD VRQVAGESTV EGVAEDGQAQ GELETYEVDS AGIITGTYTN GETRQLGQVV LSDFTNPGGL EKLGSGLYQP TRNSGEPSYG IAGTGGRGEI APGALEMSNV DMGEQFTDMI VTQRGFQANS RSITTGDEVL QELVNLKT
|
| |