Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2231 |
Symbol | |
ID | 6315232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2369330 |
End bp | 2370481 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642644619 |
Product | flagellin domain protein |
Protein accession | YP_001918385 |
Protein GI | 188586840 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.000000159197 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAATTA ACACGAACAT TGAAGCGTTG AATGCTCACC GCAACTTAGA GCAAACTAAT CAAAACATGC AGAAAAACCT AGAAAGATTG TCAAGTGGAC AACGAATCAA CAGAGCCGCT GATGATGCAG CAGGTCTAAG CATTTCAGAA AAAATGAGAG GTCAGGTAAG TGGTCTTGAT CAGGCAGTAA GAAACGCTCA GGACAGTATC AGCTTAATCC AAACTGCAGA AGGTGCTCTA GAAGAATCCC ATAGTATACT GCAGAGAATG AGAGAGCTAG CTGTACAATC AGCTAACGAC ACAAATATCG ATGCAGACAG AGGGGAAATT CAAGAGGAAA TTGATCAACT TGCAGAAGAA TTACACAGAA TCGAAGAAAC CACAGAATTT AACACACAAA ATCTAGTTGA CGGTGATTTT GCTGGAACTT TCCATATCGG TGCAAATGAG GGTCAAAACT TACAATTGAA CATTGATAAT ATGGGAGCGT CAGAATTAGG TGTTGGTGAA GAATATACTG CTACTAATGA TGAGGCGGAT TTAGATGAAA TAGCTGTTGC TGATGGTGAA ACCTATGAAG TTGTTGAACT TGATGAAACA CTAGATGGGG TATTTGATGG GAATGGAGAT GCTCACTACG GACTAGTCAA TAATGAAGGA GAATATGTAG CACTAGCTGA TGATGAAGCA CAAGAATTTG AATTTCTCGA TGATGCAGAA GCAGATTTAG ACGAAATTGG AAGTGACGAA GTTTCAGACG AATCTGTTGA TTTCGGAAAC ACAGTAGACC GCGGTAGTGT AACTATAGAA GAAAATGATG AAGGGGAGAT AGAAGCTACA GCAAGAATGG GAATAGAAGT AGATGAACAA GAATCAGCTG ATGAAGCAAT CACTGATATA GATAACGCTA TAGATACAGT ATCTTCCCAG CGTTCTGAAC TGGGAGCTCT TCAGAACAGA TTAGAGCACA CTATCAACAA CTTAAGTGTA GCTTCTGAAA ACTTATCTGC TGCAGAATCC AGAATCAGAG ACGTTGACAT GGCAGAAGAA ATGATGGACT TCTCAACTCA GCAGGTACTT GAAGAAGCAG GTACAGCTAT GATGGCTCAG GCCAACATGC AGCCTCAATC AGTTCTTCAG CTTCTTCAGT AA
|
Protein sequence | MRINTNIEAL NAHRNLEQTN QNMQKNLERL SSGQRINRAA DDAAGLSISE KMRGQVSGLD QAVRNAQDSI SLIQTAEGAL EESHSILQRM RELAVQSAND TNIDADRGEI QEEIDQLAEE LHRIEETTEF NTQNLVDGDF AGTFHIGANE GQNLQLNIDN MGASELGVGE EYTATNDEAD LDEIAVADGE TYEVVELDET LDGVFDGNGD AHYGLVNNEG EYVALADDEA QEFEFLDDAE ADLDEIGSDE VSDESVDFGN TVDRGSVTIE ENDEGEIEAT ARMGIEVDEQ ESADEAITDI DNAIDTVSSQ RSELGALQNR LEHTINNLSV ASENLSAAES RIRDVDMAEE MMDFSTQQVL EEAGTAMMAQ ANMQPQSVLQ LLQ
|
| |