Gene Nther_2231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2231 
Symbol 
ID6315232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2369330 
End bp2370481 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content41% 
IMG OID642644619 
Productflagellin domain protein 
Protein accessionYP_001918385 
Protein GI188586840 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.000000159197 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAATTA ACACGAACAT TGAAGCGTTG AATGCTCACC GCAACTTAGA GCAAACTAAT 
CAAAACATGC AGAAAAACCT AGAAAGATTG TCAAGTGGAC AACGAATCAA CAGAGCCGCT
GATGATGCAG CAGGTCTAAG CATTTCAGAA AAAATGAGAG GTCAGGTAAG TGGTCTTGAT
CAGGCAGTAA GAAACGCTCA GGACAGTATC AGCTTAATCC AAACTGCAGA AGGTGCTCTA
GAAGAATCCC ATAGTATACT GCAGAGAATG AGAGAGCTAG CTGTACAATC AGCTAACGAC
ACAAATATCG ATGCAGACAG AGGGGAAATT CAAGAGGAAA TTGATCAACT TGCAGAAGAA
TTACACAGAA TCGAAGAAAC CACAGAATTT AACACACAAA ATCTAGTTGA CGGTGATTTT
GCTGGAACTT TCCATATCGG TGCAAATGAG GGTCAAAACT TACAATTGAA CATTGATAAT
ATGGGAGCGT CAGAATTAGG TGTTGGTGAA GAATATACTG CTACTAATGA TGAGGCGGAT
TTAGATGAAA TAGCTGTTGC TGATGGTGAA ACCTATGAAG TTGTTGAACT TGATGAAACA
CTAGATGGGG TATTTGATGG GAATGGAGAT GCTCACTACG GACTAGTCAA TAATGAAGGA
GAATATGTAG CACTAGCTGA TGATGAAGCA CAAGAATTTG AATTTCTCGA TGATGCAGAA
GCAGATTTAG ACGAAATTGG AAGTGACGAA GTTTCAGACG AATCTGTTGA TTTCGGAAAC
ACAGTAGACC GCGGTAGTGT AACTATAGAA GAAAATGATG AAGGGGAGAT AGAAGCTACA
GCAAGAATGG GAATAGAAGT AGATGAACAA GAATCAGCTG ATGAAGCAAT CACTGATATA
GATAACGCTA TAGATACAGT ATCTTCCCAG CGTTCTGAAC TGGGAGCTCT TCAGAACAGA
TTAGAGCACA CTATCAACAA CTTAAGTGTA GCTTCTGAAA ACTTATCTGC TGCAGAATCC
AGAATCAGAG ACGTTGACAT GGCAGAAGAA ATGATGGACT TCTCAACTCA GCAGGTACTT
GAAGAAGCAG GTACAGCTAT GATGGCTCAG GCCAACATGC AGCCTCAATC AGTTCTTCAG
CTTCTTCAGT AA
 
Protein sequence
MRINTNIEAL NAHRNLEQTN QNMQKNLERL SSGQRINRAA DDAAGLSISE KMRGQVSGLD 
QAVRNAQDSI SLIQTAEGAL EESHSILQRM RELAVQSAND TNIDADRGEI QEEIDQLAEE
LHRIEETTEF NTQNLVDGDF AGTFHIGANE GQNLQLNIDN MGASELGVGE EYTATNDEAD
LDEIAVADGE TYEVVELDET LDGVFDGNGD AHYGLVNNEG EYVALADDEA QEFEFLDDAE
ADLDEIGSDE VSDESVDFGN TVDRGSVTIE ENDEGEIEAT ARMGIEVDEQ ESADEAITDI
DNAIDTVSSQ RSELGALQNR LEHTINNLSV ASENLSAAES RIRDVDMAEE MMDFSTQQVL
EEAGTAMMAQ ANMQPQSVLQ LLQ