Gene Nther_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2101 
Symbol 
ID6316105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2216754 
End bp2217968 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content37% 
IMG OID642644489 
Producttype II secretion system protein 
Protein accessionYP_001918256 
Protein GI188586711 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCAA AATATTCATA TAAAGCCAGG GATCCAAGTG GAAACCTGAT CAAGGGTAGT 
CAAGAAACTG AGGACCGTCA AAGCTTAATA CAAGAATTAA AATCAGATCA GCTTTACATA
GTTGAAATTA AACAGACAGG TAAAGAATTC AGTTTTAGTA ATCTGAAAAA ATATTTTCAA
AAACCTGTAG CTTCCAGGGA CCTGATGATT TTTTCTAGGC AGTTTGCCAC TATGATCAAA
GCCGGGGTTA CAGTGTTAGA TAGTTTAGAG TCTTTGTCAG AGCAGATGGA TAGCAAAAAA
TTGTCTCGAG GCCTGGTGAA TGTGACAGAA AGTTTGAGAG GTGGTAATAG CCTGGCCGAA
TCGTTTCACG AACAAAAAGA TGTGTTTCCC GAATTATTTG TTAATATGAT CAGTGCTGGT
GAAGAAGCTG GTGCCCTTGA GGACGTTTTG GAACGAATGG CAGTTCACTA TGAGAAGGAA
CATCAGCTTT TGGAAAAAAT CAAAAGTGCC ATGACGTATC CCGTAATACT TTTAGGAGTG
GCCTTCTTAG TTGTATATTT TTTAGTGACT TATGTACTGC CCGAATTTGC AGGGATTTTC
GCAGGAATGG ACGTAGAGCT TCCGTTATTA ACTGAAATTA TGTTATTTAC TGGAAAAGCT
TTAAGGGCCA ATATACAATG GTTATTAATA ATTTTAATTG TCCTAAGTTT AGCTGGCTAT
TATCTCTTGA ATACCGATGG AGGAAAATAT TTATATGACA AGGCAAAATT AACTTTACCA
GTACTGGGAG GAGTTAGCAA AAAAGTTATT ATAGCCAGGT TTTCTCGTAT TTTAACTACT
CTGATTGGCA GTGGAATTAC TTTAATGGAT GCTTTAGGAT TAGTTAAAAA AACTATTGGA
AATAAGTTAA TGGAAAAAGT CTTAGATGAG GCCGTGGATA ATATAGAACA GGGACAGACC
ATGTCAAAAC CCTTTGAGGA AAGTGAGTTA TTTCCGCCTC TTGTGAGCAA GATGATGTCT
GTAGGTGAGG AAACGGGAGC AATTGAAGAG ATGATGGATA AAGTAGCTGA TTTTTATGAA
CAGGAAAGTA GTTATACTCT GGACAGGCTC AGTGCCCTGA TAGAGCCGGT AATGATTTTA
ATTTTAATGG TTGTAGTAGC TGTGATAGTA CTGGCAGTTG TATTACCTAT GGTGGAAATG
TGGCAAATTT ATTAA
 
Protein sequence
MTPKYSYKAR DPSGNLIKGS QETEDRQSLI QELKSDQLYI VEIKQTGKEF SFSNLKKYFQ 
KPVASRDLMI FSRQFATMIK AGVTVLDSLE SLSEQMDSKK LSRGLVNVTE SLRGGNSLAE
SFHEQKDVFP ELFVNMISAG EEAGALEDVL ERMAVHYEKE HQLLEKIKSA MTYPVILLGV
AFLVVYFLVT YVLPEFAGIF AGMDVELPLL TEIMLFTGKA LRANIQWLLI ILIVLSLAGY
YLLNTDGGKY LYDKAKLTLP VLGGVSKKVI IARFSRILTT LIGSGITLMD ALGLVKKTIG
NKLMEKVLDE AVDNIEQGQT MSKPFEESEL FPPLVSKMMS VGEETGAIEE MMDKVADFYE
QESSYTLDRL SALIEPVMIL ILMVVVAVIV LAVVLPMVEM WQIY