Gene Nther_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2100 
Symbol 
ID6316104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2215652 
End bp2216695 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content37% 
IMG OID642644488 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_001918255 
Protein GI188586710 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAGGG GAAAGTTGAT TCCTGAACAG GTACCACTGG GAATAGACTT AGGAAGATCC 
AGTATTAAAA TAGTTGAACT AAGTAAATTG CGGTCTAAGT TTGTACTCAA AAGATTCACT
ATCGAAGAAT TTTCGGTACC CCAAGATGAT GAAGGGGATA ATCAGGGAGA GCAAATTGGA
CAGTATCTGC GAGAATGTCT GGATAAATAC AAAATAAAAA ATAAAAATGC TTATTTAGCA
ATAAAAAATG ACGATGTAGT TATCAGAACT ATCTCATTCC CTCCTATGCC CCAATCGGAA
TTAGCTCAGG CGGTTGACTA TGAGGCAGAT AATTATATTA TGATGCCCAA AAACCAGGCT
AATGTTGATT GGCATATCCT AAAACAAGAT GAACACGGAA TCACAGTCTT ACTTTTGGCT
GCGAGAAAAG ACTTAATAAA CAAATACCAT GAAGTTTGCG AGACTGCTGG AATTAAATTA
AAAGCCTTAG ATGTAGAAGT TTTTTCCTTG AAAAGGCTAG TTGATTTTTT GGAAACACAT
AACAAGGATA ACTTTTCAGG GAAGGTAGAT GGTCATTTGA CTTTAACTCT AGATATGGGA
GCGGGTGGCA CAACTTTATT ATTTACTAAA AAAAATAACT ATGTGTTTTC CAGAAACATT
TCTATTGGTG GAATGGATTT TACCAAGGCT ATTTCCCAGG AAGCAGGTAT TAGCTTCCAA
GAAGCGGAAC AAAACAAAGA TCAGGTTAAT TACTTAGAAT ATAATAGTGT TCTAGGACAA
GCCCAAGACT TACTAAGAGA AATAGCAAGA TCGGTAGAAT ATGTAAATTC ACAAAAACTG
TCCAAGGGTG ATCCAGAACA ATTATATGTT ACTGGCGGAG GTTGGCGAAT GAAAGGCCTG
TTAGATTATA TTAGGGAAAA CCTAGGTACT ACCCCAGAGA TAGTTAATCC TTTCAAGCAC
ATTAAATCCA GGGGAGAACT TGTAACTGAA GGTATGATGG CTAGTATTGC TACTGGCTTG
GCCTTACGGA GGTGGACGAA ATGA
 
Protein sequence
MLRGKLIPEQ VPLGIDLGRS SIKIVELSKL RSKFVLKRFT IEEFSVPQDD EGDNQGEQIG 
QYLRECLDKY KIKNKNAYLA IKNDDVVIRT ISFPPMPQSE LAQAVDYEAD NYIMMPKNQA
NVDWHILKQD EHGITVLLLA ARKDLINKYH EVCETAGIKL KALDVEVFSL KRLVDFLETH
NKDNFSGKVD GHLTLTLDMG AGGTTLLFTK KNNYVFSRNI SIGGMDFTKA ISQEAGISFQ
EAEQNKDQVN YLEYNSVLGQ AQDLLREIAR SVEYVNSQKL SKGDPEQLYV TGGGWRMKGL
LDYIRENLGT TPEIVNPFKH IKSRGELVTE GMMASIATGL ALRRWTK