Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2103 |
Symbol | |
ID | 6316107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2219277 |
End bp | 2220992 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642644491 |
Product | type II secretion system protein E |
Protein accession | YP_001918258 |
Protein GI | 188586713 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTCGTC AGAAGAAACA AAGATTAGGG GATTTGTTGT TGGAAAGCGG TGCTATCACC GAAGAAGACT TAAAACAAGC CCTTGACCAT CAAAATAAGT CAGGACAAAA GTTAGGGGCT AGCCTGGTAG ATCTGGGAAT AATTACAGAA GAAGAGATTA TTGAAGTATT GGAATTCCAA CTTGGAATAC CCCATGTATC CCTGTCCCAA TACGATACCA ATAGAGAAAC TGCTACACTC ATTCCTGCCT ATTTAGCAGA ACGGTATCAA GTGCTTCCCA TAGATAATAG AAGTGGAAAA TTAGTACTGG CCATGGGTGA CCCTTTAAAT GTGGTTGCTA TTGATGACGT CAAAATGGCT ACCGGTATGG AAGTTGAGCC TGTTATAGCA TCTCCTCGAG AAATCGAAGG TGAGATTAAC CGTCACTTTG GAATCCAGGA TTCTGTTGAA AAAGCTATAG AAGAAATTGA AGGCTCAGCC GAGGAAGAGG CAGAAAGTGA AATAGCTGCA ACTGAAGAAG AAGAGTTATC AAATCTAGAA ACTAATGCAC CTGTTGTCAA AGTGGTCAAT TCTTTAGTGT CTCAAGCATA CGAGCAGGGT GCTAGTGATA TACATATTGA ACCTACTAAA CAGGGGATGC AGATACGGTA CCGAATAGAT GGAGTCTTGC ATAATGTGGC AACTCCTCCC CGGTATGCTA AAGATCTGTT GATCAGTCGT GTGAAAATTA TGGCTGGTAT GGACATTACA AAAAAAAGAA TCCCCCAAGA TGGTAGGAGC AATTATAATA TAGGAGGACA TGAAATTGAC TTAAGGGTAT CTACTTTACC GACAATTTAC GGTGAAAAAG TAGTGATCCG CTTGCTTCAT AAAGATAAAG TGATTTTTTC ACTGGATAAA TTGGGGTTTC AACAGGATAA TTTTAAGCTC TATCAAGGAT TATTGAAAAA CAGTGCCGGG ATGGTTTTGG TTACCGGCCC TACTGGATGT GGTAAAACTA CTACTCTCTA TTCTTCCCTC AACCGGATAA ACAGTTCCGA GAAAAATATA ATTACTATAG AGGATCCAGT GGAATATCAG ATTGAAGGGA TTAATCAAGT TCAAACCAAT GAAAAAGGTG GATTGACCTT CGCAAACGGT TTAAGAGCTA TTTTACGTCA AGATCCGGAT ATCATCATGG TAGGGGAAAT CAGAGACTTA GAAACTGCTC AAATTGCAAT TAGATCGGCC CTGACAGGAC ATTTAGTGTT TTCTACATTA CATACTAATA ATGCGATTGC CACCCTTTCC AGGCTAGTGG ATATGGGAAT CCCGCCTTTT TTAGTTAGCT CTGCTGTGGA AGGAGTATTA TCCCAGAGGT TGGTAAGGAT AATCTGTTCC AATTGCAAAA TTGAATACAG CCCCACGGCC GAGGAACAAG AAATATATCA CCGTTACTCG GGGGAACAGG TGGATACCCT TTATAAAGGT AAAGGCTGTA CCAACTGTAA TAATACTGGT TATAAAGGTC GGACTGCAAT TCATGAACTA TTGATACTTG ATAAAACTTT AAAAGATATG CTAGCTAGAG AAGCTTCTGA AAGAGAACTG ACAGAGGAAG CTCGTAAAAG AGGTTTTTCA TATTTAATCG AAAACGCCAT TTCTAAAGTC AGTCAGGGTA TCACAACTAT GGAAGAAGTT ATTAGAGCTA CCTTTCATCA GGAAAGCCAT CTCTAA
|
Protein sequence | MFRQKKQRLG DLLLESGAIT EEDLKQALDH QNKSGQKLGA SLVDLGIITE EEIIEVLEFQ LGIPHVSLSQ YDTNRETATL IPAYLAERYQ VLPIDNRSGK LVLAMGDPLN VVAIDDVKMA TGMEVEPVIA SPREIEGEIN RHFGIQDSVE KAIEEIEGSA EEEAESEIAA TEEEELSNLE TNAPVVKVVN SLVSQAYEQG ASDIHIEPTK QGMQIRYRID GVLHNVATPP RYAKDLLISR VKIMAGMDIT KKRIPQDGRS NYNIGGHEID LRVSTLPTIY GEKVVIRLLH KDKVIFSLDK LGFQQDNFKL YQGLLKNSAG MVLVTGPTGC GKTTTLYSSL NRINSSEKNI ITIEDPVEYQ IEGINQVQTN EKGGLTFANG LRAILRQDPD IIMVGEIRDL ETAQIAIRSA LTGHLVFSTL HTNNAIATLS RLVDMGIPPF LVSSAVEGVL SQRLVRIICS NCKIEYSPTA EEQEIYHRYS GEQVDTLYKG KGCTNCNNTG YKGRTAIHEL LILDKTLKDM LAREASEREL TEEARKRGFS YLIENAISKV SQGITTMEEV IRATFHQESH L
|
| |