Gene Nther_2103 details


Gene Information

Locus tag: Nther_2103
Symbol:
ID: 6316107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2219277
End bp: 2220992
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 40%
IMG OID: 642644491
Product: type II secretion system protein E
Protein accession: YP_001918258
Protein GI: 188586713
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
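
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 2219277
end_bp = 2220992

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # 1716 0 571
```

Under these assumptions the result agrees with the listed gene length of 1716 bp and protein length of 571 aa.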


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTCGTC AGAAGAAACA AAGATTAGGG GATTTGTTGT TGGAAAGCGG TGCTATCACC 
GAAGAAGACT TAAAACAAGC CCTTGACCAT CAAAATAAGT CAGGACAAAA GTTAGGGGCT
AGCCTGGTAG ATCTGGGAAT AATTACAGAA GAAGAGATTA TTGAAGTATT GGAATTCCAA
CTTGGAATAC CCCATGTATC CCTGTCCCAA TACGATACCA ATAGAGAAAC TGCTACACTC
ATTCCTGCCT ATTTAGCAGA ACGGTATCAA GTGCTTCCCA TAGATAATAG AAGTGGAAAA
TTAGTACTGG CCATGGGTGA CCCTTTAAAT GTGGTTGCTA TTGATGACGT CAAAATGGCT
ACCGGTATGG AAGTTGAGCC TGTTATAGCA TCTCCTCGAG AAATCGAAGG TGAGATTAAC
CGTCACTTTG GAATCCAGGA TTCTGTTGAA AAAGCTATAG AAGAAATTGA AGGCTCAGCC
GAGGAAGAGG CAGAAAGTGA AATAGCTGCA ACTGAAGAAG AAGAGTTATC AAATCTAGAA
ACTAATGCAC CTGTTGTCAA AGTGGTCAAT TCTTTAGTGT CTCAAGCATA CGAGCAGGGT
GCTAGTGATA TACATATTGA ACCTACTAAA CAGGGGATGC AGATACGGTA CCGAATAGAT
GGAGTCTTGC ATAATGTGGC AACTCCTCCC CGGTATGCTA AAGATCTGTT GATCAGTCGT
GTGAAAATTA TGGCTGGTAT GGACATTACA AAAAAAAGAA TCCCCCAAGA TGGTAGGAGC
AATTATAATA TAGGAGGACA TGAAATTGAC TTAAGGGTAT CTACTTTACC GACAATTTAC
GGTGAAAAAG TAGTGATCCG CTTGCTTCAT AAAGATAAAG TGATTTTTTC ACTGGATAAA
TTGGGGTTTC AACAGGATAA TTTTAAGCTC TATCAAGGAT TATTGAAAAA CAGTGCCGGG
ATGGTTTTGG TTACCGGCCC TACTGGATGT GGTAAAACTA CTACTCTCTA TTCTTCCCTC
AACCGGATAA ACAGTTCCGA GAAAAATATA ATTACTATAG AGGATCCAGT GGAATATCAG
ATTGAAGGGA TTAATCAAGT TCAAACCAAT GAAAAAGGTG GATTGACCTT CGCAAACGGT
TTAAGAGCTA TTTTACGTCA AGATCCGGAT ATCATCATGG TAGGGGAAAT CAGAGACTTA
GAAACTGCTC AAATTGCAAT TAGATCGGCC CTGACAGGAC ATTTAGTGTT TTCTACATTA
CATACTAATA ATGCGATTGC CACCCTTTCC AGGCTAGTGG ATATGGGAAT CCCGCCTTTT
TTAGTTAGCT CTGCTGTGGA AGGAGTATTA TCCCAGAGGT TGGTAAGGAT AATCTGTTCC
AATTGCAAAA TTGAATACAG CCCCACGGCC GAGGAACAAG AAATATATCA CCGTTACTCG
GGGGAACAGG TGGATACCCT TTATAAAGGT AAAGGCTGTA CCAACTGTAA TAATACTGGT
TATAAAGGTC GGACTGCAAT TCATGAACTA TTGATACTTG ATAAAACTTT AAAAGATATG
CTAGCTAGAG AAGCTTCTGA AAGAGAACTG ACAGAGGAAG CTCGTAAAAG AGGTTTTTCA
TATTTAATCG AAAACGCCAT TTCTAAAGTC AGTCAGGGTA TCACAACTAT GGAAGAAGTT
ATTAGAGCTA CCTTTCATCA GGAAAGCCAT CTCTAA
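
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; the record does not state how its 40% figure was rounded. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "GTGTTTCGTC AGAAGAAACA AAGATTAGGG GATTTGTTGT TGGAAAGCGG TGCTATCACC"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

Whitespace is stripped before counting, so the sequence can be pasted in its 10-base blocks exactly as it appears above.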
 
Protein sequence
MFRQKKQRLG DLLLESGAIT EEDLKQALDH QNKSGQKLGA SLVDLGIITE EEIIEVLEFQ 
LGIPHVSLSQ YDTNRETATL IPAYLAERYQ VLPIDNRSGK LVLAMGDPLN VVAIDDVKMA
TGMEVEPVIA SPREIEGEIN RHFGIQDSVE KAIEEIEGSA EEEAESEIAA TEEEELSNLE
TNAPVVKVVN SLVSQAYEQG ASDIHIEPTK QGMQIRYRID GVLHNVATPP RYAKDLLISR
VKIMAGMDIT KKRIPQDGRS NYNIGGHEID LRVSTLPTIY GEKVVIRLLH KDKVIFSLDK
LGFQQDNFKL YQGLLKNSAG MVLVTGPTGC GKTTTLYSSL NRINSSEKNI ITIEDPVEYQ
IEGINQVQTN EKGGLTFANG LRAILRQDPD IIMVGEIRDL ETAQIAIRSA LTGHLVFSTL
HTNNAIATLS RLVDMGIPPF LVSSAVEGVL SQRLVRIICS NCKIEYSPTA EEQEIYHRYS
GEQVDTLYKG KGCTNCNNTG YKGRTAIHEL LILDKTLKDM LAREASEREL TEEARKRGFS
YLIENAISKV SQGITTMEEV IRATFHQESH L
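
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 uses the standard codon-to-amino-acid assignments and differs only in its set of permitted start codons, so that an initial GTG is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical and are not taken from any library.

```python
BASES = "TCAG"
# Standard codon assignments, ordered by first, second, then third base over TCAG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence given 5' to 3', ignoring whitespace."""
    dna = "".join(sequence.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    # Assumption: the first codon is a start codon, so it is read as Met
    # even when it is GTG or TTG rather than ATG.
    protein[0] = "M"
    # Drop a terminal stop codon, if present.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTTTCGTC AGAAGAAACA AAGATTAGGG"))  # MFRQKKQRLG
```

The output matches the first ten residues of the protein sequence. Passing the full 1716 bp gene sequence should give a 571-residue string that can be compared with the protein sequence above.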