Gene GWCH70_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2371 
Symbol 
ID7979067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2412580 
End bp2413626 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content45% 
IMG OID644799174 
Producttype II secretion system protein E 
Protein accessionYP_002950334 
Protein GI239827710 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA TTGAATATGT TGCCGATCGT CTCATAAAAG AAGCGAGTTT GCTTCATGTA 
TCTGACATTC ATATCGTTCC GCGCAAAGAC GATGCGATTG TGCGTTTCCG GTTAGATGGA
TTGCTGATGG AAAAGGAAGC GCTGACAAAA GAAATGTGCG AGCGGCTTAT TACGCATTTT
AAATTTTTAG CAGGGATGGA CATTGGCGAA CGCCGCCGTC CGCAAAGCGG AGCGATGGAA
GCAAGGCATC AGGAAGAAAT CATTCACTTA CGCCTCTCCA CATTACCGAC ATCGTATGAT
GAAAGCCTCG TTATCCGGCT TCTTCCGCAG AATTTTTTTA TTCCTCGATC ACAACTTTCT
CTATTTGCAA ATGCCACGAA AACGTTACTT TCCCTTTTTC GGCAGCCCCA AGGATTAATT
ATTTTTACAG GACCAACTGG ATCAGGCAAA ACGTCAACAT TATATACGTT ATTGCGCATT
TGTCAATATG AGTGGCATCG CAATGTCATC ACATTGGAAG ACCCTGTTGA AAAGCGAATC
GACAACATAT TGCAAGTGCA AATTAATGAG AAAGCGGGAA TTACGTATAC AACCGGTTTA
AAAGCTGTTT TGCGCCATGA TCCGGATGTG ATTATGATCG GCGAAATTCG CGACGCCGAG
ACCGCAAAAA TTGCGGTACG CTCAGCAATG ACGGGACATT TGATTGCTAC GACCATGCAT
ACAAAAAACG CTGTTGGTGC GATTTACCGT TTGCGTGAAT TCGGGATTCC GCTTGGAGAT
ATTGAGCAAA CATTGCTCGC CGTTGTCGCA CAGCGGCTCG TGGACTTAGT ATGCCCGTTT
TGCGGTGAAC ATTGCTCCAT ATTTTGCCGT AAATATCGCC CCATTCGCCG CGCTGCTGTC
CATGAATTGC TGTATGGGAA TGCTTTGTCG AACGCCATTC AATCCGTACA AACAAAGGAA
AAGACGCATC ACTACTATAC GTTGCAACAC GTTATTCGAA AAGGAGTTGC TCTTGGATTT
TTGCCAGCAC ACCTTCTTTA CAGGTAG
 
Protein sequence
MNEIEYVADR LIKEASLLHV SDIHIVPRKD DAIVRFRLDG LLMEKEALTK EMCERLITHF 
KFLAGMDIGE RRRPQSGAME ARHQEEIIHL RLSTLPTSYD ESLVIRLLPQ NFFIPRSQLS
LFANATKTLL SLFRQPQGLI IFTGPTGSGK TSTLYTLLRI CQYEWHRNVI TLEDPVEKRI
DNILQVQINE KAGITYTTGL KAVLRHDPDV IMIGEIRDAE TAKIAVRSAM TGHLIATTMH
TKNAVGAIYR LREFGIPLGD IEQTLLAVVA QRLVDLVCPF CGEHCSIFCR KYRPIRRAAV
HELLYGNALS NAIQSVQTKE KTHHYYTLQH VIRKGVALGF LPAHLLYR