Gene GWCH70_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3024 
Symbol 
ID7977389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3039757 
End bp3040776 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content44% 
IMG OID644799819 
Productflagellin domain protein 
Protein accessionYP_002950958 
Protein GI239828334 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTA ACCACAATAT TGCGGCGTTG AACACGTATC GTCAATTAAC GATCGGTCAA 
AACGCAGCAA CGAAAAATAT GGAAAAACTT TCTTCCGGTC TTCGCATTAA CCGCGCGGGC
GACGATGCGG CAGGGCTTGC GATTTCGGAA AAAATGCGTG GGCAAATTCG TGGGTTGGAG
CAAGCGTCAC GCAATGCACA GGACGGAATC TCGCTCATCC AGACAGCAGA AGGGGCTTTA
AATGAAGTTC ATTCTATTCT TCAACGTATG CGTGAATTGG CTGTTCAAGC TGCGAATGAC
ACGAATACAG GTACTGACCG TGACGAGATT CAAAAAGAAA TTAATCAATT AACTTCTGAA
ATTAATCGTA TCGGTAATAC AACTGAGTTC AATACTCAAA AATTATTAAA CGGAACAAGA
ACTACTACTC CTTTATCTAC ACCTGCAGGG GCAACACAGA ATGGTGGTAA TGATCCACAA
TCTGCAGGGG CAACACAGAA TGGTGGTAAT GATCCACAAT CTGCAGGGGC AACACAGAAT
GGTGGTAATG ATCCACAATC TGCAGGGGCA ACACAGAATG GTGGTAATGA TCCACAATCT
GCAGGGGCAA CACAGAATGG TGGTAATGAT TCACAATCTG CAGGGACAAT AGAAATAACT
TTACAAATTG GTGCGAACCA AAGCCAAAGC TTAACAATTG ACATTCAGGA CATGCGTGCT
AGGGCACTTG GTATCACAGG AACAGCTGGT ACTTCAGGGT TTACTGCTAC AAACACTGTC
ACTGATGGTA CAAACAACGA TACTGTGGAA GCTGCACTTG ATGTATCGAA TGCAGCGAAT
GCATCTGCTG CCATCACAAG GATTCAAATT GCAATCGATA GAGTATCTGC AGAGCGTTCA
AAGCTAGGTG CTTATCAAAA TCGTTTAGAA CATACTATTA ATAATCTCAG CACCTCTGCA
GAAAACCTAC AAGCCGCAGA ATCCCGCATC CGCGACGTAG ATTATGCCTT AGCTGCCTAA
 
Protein sequence
MRINHNIAAL NTYRQLTIGQ NAATKNMEKL SSGLRINRAG DDAAGLAISE KMRGQIRGLE 
QASRNAQDGI SLIQTAEGAL NEVHSILQRM RELAVQAAND TNTGTDRDEI QKEINQLTSE
INRIGNTTEF NTQKLLNGTR TTTPLSTPAG ATQNGGNDPQ SAGATQNGGN DPQSAGATQN
GGNDPQSAGA TQNGGNDPQS AGATQNGGND SQSAGTIEIT LQIGANQSQS LTIDIQDMRA
RALGITGTAG TSGFTATNTV TDGTNNDTVE AALDVSNAAN ASAAITRIQI AIDRVSAERS
KLGAYQNRLE HTINNLSTSA ENLQAAESRI RDVDYALAA