Gene GWCH70_3031 details


Gene Information

Locus tag: GWCH70_3031
Symbol: flgK
ID: 7977396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3045390
End bp: 3047015
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 48%
IMG OID: 644799825
Product: flagellar hook-associated protein FlgK
Protein accession: YP_002950964
Protein GI: 239828340
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
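
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions are consistent with the numbers in this record, but are not stated by it). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3045390, 3047015)
print(length, protein_length_aa(length))  # prints: 1626 541
```

Both results agree with the Gene Length (1626 bp) and Protein Length (541 aa) fields.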


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTCGA CATTTCATGG ATTAGAAGCC GCTAAGCGTG CCATGATGAC CCAGCAAGCT 
GCGCTGTACG TAACAGGGCA TAACGTCGCG AACGCAAACA CACCTGGATA TACAAGACAG
CGTGTCAATT TTGTACAGAC GGAGCCATTT CCCTCCCCTG GTCTGAATCG TCCGCAAATC
CCAGGGCAGA TGGGGACAGG TGTAAAAGCA GGTTCGGTTG AGCGTGTGCG CGAATACTTT
TTAGACATTC AATATCGCGG GGAAAACAAC AAGCTTGGTT ATTGGGAAGC GCGCGCGGAT
GCCATCGCCA AGATGGAAGA CATCATGAAC GAACCGTCGG ATAACGGATT GGCGAAAACG
ATGGATCAGT TTTGGCAAGC GCTGCAAGAC TTAAGCGTCA ATCCCGAAAA CGAAGGGGCG
CGGTCTGTTG TACGCCAGCG CGGACTTGCC GTAGTCGAAA CGTTCCATTA TTTATCCAAC
TCGTTATCGC AAATTCAAAC CGATATCGGC ACGCAGATTG GTGTGACGAT TACACAAATT
AACTCTCTCG CAAAACAAAT CAGCGAGATT AACCAACAAA TTGCGAGTGT CGAACCGAAT
GGCTATTTGC CGAACGATCT GTATGACGAA CGTGACCGTC TTATTGACGA GCTTTCCAAA
CTGATTAACG TACAAGTCGA AAAACATCCA ACCGGCGGAA ACGCGCCAGC AACGGCGGAA
GGGACATATG ATATTTATTT CCTCAACGGA AACGAAAAAG TATATCTCGT CCAAGGCGGG
GATTTTCAAT CGATTTCATT TCCAGATGGT CAAGATGTGG ACGGGGACAA AGAGTACATC
AAGGAAATGC CGCCAGTTAC GGGAGTCACA GAGTTGCAAG TGGGCGGCAC GTCCATTTCC
TTCACCGATA ATAACAATCA AGTTACATTT CCAATGGGAA AACTGCGCGG GCTCATCGAA
GCGTACGGCT ATGTAAGTGG GCAGGATAGC AACGGTCAGC CAATTGTGGC AGGTATTTAT
CCAGATATGT TAAATAACCT AGATAAGCTT GCATATACGT TTGGCAAATT GTTTAATGCA
GTTCACGAAA AAGGATATGG GCTCAACGGC GAAACGGGTG TTTCCTTTTT CGATGGACTT
GGTCAAGAGG CAAAAGGAGC GGCAAAAACG ATTCGTCTTT CTGCCGATAT TGACGACCTT
GCGAACATTG CTGCTTCCAC GGAAGAAGGA AAGCCAGGAA ACGGAAACAA TGCGATTAAT
CTCGCGAATG TTGGCAGCAT GTTGCTTTCT GCGGATACGG TCAGTTTAAT TGGGACAACA
AACACGATCC AGATCAGCAC GCTCAATCTG CCGTTGACTT CTGGCACGAT CCAAACGAAC
TACCAAGGCT GGATCGGCAA ACTCGGTGTC GACGGCCAGC AAGCCAACCG GATGAAAAAC
AATAGCGACG TCCTCCGCCA ATCAGTGGAA GAACGCCGTC AATCAGTTAG CTCCGTATCT
CTTGATGAGG AAATGATGAA CATGATTAAA TTCCAGCACG CTTATAACGC GGCAGCGCGG
CAAATTACTG TCGTGGATGA AATGCTCGAT AAAATCATCA ACGGAATGGG AATCGTCGGA
AGGTAG
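
The GC content reported for this gene can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of 10 bases, and that GC content means the percentage of G and C among all bases. The helper name gc_percent is hypothetical, and only the first sequence line is used as sample input to keep the example short.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied verbatim from this record.
first_line = "ATGCTTTCGA CATTTCATGG ATTAGAAGCC GCTAAGCGTG CCATGATGAC CCAGCAAGCT"
print(f"{gc_percent(first_line):.1f}%")  # prints: 48.3% (29 of 60 bases)
```

Passing the full gene sequence to the same function gives the value to compare against the GC content field.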
 
Protein sequence
MLSTFHGLEA AKRAMMTQQA ALYVTGHNVA NANTPGYTRQ RVNFVQTEPF PSPGLNRPQI 
PGQMGTGVKA GSVERVREYF LDIQYRGENN KLGYWEARAD AIAKMEDIMN EPSDNGLAKT
MDQFWQALQD LSVNPENEGA RSVVRQRGLA VVETFHYLSN SLSQIQTDIG TQIGVTITQI
NSLAKQISEI NQQIASVEPN GYLPNDLYDE RDRLIDELSK LINVQVEKHP TGGNAPATAE
GTYDIYFLNG NEKVYLVQGG DFQSISFPDG QDVDGDKEYI KEMPPVTGVT ELQVGGTSIS
FTDNNNQVTF PMGKLRGLIE AYGYVSGQDS NGQPIVAGIY PDMLNNLDKL AYTFGKLFNA
VHEKGYGLNG ETGVSFFDGL GQEAKGAAKT IRLSADIDDL ANIAASTEEG KPGNGNNAIN
LANVGSMLLS ADTVSLIGTT NTIQISTLNL PLTSGTIQTN YQGWIGKLGV DGQQANRMKN
NSDVLRQSVE ERRQSVSSVS LDEEMMNMIK FQHAYNAAAR QITVVDEMLD KIINGMGIVG
R
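
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch that uses the standard codon assignments; as far as I know, translation table 11 assigns the same amino acids to each codon and differs mainly in which start codons are allowed, but treat that as an assumption. The names CODON_TABLE and translate are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCTTTCGA CATTTCATGG ATTAGAAGCC GCTAAGCGTG CCATGATGAC CCAGCAAGCT"
last_line = "AGGTAG"

print(translate(first_line))  # prints: MLSTFHGLEAAKRAMMTQQA
print(translate(last_line))   # prints: R*
```

The first 20 residues match the start of the protein sequence, and the final AGG TAG codons give the terminal R followed by a stop codon.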