Gene Athe_1675 details


Gene Information

Locus tag: Athe_1675
Symbol: flgK
ID: 7409183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1765742
End bp: 1767319
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 39%
IMG OID: 643716044
Product: flagellar hook-associated protein FlgK
Protein accession: YP_002573542
Protein GI: 222529660
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
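
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed in this record, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(1765742, 1767319)
    print(length)                     # 1578
    print(protein_length_aa(length))  # 525
```

Under these assumptions the computed values match the listed Gene Length (1578 bp) and Protein Length (525 aa).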


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTTT ACGGGCTTGA GATTGCAAGA ACAGGTATAT TTGTCAACAG AAAAGGGTTA 
GAGGTGACAT CGCACAACGT TGCCAATGCT TCAACGCCAG GGTATACAAG ACAGGTTTTA
AATGTAAAGT CAAATCCACC ATCTGCCAAG GTTGGTTTTT ATAGCCCAAA GTTTCAGGTT
GGTTTGGGTG CTGATGTGCA AAGCCTTGAG CAAATAAGAG ATATGTTTTT AGATGTTCAA
TATAGAAACG AATATTCGCG CCAGGGAGAG TATGAAATAA AAGCTGATAA CCTGAATTTT
ATTGAAGCCA TATTTAATGA GCCAAGCGAT ACAGGCTTGT CTTCTGTTAT AGACCAATTT
TTCTCAAGCT TGCAGGAGCT TTCAAAAAAT CCAGAGAGTT TAACAGTTCG TGCGCTTGTT
CGCCAGAGAG CTCAAGCTTT GACTGATGCG ATACATAAAA TGTATAAACA GCTTGAAGAT
TTGCAGAGCG AGCTAAATGA CCAGGTATAT GATAAAATTT TAGAAATAAA CAGCATAGCT
TCTCAGATTG CAGATTTAAA CCAGCAGATA TTTGTTTTGG AGCTTCGAGG TGAAAAGGCA
AATGACCTTC GTGACCAGAG AAATCTTCTT GTTGACAAAC TCTCAAAAAT TGTTGACACA
ACTGCTTATG AAGACAAAGA TGGCAGGTTC ATTGTGCAGA TAGCTGGAGG AGAGACCCTT
GTAAATCACT TTACAGTCTA TCAGCTTGAG ACAGACAAAT CGAAAATTAT GAGAAAAAGT
GGATTTGACT CAAGTGGTCT GCCAACAGGA TTCAATCCTG ACGACCCGCT ATCACAGCAA
AATCTTTACG ATGTTCCAGG ACTTTTTGTT GTTATGTGGA AAGACACAGG GCAGGTTTTG
AATATAAAAT CTGGTGAATT AAAAGGATTA TTAGATGTTC GTGATGGTGT AGGTGGACTT
GATGAAGATG TGTCTGTAAA TGGTCAGCCA GTTGATGTTC CCAATAAAAA TTCTTTTACA
GGTATTCCTT ACTATTTGAA CAGACTCAAT GAGTTTGCTC AAAAACTTAT CGAAAAGTTC
AATGAACTTC ACACACAGGG TTGGTCACTC AACGGACAAA ATACAGGTAT AAACTTTTTC
GAACCACCTG TTGGCCAGAC ATTTTTCTAT GCAAGGTATA TAAAAGTCTC TGATGCTATT
ATGAACGACC TTAACAACAT TGCAACAACT TATGATGCAA GTAGTCTTCC AGGTGGGAAT
GACCTTGTTG TTGATATGCT AAAGCTAAGA AACGACAATT CAGTTTTCAA AGAAGGAAAA
TTTGAAGATT TTCTAAAGTC TTTAATTTCA AACCTTGGGG TTGATTCTCA GGGTGCCAAA
AACTTTGCAG AAAACCAAAA GGTAATGGTC ACCCAGCTTG ACAACAGACG CCAGGCAGTA
TCAGGTGTTT CCATAGATGA AGAGATGACA AACCTAATCA AATACCAGCA CGGTTTTCAA
GCCTCAGCCA GAATGATTAA TGCTTTTGAC GAGATGCTTG ATGTTATAGT CAACAGACTT
GGTATTGTTG GAAGATAG
 
Protein sequence
MSFYGLEIAR TGIFVNRKGL EVTSHNVANA STPGYTRQVL NVKSNPPSAK VGFYSPKFQV 
GLGADVQSLE QIRDMFLDVQ YRNEYSRQGE YEIKADNLNF IEAIFNEPSD TGLSSVIDQF
FSSLQELSKN PESLTVRALV RQRAQALTDA IHKMYKQLED LQSELNDQVY DKILEINSIA
SQIADLNQQI FVLELRGEKA NDLRDQRNLL VDKLSKIVDT TAYEDKDGRF IVQIAGGETL
VNHFTVYQLE TDKSKIMRKS GFDSSGLPTG FNPDDPLSQQ NLYDVPGLFV VMWKDTGQVL
NIKSGELKGL LDVRDGVGGL DEDVSVNGQP VDVPNKNSFT GIPYYLNRLN EFAQKLIEKF
NELHTQGWSL NGQNTGINFF EPPVGQTFFY ARYIKVSDAI MNDLNNIATT YDASSLPGGN
DLVVDMLKLR NDNSVFKEGK FEDFLKSLIS NLGVDSQGAK NFAENQKVMV TQLDNRRQAV
SGVSIDEEMT NLIKYQHGFQ ASARMINAFD EMLDVIVNRL GIVGR
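
The sequences above are printed in blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code, and it translates the start codon as M. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence listed above.
    fragment = clean_sequence("ATGTCGTTTT ACGGGCTTGA G")
    print(translate(fragment))  # MSFYGLE
```

Running `translate` on the full cleaned gene sequence should give a string that can be compared with the protein sequence listed above, and `gc_fraction` gives a value to compare with the GC content field.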