Gene EcolC_3362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3362 
Symbol 
ID6067459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3683834 
End bp3685210 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content58% 
IMG OID641602776 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_001726308 
Protein GI170021354 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATGA TTAACATCGG CTACAGCGGC GCATCAACCG CGCAGGTGGA GCTGAACGTC 
ACGGCGCAAA ACACCGCTAA CGCCATGACC ACGGGCTACA CCCGTCAGGT GGCGGAGATC
AGCACCATCG GTGCCAGCGG CGGTTCGCCG AACAGCGCCG GTAACGGCGT ACAGGTCGAC
AGCATTCGCC GCGTCTCCAA CCAGTATCAG GTGAATCAGG TGTGGTATGC CGCCAGCGAT
TACGGCTATT ACAGCACCCA GCAGGGGTAT CTCAGCCAAC TGGAAGCAGT ACTAAGCGAC
GATAACAGCA GCCTGAGCGG CGGCTTCGAT AACTTCTTCG CCGCCCTTAA CGAAGCGACC
ACCAGCCCCG ATGATTCCGC CCTGCGCGAG CAGGTGATCA GCGAAGCCGG GGCGCTGTCG
TTGCGTATTG ATAACACGCT CGATTACATC GACTCGCAAA GCACGGAAAT CATCAGCCAG
CAGCAGGCGA TGGTGTCGCA AATCAATACG CTCACCAGCG GCATTGCCAG CTATAACCAG
CAAATCGCGC AGGCCGAAGC CAACGGGGAT AACGCCTCTG CGCTGTATGA CGCCCGCGAC
CAGATGGTGG AAGAACTCAG CGGGATGATG GATGTGCAGG TCAATATCGA CGATCAGGGC
AACTACAACG TCACCCTGAA AAACGGTCAG CCACTGGTGA GCGGGCAGCA AAGTTCGACC
ATCGCGCTGG AAACCAACGC CGACGGCACG CCGACCATGT CGCTGACCTT CGCTGGCACC
ACCTCGACGA TGACCACCGA CACTGGAGGT TCATTAGGCG CACTGTTTGA TTATCAAAAC
GATGTGCTGA CGCCGTTGAC CGACACCATC AACAGCATGG CGTCGCAGTT TGCCGATGCG
GTCAACAACC AGCTTGCGCA GGGCTACGAT CTCAACGGTA ACCCCGGCGA GCCGCTGTTT
ATCTATGACG CCAGCAATGC CGATGGCCCG CTGACCGTCA ACCCGGATAT CACCGCCGAT
GAGCTGGCGT TCTCCAGTTC GCCGGACGAA AGCGGCAACA GCGACAACCT GCAGGCGCTG
ATCAACATCT CCACCGAACC GCTGGAGATC GCCAACCTCG GCAGCGTGAC GGTCGGGCAG
GCGTGCTCGT CAATCATCAG CAATATCGGC ATTTACAGTC AGCAAAACCA GACGGAAGTC
GATGCCGCGT CCAGTGTCTA TTCCGCAGCG CAAAACCAGC AGAGCAGCGT CAGCGGTGTC
AGCATGGACG AAGAGGCGGT GAACCTCATC ACCTATCAAC AAATTTATGA AGCCAATCTG
AAAGTCATTT CCGCCGGGGC CGAGATTTTC GATTCGGTGC TGGAAATGTG CAGCTAA
 
Protein sequence
MDMINIGYSG ASTAQVELNV TAQNTANAMT TGYTRQVAEI STIGASGGSP NSAGNGVQVD 
SIRRVSNQYQ VNQVWYAASD YGYYSTQQGY LSQLEAVLSD DNSSLSGGFD NFFAALNEAT
TSPDDSALRE QVISEAGALS LRIDNTLDYI DSQSTEIISQ QQAMVSQINT LTSGIASYNQ
QIAQAEANGD NASALYDARD QMVEELSGMM DVQVNIDDQG NYNVTLKNGQ PLVSGQQSST
IALETNADGT PTMSLTFAGT TSTMTTDTGG SLGALFDYQN DVLTPLTDTI NSMASQFADA
VNNQLAQGYD LNGNPGEPLF IYDASNADGP LTVNPDITAD ELAFSSSPDE SGNSDNLQAL
INISTEPLEI ANLGSVTVGQ ACSSIISNIG IYSQQNQTEV DAASSVYSAA QNQQSSVSGV
SMDEEAVNLI TYQQIYEANL KVISAGAEIF DSVLEMCS