Gene EcolC_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3357 
Symbol 
ID6067442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3678027 
End bp3679343 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content56% 
IMG OID641602771 
Productflagellar hook-associated 2 domain-containing protein 
Protein accessionYP_001726303 
Protein GI170021349 
COG category[N] Cell motility 
COG ID[COG1345] Flagellar capping protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.507353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAACC CAAGAACCAT TGCGCAAGAA ATCGCTTATG CGGATGTTGC CACTCAGGCA 
GCCAATTTGC AGGAGAAGCA GAGCGAGCTG GATGCTGAAA GCAGCGGCCT GGACTCGCTC
AGCTCAGCGT TGAGCGATTT CCAGAGCGCC GTTGACGCGC TGAACAGCGA TACCGACGGC
CCGGTGACTT TTGCCGCCAC CAGCAATAAT GACTCGGCGA CCGTTTCCGC CAATTCCCAG
GCGCAGGCGG GAAGCTACTC ATTTTTCGTT GAGCAACTGG CGCAGGGGCA GCAGACCACA
TTCAGTATGG GCGACGACGC CTTTTCCGCC ACCGGCACCT TCGAACTGAC GATGGGCGAC
AGCACCATGG ATATCGATCT GTCGGCGGCA GATCAGAACG GTGACGGCGA TGGTTTTATC
GACGCCAGCG AGCTGGTGAA TGCGATTAAC GACTCCGATG ACAATCCAGG CGTGTCGGCG
GCACTGGTAA AAACCGACGG CACCACCACG ATTATGCTCA CCTCCGATAG CACCGGGGCG
CAGAGCGCGT TTTCGGTGAG CGTAACGGGG CATGATGCCA GCAACGATAG CACCAGCGCG
CCCGTTGCCA CGGATGTTTC CTCCGCTCAG GACGCGATTA TTCATCTCGG TAGCGCCACA
GGGCCTGCGA TCACCAACAG CAGCAATACA TTTGATGATG TGATCCCCGG CGTCACCATG
ACTTTTACCG AAGTCAGCGA TTCTGACAGC GATCTCACCA CCTTTAACAT CAGCGAAGAT
TCCAGCGCCA GCCAGGAGAA AGTACAGACC TTTGTCGATG CCTATAACAC CTTGATTGAT
ACGGTTGATT CGCTGACCAC TCACGGTGAT GACAGTACCA GCGCCGGGGT ATTTGCAGGC
GACGCCGGGC TCAGTTCACT GGCGAACCAG CTCGATGACA TCGCCCATGC CAGCTACAAC
GGCGTGTCGA TTGTTGACTA TGGCATCACC CTCGATTCTC ACGGCCACTT ACAGATTGAC
TCCGACCAGT TCAACGACGA AATGGCGAAA AATCCTGACG GTCTGACCTC TATTTTCGTT
GGCGACAACA GCATGGTGGC GCAGATGGAC GACCTGATTA ACACCTACAC GGACTCCAGC
AACGGCATCA TCACCCTGCG CCAGCAGAAC ATTGACGATC AGATGAGCAA AATTCAGGAC
GAAGGCGATC AGCTTACCGA TACCTATAAC GCCAACTACG ACCGTTATCT GGAGGAGTAC
ACCAACACGC TGGTTGAGGT GTACACCATG AAAGCCAGCA TGGCGGCATT CGCGTAA
 
Protein sequence
MINPRTIAQE IAYADVATQA ANLQEKQSEL DAESSGLDSL SSALSDFQSA VDALNSDTDG 
PVTFAATSNN DSATVSANSQ AQAGSYSFFV EQLAQGQQTT FSMGDDAFSA TGTFELTMGD
STMDIDLSAA DQNGDGDGFI DASELVNAIN DSDDNPGVSA ALVKTDGTTT IMLTSDSTGA
QSAFSVSVTG HDASNDSTSA PVATDVSSAQ DAIIHLGSAT GPAITNSSNT FDDVIPGVTM
TFTEVSDSDS DLTTFNISED SSASQEKVQT FVDAYNTLID TVDSLTTHGD DSTSAGVFAG
DAGLSSLANQ LDDIAHASYN GVSIVDYGIT LDSHGHLQID SDQFNDEMAK NPDGLTSIFV
GDNSMVAQMD DLINTYTDSS NGIITLRQQN IDDQMSKIQD EGDQLTDTYN ANYDRYLEEY
TNTLVEVYTM KASMAAFA