Gene EcolC_3354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3354 
Symbol 
ID6067432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3676238 
End bp3677299 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content58% 
IMG OID641602768 
Productflagellar hook-length control protein 
Protein accessionYP_001726300 
Protein GI170021346 
COG category[N] Cell motility 
COG ID[COG3144] Flagellar hook-length control protein 
TIGRFAM ID[TIGR02514] type III secretion system needle length determinant 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.26505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCGG CATTGCTTGC CACGCTCGGA ACGCTTGCGG AAACCGCTTC GTTGAAGGCG 
GATATTTTAC CGCCGGTGAG CGGTGAAAAC GCGCCCGCGT TTACCCTGCC GAAAATGGCG
GTCGCGGCGG TGGCGGAGCG CGTTCATAGC GCTAAAACCA GTCAGCAACA GGCGACGCGC
CCGCAGGAGA ACGATCCGGT GGCGATGCAG GCGCTAATGG CGCTGTTACT TCCACAACCT
GCCGCGCCGC ATCAGGACAC GCCGCAGCCG CGAAACGTCG CAACATCGCC GGTTATCCAG
CAATTGACGA AAGCGGTGGT GCAAAACGCG CCGCAACGCC CGACGCAACA GCAGGAACTC
ACGCCGTTGC CGCCGCAGTT GCAGGAACTG ATCAGTCAGT TGCCGCAGGA GAAACCCGAA
CAGCAGGCCA GACTGGCGAC TTACGCCAGT GAAGATTTAC ATGCCATTGC GTCGACGCAG
CCGCGCGTCT CAACACAGCC AGCTCGCCCG AAACCTGAAC TAACCCGTGT GACCGCGCGC
CCGCAGGTCG AGCGTAAAAC GGAAAAAGTG CCGGACAGCG AACCGGTTAT TGCGCGTGCG
GTGTTGCAGG TTAAGACGCC GGAGCTGGTC AGCGATCATC AGGAGATTGT CGCCAAACCC
GTCACGCTTT CGATGGACGA ACTGGGCGAA AAACTGACGA CGCTGTTGAA AGATCAGATC
CACTTTCAGC TCAACAAACA ACAGCAGATC TCCACCATCC GTCTCGATCC ACCGTCGCTT
GGCAAGCTCG AGATCGCCGT ACAACTCGAC AACGGCAAAC TGATGGTGCA CATCGGCGCG
AACCAAAGTG AAGTTTGCCG CGCGTTACAG CAGTTTAGCG ACGATCTCCG CCAGCATCTG
ACGGCGCAAA ATTTTATGGA GGTGAGCGTA CAGGTTTCCT CCGAAGGGCA GTCGCAGCAA
CAACAACAGT CGGGCCATCA GCAGGAAGAG GTGAGTGCTG CCTTACAGCT TGATGATGCG
CCTCAATTTC AACAGAACGA ATCCGTTTTG ATCAAAGTGT AA
 
Protein sequence
MNPALLATLG TLAETASLKA DILPPVSGEN APAFTLPKMA VAAVAERVHS AKTSQQQATR 
PQENDPVAMQ ALMALLLPQP AAPHQDTPQP RNVATSPVIQ QLTKAVVQNA PQRPTQQQEL
TPLPPQLQEL ISQLPQEKPE QQARLATYAS EDLHAIASTQ PRVSTQPARP KPELTRVTAR
PQVERKTEKV PDSEPVIARA VLQVKTPELV SDHQEIVAKP VTLSMDELGE KLTTLLKDQI
HFQLNKQQQI STIRLDPPSL GKLEIAVQLD NGKLMVHIGA NQSEVCRALQ QFSDDLRQHL
TAQNFMEVSV QVSSEGQSQQ QQQSGHQQEE VSAALQLDDA PQFQQNESVL IKV