Gene EcolC_3368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3368 
Symbol 
ID6064903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3689283 
End bp3690485 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content58% 
IMG OID641602782 
Productflagellar basal body FlaE domain-containing protein 
Protein accessionYP_001726314 
Protein GI170021360 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATG AAATTGCCGC GACGGGGCTG AATGCCGTTA ACGAACAGCT GGACGGGATC 
AGTAACAACA TCGCCAACGC CGGAACGGTG GGCTATAAGT CGATGACCAC CCAGTTTTCC
GCCATGTATG CCGGAAGCCA GGCGATGGGT GTCAGCGTGG CGGGCACCGC GCAGAGCATT
TCGCGCGGCG GTTCGCTGGT CTCCACCGGC AACGCGCTGG ATCTGGCGAT TAACGATGAT
GGCTTTTTTG TTACCTGCGA CAGTGCGGGC AACATTTCTT ATACCCGCGC CGGTTCGTTT
GAAACCGACA AAAACGGCTA TATCGTCAAC GCCTCGGGCG CTTATTTGCA GGGTTATCCG
GTGGATGACA GCGGCACTCT GCAAACCGGT ACGGTCACCG ATATCCAGAT CAAAACCGGC
AATATCCCGG CGCAGGCAAG CAGCAGCCTG ACTTTTACCG CCAACTTCGA TGCCAGCGAT
GCGGCTATCG ATCGCACCAC CGTACCGTTC GACGCCACCA ACAGCAGCTC CTATACCGAC
AGCTACACCA CCACGGTATA TGACTCATTG GGTAACGAAC ACTCGGTATG CCAGTATTTC
ACCAAAACCA GCGACAACAC CTGGGAAGTG CAGTACACCT TCGACGGTCA GCAGCAGACC
GGCGTTCCTG CGACCACCTT AACCTTCGAC CCGAACACCG GGAAGCTGAC CTCGCCAACC
ACGCCGCAGA CCATTGAGTT TCAGACCGAC GCCGCCGCGC CCATCGACTT AACCGTCGAT
TACTCCACCT GTACGCAATA CGGCTCTGAA TTTTCTGTCA CCACCAACGC CGCCAACGGT
TACGCTTCCG CCACGCAAAA CGGTGTGCAG GTTGATGACG ATGGCAAAGT TTACGCCACC
TACAGCAACG GCGAGCGCAT GTTGCAGGGC CAGGTGGTGC TGGCGACTTT CCCGAATGAA
AACGGCCTGG AGGCAGTGAG CGGCACCGCA TGGGTACAAA CCGGGGAATC CGGCACCCCG
CTGATTGGCG TTCCCGGCTC CGGCACCTGC GGTACGCTGT CGTCCGGCGT GCTCGAAAGC
TCTAACGTCG ATATCACCAG CGAACTGGTC AACCTGATGA CCGCCCAGCG TAACTATCAG
GCCAACACCA AAGTTATCGC TACCAGCACA CAGCTCGATG ACGCGCTGTT CCAGGCAATG
TAA
 
Protein sequence
MSYEIAATGL NAVNEQLDGI SNNIANAGTV GYKSMTTQFS AMYAGSQAMG VSVAGTAQSI 
SRGGSLVSTG NALDLAINDD GFFVTCDSAG NISYTRAGSF ETDKNGYIVN ASGAYLQGYP
VDDSGTLQTG TVTDIQIKTG NIPAQASSSL TFTANFDASD AAIDRTTVPF DATNSSSYTD
SYTTTVYDSL GNEHSVCQYF TKTSDNTWEV QYTFDGQQQT GVPATTLTFD PNTGKLTSPT
TPQTIEFQTD AAAPIDLTVD YSTCTQYGSE FSVTTNAANG YASATQNGVQ VDDDGKVYAT
YSNGERMLQG QVVLATFPNE NGLEAVSGTA WVQTGESGTP LIGVPGSGTC GTLSSGVLES
SNVDITSELV NLMTAQRNYQ ANTKVIATST QLDDALFQAM