Gene EcolC_3391 details

Gene Information

Locus tag: EcolC_3391
Symbol:
ID: 6067614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3708909
End bp: 3711002
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 58%
IMG OID: 641602805
Product: type III secretion FHIPEP protein
Protein accession: YP_001726337
Protein GI: 170021383
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1298] Flagellar biosynthesis pathway, component FlhA
TIGRFAM ID: [TIGR01398] flagellar biosynthesis protein FlhA
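
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither is stated in the record, but both are consistent with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3708909
end_bp = 3711002
gene_length_bp = 2094
protein_length_aa = 697

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is read in codons of three bases and ends with one stop codon,
# which is not counted in the protein length.
codon_count = span_bp // 3
residue_count = codon_count - 1

print(span_bp, codon_count, residue_count)
```

Under these assumptions the computed span matches the Gene Length field and the residue count matches the Protein Length field.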


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAAA CAACTAAATC GTTCCTCGCG CTGCTGCGCG GCGGCAATCT CGGCGTGCCG 
CTGGTGATAC TTTGTATTCT GGCGATGGTT ATTTTGCCGC TGCCGCCTGC GTTGCTGGAT
ATTCTGTTCA CCTTCAACAT TGTGCTGGCG GTGATGGTGC TGCTGGTGGC GGTGTCGGCG
AAAAGGCCGC TGGAATTCAG CCTGTTCCCG ACCATTTTGC TGATCACCAC CTTAATGCGC
CTGACGCTTA ACGTTGCTTC TACGCGCGTG GTGCTGCTGC ACGGGCATCT CGGCGCGGGC
GCGGCGGGTA AGGTGATTGA GTCGTTTGGT CAGGTGGTGA TCGGCGGCAA CTTTGTCGTC
GGCTTCGTGG TGTTTATCAT CCTGATGATC ATCAACTTTA TTGTCGTCAC CAAAGGGGCC
GAGCGTATTT CCGAGGTTTC TGCCCGCTTT ACCTTAGACG CGATGCCCGG CAAACAGATG
GCGATTGACG CCGATCTTAA CGCCGGATTG ATCAACCAGG CGCAGGCGCA AACCCGGCGT
AAAGATGTTG CCAGCGAGGC CGATTTCTAC GGCGCGATGG ACGGGGCATC GAAGTTTGTG
CGCGGGGACG CCATCGCCGG GATGATGATT CTGGCGATCA ACCTGATCGG CGGCGTCTGT
ATCGGGATCT TCAAATACAA CCTGAGCGCC GATGCTGCCT TCCAGCAGTA TGTGCTGATG
ACCATCGGCG ACGGCCTGGT GGCGCAGATC CCTTCCCTGC TGCTCTCCAC GGCGGCGGCG
ATTATCGTCA CCCGCGTCAG CGACAACGGC GATATCGCCC ATGACGTGCG CCACCAGCTG
CTGGCAAGCC CGTCGGTGCT CTACACCGCT ACCGGGATTA TGTTTGTGCT GGCGATGGTG
CCGGGAATGC CGCATCTGCC GTTTTTGCTG TTCAGCGCCC TGCTTGGCTT TACCGGCTGG
CGGATGAGCA AACGCCCGCA GGCGGCGGAA GCGGAAGAGA AAAGTCTCGA AACGCTGACC
CGCACCATCA CCGAAACCAG CGAGCAACAG GTCAGTTGGG AAACCATTCC GCTGATCGAG
CCCATCAGCT TAAGCCTCGG CTACAAGCTG GTGGCGCTGG TGGACAAAGC CCAGGGCAAC
CCGCTCACCC AGCGTATTCG CGGCGTGCGA CAGGTGATTT CTGACGGCAA CGGCGTGCTG
CTGCCGGAGA TCCGCATTCG GGAAAACTTC CGCCTTAAGC CCAGCCAGTA CGCTATTTTC
ATCAACGGCA TTAAGGCTGA TGAAGCGGAT ATTCCGGCGG ATAAACTGAT GGCCCTGCCC
TCCAGCGAAA CCTACGGCGA GATTGACGGC GTGCTGGGGA ACGACCCGGC GTATGGGATG
CCGGTCACCT GGATCCAGCC TGCGCAGAAG GCGAAGGCGC TGAATATGGG GTATCAGGTG
ATCGACAGCG CCAGCGTGAT TGCTACCCAT GTGAACAAGA TTGTGCGCAG CTATATTCCT
GATTTGTTTA ACTATGATGA TATTACGCAG TTGCATAACC GTTTGGCGTC GATGGCACCG
CGCCTGGCGG AAGATTTAAG CGCGGCGCTC AATTACAGCC AGTTGCTGAA AGTGTACCGG
GCGCTGCTGA CCGAAGGCGT TTCCCTGCGC GATATCGTCA CCATCGCCAC CGTGCTGGTC
GCCAGTAGCG CGGTGACCAA AGATCATATT CTGCTGGCAG CTGATGTGCG CCTGGCGCTG
CGGCGCAGCA TTACCCATCC GTTCGTTCGC AAGCAGGAGC TGACGGTGTA TACGCTGAAT
AATGAACTGG AAAATCTGCT GACCAATGTG GTGAATCAGG CGCAACAGGG CGGGAAAGTG
ATGCTTGACA GCGTGCCAGT GGACCCGAAT ATGCTCAACC AGTTCCAGAG CACGATGCCA
CAGGTGAAAG AGCAGATGAA AGCGGCGGGG AAAGACCCGG TGCTGCTGGT GCCACCGCAG
CTGCGCCCTT TGCTGGCGCG TTATGCAAGG TTGTTTGCGC CGGGGCTGCA TGTGCTGTCG
TATAACGAAG TGCCGGATGA GCTGGAGTTG AAGATTATGG GGGCGTTGAT GTAG
 
Protein sequence
MAKTTKSFLA LLRGGNLGVP LVILCILAMV ILPLPPALLD ILFTFNIVLA VMVLLVAVSA 
KRPLEFSLFP TILLITTLMR LTLNVASTRV VLLHGHLGAG AAGKVIESFG QVVIGGNFVV
GFVVFIILMI INFIVVTKGA ERISEVSARF TLDAMPGKQM AIDADLNAGL INQAQAQTRR
KDVASEADFY GAMDGASKFV RGDAIAGMMI LAINLIGGVC IGIFKYNLSA DAAFQQYVLM
TIGDGLVAQI PSLLLSTAAA IIVTRVSDNG DIAHDVRHQL LASPSVLYTA TGIMFVLAMV
PGMPHLPFLL FSALLGFTGW RMSKRPQAAE AEEKSLETLT RTITETSEQQ VSWETIPLIE
PISLSLGYKL VALVDKAQGN PLTQRIRGVR QVISDGNGVL LPEIRIRENF RLKPSQYAIF
INGIKADEAD IPADKLMALP SSETYGEIDG VLGNDPAYGM PVTWIQPAQK AKALNMGYQV
IDSASVIATH VNKIVRSYIP DLFNYDDITQ LHNRLASMAP RLAEDLSAAL NYSQLLKVYR
ALLTEGVSLR DIVTIATVLV ASSAVTKDHI LLAADVRLAL RRSITHPFVR KQELTVYTLN
NELENLLTNV VNQAQQGGKV MLDSVPVDPN MLNQFQSTMP QVKEQMKAAG KDPVLLVPPQ
LRPLLARYAR LFAPGLHVLS YNEVPDELEL KIMGALM
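
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard amino-acid assignments, which translation table 11 shares with the standard genetic code (the two differ only in which codons are permitted as starts). The `translate` helper and the two excerpts are illustrative: `first_60` is the first line of the gene sequence and `last_54` is its last line, with the spacing removed.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
first_60 = "ATGGCGAAAA CAACTAAATC GTTCCTCGCG CTGCTGCGCG GCGGCAATCT CGGCGTGCCG"
last_54 = "TATAACGAAG TGCCGGATGA GCTGGAGTTG AAGATTATGG GGGCGTTGAT GTAG"

print(translate(first_60))  # MAKTTKSFLALLRGGNLGVP
print(translate(last_54))   # YNEVPDELELKIMGALM
```

The two printed strings correspond to the first 20 and last 17 residues of the protein sequence listed above. Translating the tail on its own works because the 2040 bases before it are a multiple of three, so the excerpt is still in frame.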