Gene EcHS_A0226 details

Gene Information

Locus tag: EcHS_A0226
Symbol:
ID: 5591432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 241171
End bp: 244695
Gene Length: 3525 bp
Protein Length: 1174 aa
Translation table: 11
GC content: 57%
IMG OID: 640919413
Product: hypothetical protein
Protein accession: YP_001457000
Protein GI: 157159682
COG category: [S] Function unknown
COG ID: [COG3523] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03348] type VI secretion protein IcmF
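
The coordinates and lengths in this record are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based, inclusive coordinates and that the gene length includes a single terminating stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 241171
END_BP = 244695


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3525
print(protein_length(length_bp))  # 1174
```

Both results match the Gene Length and Protein Length fields listed for this gene.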


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTCAGAT TACCAACACC CCGACTACTC AGCGGACTCA AATCAGCCCT GCGACCGGCG 
ATGCCCCGGT TTAAAGTCTC TGCTTTCTGG CTGCTGATAC TGGCGTGGAT TTTTCTGCTG
GTGTGGATCT GGTGGAAAGG CCCGACGTGG ACGCTGTATG AAGAACAGTG GCTCAAACCA
CTGGCGAACC GCTGGCTGGC AACGGCGGCG TGGGGGATTA TTGCCCTGAT GTGGCTTACC
GTCCGGGTGA TGAAGCGCCT GCAACAGCTT GAAAAAATGC AAAAGCAACA GCGCGAGGAA
GCCGTTGATC CGCTCAGCGT GGAACTGAAC GCCCAGCAGC GTTATCTTGA CCGCTGGCTG
CTGCGCCTGC AACGCCATCT CGACAACCGC CGTTTCCTGT GGCAGTTGCC GTGGTATATG
GTCATCGGCC CGGCGGGCAG TGGCAAAACG ACACTGCTGC GCGAGGGGTT TCCGTCCGAC
ATTATTTATG CCCCGGAGGG CGCACGCGGC GCAGAACAAC GCCTGTACCT CACGCCCCAT
GTCGGTAAAC AGGCGGTGAT CTTTGATATC GACGGCACAC TCTGCGCTCC CGCTGATGCG
GATATCCTGC ATCGCCGTCT GTGGGAACAT GCCCTCGGCT GGCTGAAAGA AAAGCGCGCG
CGCCAGCCGC TGAATGGGAT TATTCTGACA CTCGATTTAC CCGATCTGCT TACCGCAGAC
AAACGCCGCC GCGAGCATCT GTTACAGGCG CTGCGCAGCC GCTTGCAGGA TATACGCCAG
CATCTTCACT GCCAGTTACC GGTTTACGTG GTACTTACCC GGCTTGATTT ATTGCAGGGT
TTTGCCGCCC TGTTTCAGTC CCTGAACAGA CAGGATCGCG ATGCGATTCT GGGCGTCACG
TTCACCCGCC GTGCCCATGA AAATGATGAC TGGCGAACAG AGTTGAATGC TTTCTGGCAG
ACATGGGTGG ATCGAATGAA TCTGGCGTTG CCGGATCTGA TGGTCGCTCA GACTCACACC
CGCGCGTCTT TATTCAGTTT TTCCCGCCAG ATGCAGGGAA GCCGTGAACC GCTGGTGTCA
CTGCTTGAGG GTCTGCTTGA TGGCGAAAAT ATGAACGTGA TGCTGCGTGG TGTCTATCTC
ACCTCTTCGC TTCAGCGTGG ACAGATGGAT GATATATTCA CCCAGTCTGC CGCCCGCCAG
TACCGGCTGG GCAATAACCC ACTGGCGTCC TGGCCCCTGG TGGACACCGC GCCTTATTTC
ACCCGCAGCC TGTTCCCGCA GGCATTACTC GCAGAGCCTA ATCTGGCAAC AGAGAGCCGC
GCCTGGCTGA TACGTTCCCG TCGCCGCCTG ACGGTTTTCT CTGCCACAGG CGGCGTGGCA
GCACTGCTGC TCATCACCGG CTGGCATCAC TATTACAACG GTAACTATCA GTCTGGCATC
ACCGTGCTTA AGCAGGCCAA AGCCTTTATG GACGTGCCGC CTCCGCAGGG GGAAGATGAC
TTTGGCAACC TGCAACTGCC GCTCCTGAAC CCGGTACGCG ATGCCACACT GGCCTATGGC
GACTGGGGCG ACCGCAGCCG TCTGGCCGAT ATGGGACTGT ACCAGGGACG ACGTATCGGG
CCTTATGTGG AACAGACCTA TCTGCAACTG CTGGAGCAAC GTTACCTGCC CTCGCTGTTT
AACGGGCTGG TCAAAGCGCT GAACGCCGCG CCGCCGGAGA GTGAAGAAAA ACTCGCGGTG
CTGCGCGTGA TGCGAATGCT GGAGGACAAA AGCGGACGTA ACAATCAGGT GGTGAAGCAG
TATATGGCAA AACGCTGGAG CGAAAAGTTC CACGGTCAGC GCGATATCCA GGCACAACTG
ATGTCCCATC TTGACTACGC GCTGGCTCAT ACTGACTGGC ACGCAGAGCG TCAGGCGGGC
GACGGTGACG CCATCAGCCG CTGGACGCCA TATGACAAGC CCGTGGTATC AGCACAGAAA
GAACTGAGCA AACTGCCTGT CTACCAGCGG GTTTACCAGA GCCTGAAAAC GCGGGCGCTG
GGCGTTCTTC CTGCCGACCT CAATCTGCGT GACCAGGTAG GGCCAACCTT TGACCAGGTG
TTTACATCTG CCGATGACAA CAAACTGGTT GTTCCACAGT TTCTTACCCG TTACGGCCTG
CAAAGCTATT TTGTAAAACA GCGCGATGAA CTGGTTGAAC TGACGGCGAT GGATTCCTGG
GTACTTAACC TTACCCGCAG CGTGAAATAC AGTGACGCCG ACCGCGCGGA AATCCAGCGC
CAGTTGACCG AGCAGTATAT CAGCGACTAC ACCGCCACCT GGCGGGCCGG GATGGACAAT
CTGAATATCC GCAATTTTGA GTCCATCGGA CAACTGACCG GGGCGCTGGA GCAGGTTATC
AGCGGCGACC AGCCTTTGCA GCGGGCGCTG ACCGTGCTGC GTGACAACAC ACAGCCAGGC
GTCTTTTCTG AAAAACTCTC TGCCAAAGAA CGGGAGGAAG CCCTGGCAGA GCCGGATTAC
CAGTTACTCA CCCGCCTCGG GCATGAATTC GCCCCGGAAA ACAGTACCCT GGCAGTACAG
AAAGACAAAG AAAGCACGAT GCAGGCCGTG TATCAGCAAC TCACCGAGTT GCACCGCTAC
CTGCTGGCAA TCCAGAACGC GCCTGTACCA GGGAAATCGG CGCTGAAAGC CGTGCAGTTA
CGGCTTGATC AGAACAGCAG CGATCCGATA TTCGCCACCC GCCAGATGGC AAAAACGCTG
CCTGCTCCGC TCAACCGCTG GGTTGGCAGA CTGACTGACC AGGCCTGGCA TGTGGTGATG
GTGGAGGCTG TTCATTATAT GGAAGTGGAC TGGCGCGACA GCGTGGTGAA ACCGTTTAAC
GAGCAACTGG CAAATAACTA TCCGTTTAAT CCGCGTTCTG CACAGGATGC CTCACTGGAT
GCCTTCGAAC GCTTCTTTAA ACCGGATGGC ATACTGGATA CCTTCTACCA GCAGAACCTG
AAGCTGTTTA TCGATAATGA CCTGAGTCTG GAGGATGGCG ATAACAACGT CATTATTCGC
GAAGATATTA TTGCGCAACT GGAAACTGCG CAGAAAATCC GTGACATCTT CTTCAGCAAA
CAGAACGGTC TGGGAACATC CTTTGCCGTG GAAACGGTAT CGCTTTCAGG CAATAAACGC
CGCAGTGTAC TGAACCTTGA CGGTCAGTTA GTCGATTACA GCCAGGGCCG TAACTATACC
GCCCATCTGG TCTGGCCTAA CAACATGCGC GAAGGCAACG AAAGTAAGCT GACGCTCATC
GGCACCAGCG GCAACGCGCC GCGCAGTATC AGCTTCAGCG GGCCGTGGGC GCAGTTCCGC
CTGTTCGGGG CCGGACAACT GACCGGAGTA CAGGATGGCA ACTTTACCGT GCGCTTTAGC
GTGGACGGTG GCGCGATGAC CTACCGTGTG CATACCGACA CGGAAGATAA CCCGTTCAGC
GGTGGGTTGT TCAGCCAGTT TGGTCTGTCA GACACACTGT ACTGA
 
Protein sequence
MFRLPTPRLL SGLKSALRPA MPRFKVSAFW LLILAWIFLL VWIWWKGPTW TLYEEQWLKP 
LANRWLATAA WGIIALMWLT VRVMKRLQQL EKMQKQQREE AVDPLSVELN AQQRYLDRWL
LRLQRHLDNR RFLWQLPWYM VIGPAGSGKT TLLREGFPSD IIYAPEGARG AEQRLYLTPH
VGKQAVIFDI DGTLCAPADA DILHRRLWEH ALGWLKEKRA RQPLNGIILT LDLPDLLTAD
KRRREHLLQA LRSRLQDIRQ HLHCQLPVYV VLTRLDLLQG FAALFQSLNR QDRDAILGVT
FTRRAHENDD WRTELNAFWQ TWVDRMNLAL PDLMVAQTHT RASLFSFSRQ MQGSREPLVS
LLEGLLDGEN MNVMLRGVYL TSSLQRGQMD DIFTQSAARQ YRLGNNPLAS WPLVDTAPYF
TRSLFPQALL AEPNLATESR AWLIRSRRRL TVFSATGGVA ALLLITGWHH YYNGNYQSGI
TVLKQAKAFM DVPPPQGEDD FGNLQLPLLN PVRDATLAYG DWGDRSRLAD MGLYQGRRIG
PYVEQTYLQL LEQRYLPSLF NGLVKALNAA PPESEEKLAV LRVMRMLEDK SGRNNQVVKQ
YMAKRWSEKF HGQRDIQAQL MSHLDYALAH TDWHAERQAG DGDAISRWTP YDKPVVSAQK
ELSKLPVYQR VYQSLKTRAL GVLPADLNLR DQVGPTFDQV FTSADDNKLV VPQFLTRYGL
QSYFVKQRDE LVELTAMDSW VLNLTRSVKY SDADRAEIQR QLTEQYISDY TATWRAGMDN
LNIRNFESIG QLTGALEQVI SGDQPLQRAL TVLRDNTQPG VFSEKLSAKE REEALAEPDY
QLLTRLGHEF APENSTLAVQ KDKESTMQAV YQQLTELHRY LLAIQNAPVP GKSALKAVQL
RLDQNSSDPI FATRQMAKTL PAPLNRWVGR LTDQAWHVVM VEAVHYMEVD WRDSVVKPFN
EQLANNYPFN PRSAQDASLD AFERFFKPDG ILDTFYQQNL KLFIDNDLSL EDGDNNVIIR
EDIIAQLETA QKIRDIFFSK QNGLGTSFAV ETVSLSGNKR RSVLNLDGQL VDYSQGRNYT
AHLVWPNNMR EGNESKLTLI GTSGNAPRSI SFSGPWAQFR LFGAGQLTGV QDGNFTVRFS
VDGGAMTYRV HTDTEDNPFS GGLFSQFGLS DTLY
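
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can serve as an alternative start codon that is read as methionine. The sketch below translates the first 60 bases of the gene sequence as a spot check. It is a simplified illustration rather than a full implementation of table 11: the start-codon set is limited to ATG, GTG and TTG, and the helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order; table 11 uses the
# same assignments and differs mainly in which codons may start a protein.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the 10-base grouping spaces and line breaks."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as M."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "GTGTTCAGAT TACCAACACC CCGACTACTC AGCGGACTCA AATCAGCCCT GCGACCGGCG"
print(translate(first_line))  # MFRLPTPRLLSGLKSALRPA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give the figure to compare against the GC content field.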