Gene ECD_00256 details

Gene Information

Locus tag: ECD_00256
Symbol: eaeH
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 284540
End bp: 285772
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: attaching and effacing protein, pathogenesis factor
Protein accession: ACT42155
Protein GI: 253976485
COG category:
COG ID:
TIGRFAM ID:
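
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 284540
END_BP = 285772
GENE_LENGTH_BP = 1233
PROTEIN_LENGTH_AA = 410


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1233
print(coding_codons(GENE_LENGTH_BP))  # 410
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.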


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACATT ATAAAACAGG TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC 
TGCGTGGCGT GGGCAAATAT CTCTGTTCAG GTTCTTTTTC CACTCGCTGT CACCTTTACC
CCAGTAATGG CGGCACGTGC GCAGCATGCG GTTCAGCCAC GGTTGAGCAT GGGAAATACT
ACGGTAACTG CTGATAATAA CGTGGAGAAA AATGTCGCGT CGTTTGCCGC AAATGCCGGG
ACATTTTTAA GCAGTCAGCC AGATAGCGAT GCGACACGTA ACTTTATTAC CGGAATGGCC
ACAGCTAAAG CTAACCAGGA AATACAGGAG TGGCTCGGGA AATATGGTAC TGCGCGCGTC
AAACTGAATG TCGATAAAGA TTTCTCGCTG AAGGATTCTT CGCTGGAAAT GCTTTATCCG
ATTTATGATA CGCCAACAAA TATGTTGTTC ACTCAGGGAG CAATACATCG TACCGACGAT
CGTACTCAGT CAAATATTGG TTTTGGCTGG CGTCATTTTT CAGGAAATGA CTGGATGGCG
GGGGTGAATA CTTTTATCGA TCATGATTTA TCCCGTAGTC ATACCCGCAT TGGTGTTGGT
GCGGAATACT GGCGTGATTA TTTGAAACTG AGCGCCAATG GTTATATCCG GGCTTCTGGC
TGGAAAAAAT CGCCGGATGT TGAGGATTAT CAGGAACGCC CGGCGAATGG TTGGGATATC
CGCGCAGAGG GCTATTTACC TGCCTGGCCG CAGCTTGGCG CAAGCCTGAT GTATGAACAG
TATTATGGCG ATGAAGTCGG GCTGTTTGGT AAAGATAAGC GCCAGAAAGA CCCGCATGCT
ATTTCTGCCG AGGTGACCTA TACGCCAGTG CCTCTTCTGA CACTGAGCGC CGGGCATAAG
CAGGGCAAGA GTGGTGAGAA TGACACTCGC TTTGGCCTGG AAGTTAATTA TCGGATTACC
CTGATGGCGG GAGTCAATCC CGTAGGAGGA AGTATGTGGG TCGACATTGA GGCTCCGGAA
GGAGTGACGG AGAAGGATTA TCAATTCCTG CCGTCGAAGG CTGACCATTT CTCAGGTGGG
AAAATCACGC GTACATTTAG TACCAGCAAG CCAGGTGTCT ATACGTTCAC ATTCAACGCA
CTGACGTATG GCGGGTACGA AATGACGCCT GTGAAGGTGA CAATTAACGC CGTTGCTGCA
GAGACTGAAA ATGGCGAGGA GGAGATGCCA TAA
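
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before computing anything from it. The following is a minimal sketch of how a GC fraction could be computed from such blocked text. The helper names are hypothetical, and the example uses only the first displayed line of the sequence, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for the 10-base block display."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide string."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only.
first_line = "ATGTCACATT ATAAAACAGG TCATAAACAA CCACGATTTC GTTATTCAGT TCTGGCCCGC"
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))          # 0.4
```

Passing the full gene sequence to `gc_fraction` gives a figure that can be compared with the GC content field of the record.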
 
Protein sequence
MSHYKTGHKQ PRFRYSVLAR CVAWANISVQ VLFPLAVTFT PVMAARAQHA VQPRLSMGNT 
TVTADNNVEK NVASFAANAG TFLSSQPDSD ATRNFITGMA TAKANQEIQE WLGKYGTARV
KLNVDKDFSL KDSSLEMLYP IYDTPTNMLF TQGAIHRTDD RTQSNIGFGW RHFSGNDWMA
GVNTFIDHDL SRSHTRIGVG AEYWRDYLKL SANGYIRASG WKKSPDVEDY QERPANGWDI
RAEGYLPAWP QLGASLMYEQ YYGDEVGLFG KDKRQKDPHA ISAEVTYTPV PLLTLSAGHK
QGKSGENDTR FGLEVNYRIT LMAGVNPVGG SMWVDIEAPE GVTEKDYQFL PSKADHFSGG
KITRTFSTSK PGVYTFTFNA LTYGGYEMTP VKVTINAVAA ETENGEEEMP
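
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough illustration, the sketch below builds a codon table from the amino-acid string in NCBI's TCAG codon ordering (table 11 assigns the same amino acids as the standard code and differs in its permitted start codons) and translates a blocked nucleotide string up to the first stop codon. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, each base ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a blocked nucleotide string, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence in this record.
print(translate("ATGTCACATT ATAAAACAGG TCATAAACAA"))         # MSHYKTGHKQ
print(translate("GAGACTGAAA ATGGCGAGGA GGAGATGCCA TAA"))     # ETENGEEEMP
```

The two outputs match the first and last 10-residue blocks of the protein sequence shown above.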