Gene EcDH1_2426 details


Gene Information

Locus tag: EcDH1_2426
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2603300
End bp: 2604694
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: putative invasin
Protein accession: ACX40066
Protein GI: 260449644
COG category:
COG ID:
TIGRFAM ID:
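
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2603300
end_bp = 2604694
gene_length_bp = 1395
protein_length_aa = 464

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1395 bp is 465 codons, one of which is the stop codon, which leaves 464 amino acids.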


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0439788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCCGTT TCGTTCCTCG CATTATCCCG TTTTATTTAC TCTTGCTTGT AGCAGGCGGT 
ACAGCTAACG CACAATCTAC CTTCGAGCAA AAAGCGGCAA ATCCCTTTGA TAATAACAAT
GATGGTCTGC CGGATTTAGG CATGGCTCCC GAAAATCATG ATGGGGAAAA ACACTTTGCT
GAAATTGTGA AAGATTTCGG CGAAACCAGT ATGAATGATA ACGGGCTGGA TACTGGCGAG
CAGGCAAAAG CTTTCGCATT GGGAAAAGTC CGCGACGCGC TTAGTCAACA GGTTAATCAG
CACGTAGAGT CCTGGCTATC ACCGTGGGGA AATGCCAGTG TTGACGTCAA AGTGGATAAC
GAAGGACATT TCACCGGCAG TCGTGGAAGC TGGTTTGTGC CGTTACAAGA TAATGATCGT
TATCTCACCT GGAGCCAGCT TGGTCTTACT CAGCAGGATA ATGGGTTGGT GAGCAATGTG
GGCGTTGGGC AACGCTGGGC GCGCGGCAAC TGGCTGGTGG GTTATAACAC TTTTTATGAC
AACTTGCTGG ACGAAAATCT TCAGCGAGCG GGCTTTGGTG CCGAAGCGTG GGGCGAATAT
TTGCGATTAT CGGCAAACTT TTATCAGCCG TTTGCTGCAT GGCATGAACA GACAGCCACG
CAGGAACAAC GGATGGCGCG CGGGTACGAC CTGACAGCTC GGATGCGCAT GCCGTTCTAT
CAACACCTCA ATACCAGTGT CAGCCTAGAA CAGTATTTTG GTGATCGTGT TGATTTGTTT
AACTCTGGTA CGGGTTATCA CAATCCCGTC GCGTTGAGTC TGGGATTAAA TTACACCCCT
GTGCCATTAG TCACTGTGAC GGCCCAGCAT AAACAGGGTG AAAGTGGCGA AAATCAAAAT
AACCTCGGGC TGAATCTTAA TTACCGCTTT GGTGTACCGC TCAAAAAACA ACTTTCTGCG
GGCGAGGTTG CCGAAAGTCA GTCGTTACGT GGTAGTCGCT ATGACAATCC GCAGCGAAAT
AATCTACCGA CTCTTGAGTA CCGACAGCGA AAAACGTTAA CGGTGTTTCT GGCGACACCG
CCGTGGGATC TAAAACCTGG CGAAACAGTG CCGCTGAAAT TACAAATCCG CAGTCGTTAC
GGTATTCGGC AACTGATTTG GCAGGGCGAT ACGCAGATAT TAAGTTTGAC GCCGGGCGCA
CAAGCCAACA GCGCGGAGGG CTGGACGCTG ATCATGCCTG ACTGGCAGAA CGGGGAAGGG
GCAAGCAATC ACTGGCGATT GTCTGTGGTG GTGGAAGATA ACCAGGGGCA GCGTGTCTCC
TCCAATGAGA TCACGCTAAC GCTTGTCGAA CCGTTCGATG CATTGTCAAA CGACGAACTG
CGCTGGGAAC CGTAA
 
Protein sequence
MSRFVPRIIP FYLLLLVAGG TANAQSTFEQ KAANPFDNNN DGLPDLGMAP ENHDGEKHFA 
EIVKDFGETS MNDNGLDTGE QAKAFALGKV RDALSQQVNQ HVESWLSPWG NASVDVKVDN
EGHFTGSRGS WFVPLQDNDR YLTWSQLGLT QQDNGLVSNV GVGQRWARGN WLVGYNTFYD
NLLDENLQRA GFGAEAWGEY LRLSANFYQP FAAWHEQTAT QEQRMARGYD LTARMRMPFY
QHLNTSVSLE QYFGDRVDLF NSGTGYHNPV ALSLGLNYTP VPLVTVTAQH KQGESGENQN
NLGLNLNYRF GVPLKKQLSA GEVAESQSLR GSRYDNPQRN NLPTLEYRQR KTLTVFLATP
PWDLKPGETV PLKLQIRSRY GIRQLIWQGD TQILSLTPGA QANSAEGWTL IMPDWQNGEG
ASNHWRLSVV VEDNQGQRVS SNEITLTLVE PFDALSNDEL RWEP