Gene EcDH1_1808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1808 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1957136 
End bp1959769 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content54% 
IMG OID 
ProductMammalian cell entry related domain protein 
Protein accessionACX39467 
Protein GI260449045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0324574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGG AAACGCCCGC TTCGACGACT GAAGCGCAGA TTAAAAATAA ACGCCGTATC 
TCACCTTTCT GGCTGCTGCC GTTCATCGCG CTAATGATTG CCAGTTGGCT GATTTGGGAC
AGTTATCAGG ACCGGGGTAA TACCGTCACC ATCGACTTTA TGTCGGCGGA TGGTATTGTT
CCAGGCCGTA CGCCTGTTCG TTATCAGGGC GTTGAAGTCG GAACAGTGCA GGATATCAGC
CTCAGCGACG ATCTTCGTAA GATTGAAGTC AAGGTCAGCA TCAAGTCCGA TATGAAAGAT
GCGCTGCGCG AAGAGACTCA GTTCTGGCTG GTGACGCCAA AAGCATCGTT GGCAGGTGTC
TCCGGGCTGG ACGCCCTCGT CGGTGGGAAC TATATCGGCA TGATGCCGGG TAAAGGTAAA
GAGCAGGATC ACTTTGTCGC ACTCGATACC CAACCGAAAT ATCGGCTGGA CAATGGCGAT
CTGATGATCC ACCTGCAAGC CCCCGATCTC GGTTCGCTGA ACAGCGGTTC ATTGGTCTAT
TTCCGCAAGA TCCCGGTGGG AAAAGTCTAC GACTATGCCA TCAATCCCAA CAAGCAAGGC
GTGGTGATTG ATGTCCTGAT CGAGCGGCGT TTTACCGATC TGGTGAAAAA AGGTAGCCGT
TTCTGGAACG TTTCCGGCGT TGATGCCAAC GTCAGTATCA GTGGCGCGAA GGTGAAACTG
GAAAGCCTGG CGGCACTGGT TAACGGTGCG ATTGCCTTCG ATTCACCAGA AGAGTCGAAA
CCTGCCGAGG CGGAAGATAC CTTTGGTCTG TATGAAGATC TGGCCCACAG CCAGCGTGGC
GTAATAATAA AACTGGAACT GCCGAGTGGG GCCGGATTAA CCGCCGACTC GACGCCGTTA
ATGTATCAGG GGCTGGAAGT CGGACAGCTG ACTAAACTGG ATTTAAATCC TGGTGGTAAA
GTCACAGGGG AAATGACCGT TGATCCCAGC GTCGTTACCC TGCTTCGTGA AAATACCCGC
ATCGAATTAC GCAACCCGAA ATTATCCCTT AGCGATGCTA ATCTCAGCGC CCTGCTGACC
GGCAAAACCT TCGAGCTGGT GCCCGGCGAT GGCGAGCCAC GCAAAGAGTT CGTTGTTGTG
CCAGGCGAAA AAGCACTGCT GCATGAACCT GATGTTCTGA CGCTGACCCT GACCGCGCCG
GAAAGTTACG GTATTGATGC GGGTCAGCCG CTCATTCTTC ACGGCGTGCA GGTAGGCCAG
GTTATCGATC GTAAACTCAC CAGCAAAGGC GTCACCTTTA CCGTCGCCAT CGAGCCTCAG
CATCGCGAAC TGGTAAAAGG CGATAGCAAA TTTGTCGTCA ACAGCCGTGT CGATGTGAAG
GTGGGGCTGG ATGGCGTTGA GTTTCTCGGT GCCAGCGCCT CAGAATGGAT CAATGGCGGG
ATACGTATTC TGCCGGGCGA TAAAGGTGAG ATGAAAGCCA GCTATCCACT GTATGCCAAT
TTGGAAAAAG CGCTGGAGAA CAGCCTTAGC GATTTACCCA CCACAACCGT GAGTTTGAGT
GCAGAGACGC TGCCGGATGT GCAGGCAGGA TCGGTAGTGC TGTACCGTAA ATTTGAAGTT
GGTGAAGTGA TTACCGTGCG TCCGCGAGCT AACGCGTTTG ATATCGATCT GCATATTAAG
CCGGAGTATC GCAACCTTCT GACCAGCAAT AGCGTGTTCT GGGCAGAAGG CGGGGCGAAA
GTTCAGCTGA ATGGTAGTGG TCTGACCGTA CAGGCATCCC CGCTCTCCAG AGCATTAAAG
GGAGCCATTA GCTTCGACAA CCTCAGCGGT GCCAGCGCCA GTCAGCGTAA AGGCGATAAG
CGTATTCTGT ATGCTTCCGA AACAGCGGCC CGTGCGGTTG GCGGGCAGAT TACGCTTCAC
GCTTTCGATG CCGGAAAACT GGCGGTCGGG ATGCCAATTC GCTATCTCGG TATTGATATC
GGGCAAATCC AGACGCTGGA TCTGATTACC GCACGCAATG AAGTACAGGC AAAGGCGGTA
CTTTACCCGG AATATGTCCA GACCTTTGCT CGCGGTGGTA CGCGCTTCTC AGTGGTCACA
CCGCAAATTT CGGCAGCTGG CGTTGAGCAT CTTGATACTA TCCTCCAGCC GTATATCAAC
GTCGAACCAG GCCGGGGCAA TCCTCGCCGC GACTTTGAAT TACAAGAGGC CACCATTACT
GATTCGCGTT ACCTGGATGG CTTAAGCATT ATTGTTGAAG CGCCGGAAGC CGGTTCGTTA
GGCATCGGTA CGCCTGTGCT GTTCCGTGGT CTGGAAGTCG GTACGGTTAC AGGAATGACG
CTGGGGACAT TGTCAGATCG CGTGATGATT GCGATGCGCA TCAGTAAACG CTATCAACAC
CTGGTGCGTA ACAATTCCGT CTTCTGGTTG GCATCGGGTT ACAGTCTGGA CTTTGGTCTG
ACGGGCGGCG TAGTGAAAAC CGGCACCTTT AACCAGTTTA TCCGTGGCGG CATCGCCTTC
GCCACGCCTC CGGGTACGCC ACTGGCACCG AAAGCCCAGG AAGGCAAACA CTTCCTGTTG
CAGGAAAGTG AACCGAAAGA GTGGCGTGAA TGGGGAACTG CGCTTCCCAA ATAA
 
Protein sequence
MSQETPASTT EAQIKNKRRI SPFWLLPFIA LMIASWLIWD SYQDRGNTVT IDFMSADGIV 
PGRTPVRYQG VEVGTVQDIS LSDDLRKIEV KVSIKSDMKD ALREETQFWL VTPKASLAGV
SGLDALVGGN YIGMMPGKGK EQDHFVALDT QPKYRLDNGD LMIHLQAPDL GSLNSGSLVY
FRKIPVGKVY DYAINPNKQG VVIDVLIERR FTDLVKKGSR FWNVSGVDAN VSISGAKVKL
ESLAALVNGA IAFDSPEESK PAEAEDTFGL YEDLAHSQRG VIIKLELPSG AGLTADSTPL
MYQGLEVGQL TKLDLNPGGK VTGEMTVDPS VVTLLRENTR IELRNPKLSL SDANLSALLT
GKTFELVPGD GEPRKEFVVV PGEKALLHEP DVLTLTLTAP ESYGIDAGQP LILHGVQVGQ
VIDRKLTSKG VTFTVAIEPQ HRELVKGDSK FVVNSRVDVK VGLDGVEFLG ASASEWINGG
IRILPGDKGE MKASYPLYAN LEKALENSLS DLPTTTVSLS AETLPDVQAG SVVLYRKFEV
GEVITVRPRA NAFDIDLHIK PEYRNLLTSN SVFWAEGGAK VQLNGSGLTV QASPLSRALK
GAISFDNLSG ASASQRKGDK RILYASETAA RAVGGQITLH AFDAGKLAVG MPIRYLGIDI
GQIQTLDLIT ARNEVQAKAV LYPEYVQTFA RGGTRFSVVT PQISAAGVEH LDTILQPYIN
VEPGRGNPRR DFELQEATIT DSRYLDGLSI IVEAPEAGSL GIGTPVLFRG LEVGTVTGMT
LGTLSDRVMI AMRISKRYQH LVRNNSVFWL ASGYSLDFGL TGGVVKTGTF NQFIRGGIAF
ATPPGTPLAP KAQEGKHFLL QESEPKEWRE WGTALPK