Gene EcDH1_2692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2692 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2866220 
End bp2867860 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID 
ProductMammalian cell entry related domain protein 
Protein accessionACX40325 
Protein GI260449903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000610454 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTA ATAATGGGGA AGCCAAAATC CAGAAAGTGA AGAACTGGTC TCCCGTGTGG 
ATATTTCCTA TCGTCACGGC GCTCATTGGG GCCTGGGTTC TTTTTTATCA TTACAGCCAT
CAGGGACCGG AAGTGACCCT GATCACCGCG AATGCGGAAG GAATTGAAGG TGGCAAAACC
ACCATTAAAA GCCGTAGCGT TGACGTCGGC GTGGTTGAAA GCGCCACACT GGCTGATGAT
TTGACGCACG TTGAAATCAA AGCGCGGCTG AATTCCGGTA TGGAAAAATT GCTGCATAAA
GACACCGTCT TTTGGGTGGT GAAACCGCAG ATTGGTCGCG AAGGGATTAG CGGCCTGGGA
ACGCTGCTGT CTGGAGTTTA TATCGAACTG CAGCCAGGCG CGAAAGGCAG CAAAATGGAT
AAATACGATT TGCTGGACTC GCCACCGTTG GCCCCGCCTG ATGCGAAAGG TATCCGTGTG
ATTCTCGATA GCAAAAAAGC CGGGCAGCTC TCGCCAGGAG ATCCGGTGCT GTTCCGTGGC
TATCGGGTAG GTTCGGTTGA AACCAGCACC TTCGATACAC AAAAACGCAA TATCAGCTAT
CAACTGTTCA TCAATGCACC TTATGACCGA CTGGTGACCA ACAATGTTCG CTTCTGGAAA
GATAGTGGCA TTGCGGTTGA TCTGACGTCA GCAGGGATGC GTGTGGAGAT GGGCTCATTG
ACAACGCTGC TGAGTGGCGG TGTCAGCTTT GATGTGCCGG AAGGTCTGGA TTTAGGGCAG
CCAGTGGCAC CGAAAACAGC TTTCGTTTTG TATGATGATC AGAAGAGCAT TCAGGATTCG
TTGTACACCG ATCACATTGA TTATCTGATG TTCTTTAAAG ATTCGGTACG CGGTCTGCAA
CCGGGAGCTC CGGTAGAGTT CCGGGGTATT CGCCTGGGTA CCGTAAGCAA AGTGCCATTC
TTTGCGCCGA ATATGCGTCA GACATTTAAC GATGATTACC GTATTCCGGT ACTGATTCGT
ATCGAGCCAG AGCGGCTGAA AATGCAGCTT GGCGAAAATG CGGATGTTGT TGAGCACCTT
GGCGAATTGT TGAAACGTGG TTTACGCGGA TCGCTGAAAA CCGGAAACCT GGTCACTGGT
GCACTGTATG TTGATCTCGA TTTCTATCCA AATACGCCTG CAATAACCGG TATTCGTGAA
TTTAATGGTT ATCAGATTAT CCCGACCGTT AGCGGCGGCC TGGCGCAAAT CCAGCAACGA
CTGATGGAAG CGTTGGATAA GATCAACAAA CTGCCATTGA ATCCGATGAT TGAACAGGCA
ACCAGTACGC TTTCTGAAAG TCAGCGCACA ATGAAAAACC TGCAAACGAC GCTGGATAGC
ATGAACAAGA TCCTCGCTAG CCAGTCGATG CAGCAGTTGC CGACGGATAT GCAGTCAACG
TTGCGTGAAT TGAATCGCAG CATGCAGGGC TTCCAGCCTG GCTCCGCAGC CTACAACAAG
ATGGTGGCGG ATATGCAGCG CCTTGATCAG GTGTTGCGAG AACTGCAACC GGTGCTGAAA
ACGCTCAATG AGAAGAGTAA CGCGCTGGTA TTTGAAGCGA AGGACAAAAA AGATCCAGAG
CCGAAGAGGG CGAAACAATG A
 
Protein sequence
MESNNGEAKI QKVKNWSPVW IFPIVTALIG AWVLFYHYSH QGPEVTLITA NAEGIEGGKT 
TIKSRSVDVG VVESATLADD LTHVEIKARL NSGMEKLLHK DTVFWVVKPQ IGREGISGLG
TLLSGVYIEL QPGAKGSKMD KYDLLDSPPL APPDAKGIRV ILDSKKAGQL SPGDPVLFRG
YRVGSVETST FDTQKRNISY QLFINAPYDR LVTNNVRFWK DSGIAVDLTS AGMRVEMGSL
TTLLSGGVSF DVPEGLDLGQ PVAPKTAFVL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ
PGAPVEFRGI RLGTVSKVPF FAPNMRQTFN DDYRIPVLIR IEPERLKMQL GENADVVEHL
GELLKRGLRG SLKTGNLVTG ALYVDLDFYP NTPAITGIRE FNGYQIIPTV SGGLAQIQQR
LMEALDKINK LPLNPMIEQA TSTLSESQRT MKNLQTTLDS MNKILASQSM QQLPTDMQST
LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE
PKRAKQ