Gene EcDH1_3613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3613 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3889929 
End bp3891260 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content54% 
IMG OID 
ProductHipA domain protein 
Protein accessionACX41226 
Protein GI260450804 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGC TGACTGATCT TTTACTGCAA GGGCCGCGTT CTGCCCCGGA ATTGCGCCAG 
CGTCTGGCAA TCAGTCAGGC GACGTTCTCA CGCCTTGTTG CCAGAGAAGA TCGGGTGATT
CGCTTTGGTA AAGCACGGGC AACGCGATAT GCACTGCTGC GTCCTTATCG CGGAATTGAG
CGTATTCCCG TCTGGCGGGT GGACGATACC GGAAAGGCGC ATAAATTCGC CGACATCCGG
TTGTGCTGGC CGCAGGGAAG TTGTCTGGTA ACAGGCGCAG ATGGCGACGA ACAGTGGTTT
GATGGTTTGC CCTGGTATTT GACCGATCTC CGACCGCAGG GCTTTTTAGG GCGCGCGTGG
GGCAGGAAGT TAGCCGCGCA ACTGAATCTG ACTGATGATA TACGTCTCTG GCAGGAAGAA
GATGTGCTCT ACGCCCTGAC CGTATTTAAC GGTGAATATA CTGGCGGTTG GTTGGTCGGG
GAGGGGAATT ATCAGCGATG GATTACTGCA CAACACCCTG CGGAAATTCC TCTGGATCAA
AAACTCACCC ATTACGAACA GCTGGCAAGT GATGCACTGG CAGGAGAAAT TGTGGGTTCT
TCTGCGGGCG GCGAGCAGCC AAAATTTACC TACTATGCAC AAACGCCGTC AGGCAATAAA
CATGTGTTGG TGAAATTCAC CGTACCACAG CAAACCGCGG TCAGCCAACG TTGGGGTGAC
CTGCTAATTG CTGAATCTAT TGCCGCGCAA ATCCTGCGTG ACGGTGGGAT CCACGCCATC
GAGTCAACGG TGCTTGTAAC AAGTAACAGG CAGGTATTCC TCGAAGCGGA ACGCTTTGAC
TGCAAAGGTA ACGATGGTCG CTTGCCTATT GTGTCGCTGG AGGCGGTGCA GAGTGAGTTT
ATCTCTTCTC CGGGATCGTG GCCGCAGGCA ATGCGCCGTT TGTGTGAGCA ACAACTTGTC
ACTCACCAGA GCGTGGCGCA AACAGAAGTG ATCTGGGCAT TTGGGCGACT TATCGCCAAC
AGCGATATGC ACGCAGGTAA TTTATCGTTT TATTTATCTG AACCGCCATT TGCGCTGACG
CCCGTCTACG ACATGCTGCC GATGGTCTAT GCACCAAACA GCGCTGGAAT GCTGCGTGAT
GCTGCCATTG AGGTGAAGTT TGATCTTAAC GTCAGTAAAA GCGCTTGGTT AACGGCGATC
CCGCTGGCGC AGCAGTTCTG GCAAACGGTC GCCAGAGATC CGCGTATCAG CGAGGCGTTT
CGCCACATTG CGCAAGAAAT GCCGGAAAAA ATCCGGCAAA TCGAAGAGAA AGTTGCCCGC
ATGGGCGGGT AA
 
Protein sequence
MSELTDLLLQ GPRSAPELRQ RLAISQATFS RLVAREDRVI RFGKARATRY ALLRPYRGIE 
RIPVWRVDDT GKAHKFADIR LCWPQGSCLV TGADGDEQWF DGLPWYLTDL RPQGFLGRAW
GRKLAAQLNL TDDIRLWQEE DVLYALTVFN GEYTGGWLVG EGNYQRWITA QHPAEIPLDQ
KLTHYEQLAS DALAGEIVGS SAGGEQPKFT YYAQTPSGNK HVLVKFTVPQ QTAVSQRWGD
LLIAESIAAQ ILRDGGIHAI ESTVLVTSNR QVFLEAERFD CKGNDGRLPI VSLEAVQSEF
ISSPGSWPQA MRRLCEQQLV THQSVAQTEV IWAFGRLIAN SDMHAGNLSF YLSEPPFALT
PVYDMLPMVY APNSAGMLRD AAIEVKFDLN VSKSAWLTAI PLAQQFWQTV ARDPRISEAF
RHIAQEMPEK IRQIEEKVAR MGG