Gene EcDH1_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3744 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4036554 
End bp4038368 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content32% 
IMG OID 
Producthypothetical protein 
Protein accessionACX41349 
Protein GI260450927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA TATCAGATTT AAATTATTCT CAACACATTA CATTAGCCGA CAATTTTAAA 
CAAAAAAGTG AAGTTTTAAA TACCTGGCGT GTTGGAATGA ATGATTTTGC CCGTATTGCC
GGGGGGCAGG ATAACAGAAG GAATATTCTT TCTCCTGGAG CATTTTTAGA GTTTTTGGCA
AAGATATTTA CCCTGGGTTA TGTGGATTTT AGCAAACGCT CCAACGAAGC GGGTAGAAAT
ATGATGGCTC ATATTAAGTC CTCATCTTAT TCTAAAGATA CTAATGGCAA TGAAAAAATG
AAGTTTTACA TGAATAATCC TGTAGGGGAA CGAGCGGATT CACCCAAGGT GATTATAGAA
ATTTCACTTT CCACTATCAC TACTATGGGG ACTCGTCAAG GACATACAGC CATTATATTT
CCACAACCTG ATGGTTCGAC TAACCGTTAT GAAGGGAAGT CCTTTGAAAG AAAAGATGAG
AGTTCATTAC ACCTGATTAC TAACAAGGTT CTGGCGTGTT ACCAAAGTGA AGCTAACAAG
AAAATAGCGC GTCTATTAAA TAATAATCAG GAGTTAAATA ATCTACAGAA ATTAAATAAT
CTACAGAAGT TAAATAATCT ACTGAAGTTA AATAATATAC AGGGGTTAAA TAATCCTCAG
GAGTTAAATA ATCCGCAGAA TTTAAATGAT TCTCAGGAGT TAAATAACTC GCAGGAATTA
AATAGTCCAC AGGAGTTAAA TGATCCGCAG GAGTTAAATA ATTCTCAGGA CTTAAATAAC
TCTAAGGTGA GTTGTACAGT TTCAGTTGAT TCTACGATTA CGGGTTTATT AAAAGAACCA
TTGAATAATG CATTATTAGC AATAAGGAAC GAACATCTGC TATTAATGCC TCATGTATGT
GATGAATCGA TTTCATACTT ACTGGGCGAA AAAGGTATAC TTGAAGAAAT AGATAAGCTC
TACGCATTAA ATGATCACGG AATTGATAAT GACAAAGTAG GTAACAATGA AATTAATGAC
ATCAAAGTTA ACCTGTCTCA TATTCTTATT GATTCCTTAG ATGATGCAAA GGTTAACCTT
ACACCGGTCA TCGATTCGAT TCTGGAGACT TTTTCAAAAT CCCCATATAT TAATGATGTA
AGAATACTGG ATTGGTGTTT TAATAAAAGC ATGCAATATT TTGATGATAC TAAAAAGATA
AAGCATGCAT GCTCCGTAAT AAATCATATT AATCTTCGCA GCGATCAGTC TAAAATAGCT
GAGACATTAT TTTTCAATCT CGATAAAGAA CCCTATAAAA ATAGCCCTGA ATTACAGGGG
TTGATTTGGA ATAAGTTGGT TGTATATGTC AATGAATTTA ACTTAAGTAA TCGAGAAAAA
ACAAATTTAA TACAAAGGCT ATTTGATAAT GTTGAGTCTA TATTTAATGA AGTACCTGTC
AGCATTTTAG TGAATGATAT TTTTATGAAT GATTTCTTTA TGAAAAATCC TGAGATGATT
AATTGGTACT TCCCTCAGTT ACTTAAGAGT TATGAGGGTG AAAAGATTTA TTTTGATAAT
TTAAAATATG ATTTAAATGA TAATGATAAG GAATCTAATA AAGAAATTTT GAAGAATCAA
CCAGATAATG TTATCAAAGA AAAACTGAAT AATGAATACA AACTTAGATT TAGAATGATG
CAAACTATCT TGCAATCGAG AGTTAATGTA TTACCATATA TTAATGAACA GCGTTTAAAT
AAACTAAATC CACCGGAAAA TTTACGTATA GCAATAGAAC ACTTTGGGTG GAAGAATAGA
CCTATCACTG CATAA
 
Protein sequence
MSKISDLNYS QHITLADNFK QKSEVLNTWR VGMNDFARIA GGQDNRRNIL SPGAFLEFLA 
KIFTLGYVDF SKRSNEAGRN MMAHIKSSSY SKDTNGNEKM KFYMNNPVGE RADSPKVIIE
ISLSTITTMG TRQGHTAIIF PQPDGSTNRY EGKSFERKDE SSLHLITNKV LACYQSEANK
KIARLLNNNQ ELNNLQKLNN LQKLNNLLKL NNIQGLNNPQ ELNNPQNLND SQELNNSQEL
NSPQELNDPQ ELNNSQDLNN SKVSCTVSVD STITGLLKEP LNNALLAIRN EHLLLMPHVC
DESISYLLGE KGILEEIDKL YALNDHGIDN DKVGNNEIND IKVNLSHILI DSLDDAKVNL
TPVIDSILET FSKSPYINDV RILDWCFNKS MQYFDDTKKI KHACSVINHI NLRSDQSKIA
ETLFFNLDKE PYKNSPELQG LIWNKLVVYV NEFNLSNREK TNLIQRLFDN VESIFNEVPV
SILVNDIFMN DFFMKNPEMI NWYFPQLLKS YEGEKIYFDN LKYDLNDNDK ESNKEILKNQ
PDNVIKEKLN NEYKLRFRMM QTILQSRVNV LPYINEQRLN KLNPPENLRI AIEHFGWKNR
PITA