Gene EcDH1_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3651 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3933674 
End bp3935071 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content40% 
IMG OID 
ProductATPase associated with various cellular activities AAA_5 
Protein accessionACX41263 
Protein GI260450841 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGG CATATCTTAT GGAATCTATT CAACCCTGGA TTGAAAAATT TATTAAGCAA 
GCACAGCAAC AACGTTCGCA ATCCACTAAA GATTATCCAA CGTCTTACCG TAACCTGCGA
GTAAAATTGA GTTTCGGTTA TGGTAATTTT ACGTCTATTC CCTGGTTTGC ATTTCTTGGA
GAAGGTCAGG AAGCTTCTAA CGGTATATAT CCCGTTATTC TCTATTATAA AGATTTTGAT
GAGTTGGTTT TGGCTTATGG TATAAGCGAC ACGAATGAAC CACATGCCCA ATGGCAGTTC
TCTTCAGACA TACCTAAAAC AATCGCAGAG TATTTTCAGG CAACTTCGGG TGTATATCCT
AAAAAATACG GACAGTCCTA TTACGCCTGT TCCCAAAAAG TCTCACAGGG TATTGATTAC
ACCCGATTTG CCTCTATGCT GGACAACATA ATCAACGACT ATAAATTAAT ATTTAATTCT
GGCAAGAGTG TTATTCCACC TATGTCAAAA ACTGAATCAT ACTGTCTGGA AGATGCGTTA
AATGATTTGT TTATCCCTGA AACCACAATA GAGACGATAC TCAAACGATT AACCATCAAA
AAAAATATTA TCCTCCAGGG GCCGCCCGGC GTTGGAAAAA CCTTTGTTGC ACGCCGTCTG
GCTTACTTGC TGACAGGAGA AAAGGCTCCG CAACGCGTCA ATATGGTTCA GTTCCATCAA
TCTTATAGCT ATGAGGATTT TATACAGGGC TATCGTCCGA ATGGCGTCGG CTTCCGACGT
AAAGACGGCA TATTTTACAA TTTTTGTCAG CAAGCTAAAG AGCAGCCAGA GAAAAAGTAT
ATTTTTATTA TAGATGAAAT CAATCGTGCC AATCTCAGTA AAGTATTTGG CGAAGTGATG
ATGTTAATGG AACATGATAA ACGAGGTGAA AACTGGTCTG TTCCCCTAAC CTACTCCGAA
AACGATGAAG AACGATTCTA TGTCCCGGAG AATGTTTATA TCATCGGTTT AATGAATACT
GCCGATCGCT CTCTGGCCGT TGTTGACTAT GCCCTACGCA GACGATTTTC TTTCATAGAT
ATTGAGCCAG GTTTTGATAC ACCACAGTTC CGGAATTTTT TACTGAATAA AAAAGCAGAA
CCTTCATTTG TTGAGTCTTT ATGCCAAAAA ATGAACGAGT TGAACCAGGA AATCAGCAAA
GAGGCCACTA TCCTTGGGAA AGGATTCCGC ATTGGGCATA GTTACTTCTG CTGTGGGTTG
GAAGATGGCA CCTCTCCGGA TACGCAATGG CTTAATGAAA TTGTGATGAC GGATATCGCC
CCTTTACTCG AAGAATATTT CTTTGATGAC CCCTATAAAC AACAGAAATG GACCAACAAA
TTATTAGGGG ACTCATAG
 
Protein sequence
MRKAYLMESI QPWIEKFIKQ AQQQRSQSTK DYPTSYRNLR VKLSFGYGNF TSIPWFAFLG 
EGQEASNGIY PVILYYKDFD ELVLAYGISD TNEPHAQWQF SSDIPKTIAE YFQATSGVYP
KKYGQSYYAC SQKVSQGIDY TRFASMLDNI INDYKLIFNS GKSVIPPMSK TESYCLEDAL
NDLFIPETTI ETILKRLTIK KNIILQGPPG VGKTFVARRL AYLLTGEKAP QRVNMVQFHQ
SYSYEDFIQG YRPNGVGFRR KDGIFYNFCQ QAKEQPEKKY IFIIDEINRA NLSKVFGEVM
MLMEHDKRGE NWSVPLTYSE NDEERFYVPE NVYIIGLMNT ADRSLAVVDY ALRRRFSFID
IEPGFDTPQF RNFLLNKKAE PSFVESLCQK MNELNQEISK EATILGKGFR IGHSYFCCGL
EDGTSPDTQW LNEIVMTDIA PLLEEYFFDD PYKQQKWTNK LLGDS