Gene EcDH1_3594 details


Gene Information

Locus tag: EcDH1_3594
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3868578
End bp: 3871040
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: aspartate kinase
Protein accession: ACX41207
Protein GI: 260450785
COG category:
COG ID:
TIGRFAM ID:
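
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3868578, 3871040)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

Under these assumptions the computed values can be compared directly with the Gene Length and Protein Length fields.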


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGTGTT 
GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC
GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT
TTACCCAATA TCAGCGATGC CGAACGTATT TTTGCCGAAC TTTTGACGGG ACTCGCCGCC
GCCCAGCCGG GGTTCCCGCT GGCGCAATTG AAAACTTTCG TCGATCAGGA ATTTGCCCAA
ATAAAACATG TCCTGCATGG CATTAGTTTG TTGGGGCAGT GCCCGGATAG CATCAACGCT
GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT ATTAGAAGCG
CGCGGTCACA ACGTTACTGT TATCGATCCG GTCGAAAAAC TGCTGGCAGT GGGGCATTAC
CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG CCGCATTCCG
GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG
GTGCTTGGAC GCAACGGTTC CGACTACTCT GCTGCGGTGC TGGCTGCCTG TTTACGCGCC
GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTCTATA CCTGCGACCC GCGTCAGGTG
CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC
GGCGCTAAAG TTCTTCACCC CCGCACCATT ACCCCCATCG CCCAGTTCCA GATCCCTTGC
CTGATTAAAA ATACCGGAAA TCCTCAAGCA CCAGGTACGC TCATTGGTGC CAGCCGTGAT
GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ACATGGCAAT GTTCAGCGTT
TCTGGTCCGG GGATGAAAGG GATGGTCGGC ATGGCGGCGC GCGTCTTTGC AGCGATGTCA
CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCATCTT CCGAATACAG CATCAGTTTC
TGCGTTCCAC AAAGCGACTG TGTGCGAGCT GAACGGGCAA TGCAGGAAGA GTTCTACCTG
GAACTGAAAG AAGGCTTACT GGAGCCGCTG GCAGTGACGG AACGGCTGGC CATTATCTCG
GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCACTG
GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT
GTCGTGGTAA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC
AATACCGATC AGGTTATCGA AGTGTTTGTG ATTGGCGTCG GTGGCGTTGG CGGTGCGCTG
CTGGAGCAAC TGAAGCGTCA GCAAAGCTGG CTGAAGAATA AACATATCGA CTTACGTGTC
TGCGGTGTTG CCAACTCGAA GGCTCTGCTC ACCAATGTAC ATGGCCTTAA TCTGGAAAAC
TGGCAGAAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC
GTGAAAGAAT ATCATCTGCT GAACCCGGTC ATTGTTGACT GCACTTCCAG CCAGGCAGTG
GCGGATCAAT ATGCCGACTT CCTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG
GCCAACACCT CGTCGATGGA TTACTACCAT CAGTTGCGTT ATGCGGCGGA AAAATCGCGG
CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA
AATCTGCTCA ATGCAGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC TGGTTCGCTT
TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC CACGCTGGCG
CGGGAAATGG GTTATACCGA ACCGGACCCG CGAGATGATC TTTCTGGTAT GGATGTGGCG
CGTAAACTAT TGATTCTCGC TCGTGAAACG GGACGTGAAC TGGAGCTGGC GGATATTGAA
ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCCGAGGGTG ATGTTGCCGC TTTTATGGCG
AATCTGTCAC AACTCGACGA TCTCTTTGCC GCGCGCGTGG CGAAGGCCCG TGATGAAGGA
AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGCG TCTGCCGCGT GAAGATTGCC
GAAGTGGATG GTAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTC
TATAGCCACT ATTATCAGCC GCTGCCGTTG GTACTGCGCG GATATGGTGC GGGCAATGAC
GTTACAGCTG CCGGTGTCTT TGCTGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC
TGA
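
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how that layout could be cleaned and a GC fraction computed. The helper names are hypothetical, and only the first line of the sequence is used here to keep the example short, so its GC value will not necessarily match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as it appears in the record.
first_line = (
    "ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGTGTT"
)

seq = clean_sequence(first_line)
print(len(seq), "bases, GC fraction:", gc_fraction(seq))
```

Passing the full block of sequence lines to `clean_sequence` instead of `first_line` would give the value for the whole gene.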
 
Protein sequence
MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA 
LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA
ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP
ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV
PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD
EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF
CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL
ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL
LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQKELAQAKE PFNLGRLIRL
VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH QLRYAAEKSR
RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA
REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA
NLSQLDDLFA ARVAKARDEG KVLRYVGNID EDGVCRVKIA EVDGNDPLFK VKNGENALAF
YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV