Gene EcDH1_2497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2497 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2672941 
End bp2674308 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTerminase 
Protein accessionACX40134 
Protein GI260449712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0224039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACTA AATTAACCGG CTATGTATGG GATGGTTGCG CTGCATCAGG CATGAAGTTA 
TCCAGCGTGG CAATTATGGC CCGCCTGGCT GATTTCAGTA ATGACGAAGG TGTGTGCTGG
CCATCAATTG AAACCATTGC CCGTCAGATT GGCGCGGGGA TGAGTACCGT CAGAACGGCT
ATCGCACGGC TGGAAGCAGA AGGCTGGTTA ACGCGTAAGG CGCGTCGCCA GGGTGATGGT
TCATCACCCC ACTGTGCCGT GGTGGATGAA TATCACGAGC ACGCCACAGA TGCGCTTTAC
ACCACGATGC TTACCGGGAT GGGGGCGCGA CGCCAGCCAC TGATGTGGGC CATTACCACC
GCCGGGTACA ACATTGAGGG GCCGTGCTAC GACAAACGGC GGGAAGTCAT CGAGATGCTC
AACGGCTCGG TGCCAAACGA TGAACTGTTC GGGATCATCT ATACCGTTGA TGAAGGTGAC
GACTGGACCG ACCCGCAGGT GCTGGAAAAA GCCAATCCAA ATATTGGCGT GTCGGTTTAT
CGCGAATTTT TGTTAAGTCA GCAGCAGCGT GCGAAAAATA ACGCCCGTCT GGCAAACGTC
TTTAAAACAA AACACCTCAA TATCTGGGCG TCGGCGCGTT CGGCGTATTT CAACCTGGTG
AGCTGGCAGA GCTGCGAGGA TAAATCACTG ACCCTTGAGC AGTTCGAGGG GCAGCCGTGC
ATTCTGGCCT TTGACCTGGC GCGTAAGCTG GATATGAACA GCATGGCGCG ACTTTATACC
CGCGAGATTG ACGGTAAAAC GCATTACTAC AGTGTGGCCC CGCGTTTCTG GGTACCGTAT
GACACGGTGT ACAGCGTCGA GAAAAATGAA GATCGCCGGA CAGCCGAACG CTTTCAGAAA
TGGGTGGAAA TGGGCGTTCT GACCGTTACC GATGGTGCGG AGGTGGATTA TCGCTACATC
CTCGAAGAGG CCAAAGCGGC GAACAAAATC AGCCCGGTCA GTGAGTCACC CATCGACCCC
TTCGGGGCGA CCGGGCTGTC ACATGACCTT GCTGATGAAG ACCTGAACCC CGTCACCATC
ATTCAGAACT ACACCAACAT GTCCGATCCG ATGAAAGAGC TGGAAGCGGC GATTGAATCG
GGGCGCTTTC ATCATGACGG CAATCCCATC ATGACCTGGT GTATCGGCAA CGTGGTCGGC
AAAACCATTC CGGGTAACGA TGATGTGGTG AAGCCCGTCA AGGAGCAGGC GGAAAACAAA
ATCGATGGTG CAGTTGCGCT GATTATGGCG GTTGGCAGAG CCATGCTGTA CGAGAAAGAA
GACACGCTGT CTGATCACAT TGAGTCCTAC GGGATCCGCT CGCTTTAA
 
Protein sequence
MSTKLTGYVW DGCAASGMKL SSVAIMARLA DFSNDEGVCW PSIETIARQI GAGMSTVRTA 
IARLEAEGWL TRKARRQGDG SSPHCAVVDE YHEHATDALY TTMLTGMGAR RQPLMWAITT
AGYNIEGPCY DKRREVIEML NGSVPNDELF GIIYTVDEGD DWTDPQVLEK ANPNIGVSVY
REFLLSQQQR AKNNARLANV FKTKHLNIWA SARSAYFNLV SWQSCEDKSL TLEQFEGQPC
ILAFDLARKL DMNSMARLYT REIDGKTHYY SVAPRFWVPY DTVYSVEKNE DRRTAERFQK
WVEMGVLTVT DGAEVDYRYI LEEAKAANKI SPVSESPIDP FGATGLSHDL ADEDLNPVTI
IQNYTNMSDP MKELEAAIES GRFHHDGNPI MTWCIGNVVG KTIPGNDDVV KPVKEQAENK
IDGAVALIMA VGRAMLYEKE DTLSDHIESY GIRSL