Gene EcDH1_3659 details

Gene Information

Locus tag: EcDH1_3659
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3945783
End bp: 3947063
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: protein of unknown function DUF445
Protein accession: ACX41270
Protein GI: 260450848
COG category:
COG ID:
TIGRFAM ID:
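
The length fields above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1281 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 3945783
end_bp = 3947063

# Assumption: positions are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should divide into whole codons; the last codon is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # the stop codon adds no residue

print(gene_length)     # 1281
print(remainder)       # 0
print(protein_length)  # 426
```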


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAC TCATCGAACT CAGACGCGCC AAAAGGTTGG CGCTCTCTTT ACTGCTTATC 
GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG
AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG
CTGTTTCGCC GCGTGCCGAT TCCGATCATT TCTCGCCATA CGGCGATTAT CCCGCGTAAT
AAAGACCGGA TTGGCGAAAA TCTCGGCCAG TTCGTGCAGG AAAAATTTCT TGATACTCAA
TCCCTGGTGG CATTGATTCG ACGCCACGAA CCGGCGTTGC TGATTGGCAA CTGGTTTAGC
CAGCCAGAAA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG CGGTTTTCTC
GAACTTACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCGGTCCA TCGGGCGATT
GATAAGGTCG ATCTTTCCGG CACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT
CGTCATCAGG TGCTGCTGGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT
AAATCGCGCA AGTTTATCGC CCAGCAAATT GTTCGCTGGC TGGAGAGCGA GCATCCACTG
AAAGCCAAAA TTCTCCCCAC CGAATGGTTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC
GCGGTGAATT CTTTGCTTGA TGATATTAGT CGCGATCGTG CGCATCAGAT CCGCCATGCG
TTTGATCGCG CCACCTTCGC CCTGATCGAC AAGCTGAAAA ACGATCCGGA AATGGCAGCG
CGAGCCGATG CCGTAAAAAG CTATCTGAAA GAAGATGAAG CTTTTAATCG CTATCTCAGT
GAATTGTGGG GGGATTTACG GGAATGGCTG AAAGTGGATA TCAACAGTGA AGATTCTCGT
GTGAAAGAAC GCATCGCACG AGCGGGTCAA TGGTTTGGCG AAACGTTAAT TGCCGATGAT
GCCTTGCGGG CGTCGTTAAA TGGTCATCTT GAACAAGCCG CGCACCGCGT CGCGCCTGAG
TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACGGTAA AAAGCTGGGA TGCGCGGGAT
ATGTCGCGGC AAATAGAGTT AAATATCGGC AAAGATCTGC AGTTTATTCG TGTCAACGGT
ACGCTGGTTG GCGGTTGTAT TGGGCTAATT TTATATTTGC TGTCGCAGCT CCCGGCCTTG
TTCCCCCTCG GCAATTTTTA G
 
Protein sequence
MNKLIELRRA KRLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA 
LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PALLIGNWFS
QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND
RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD
AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS
ELWGDLREWL KVDINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE
FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL
FPLGNF
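
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it builds a codon table from the standard NCBI base ordering (the codon-to-amino-acid assignments of translation table 11 match the standard table; the alternative start codons of table 11 are not handled, which does not matter here because the gene begins with ATG), translates up to the first stop codon, and computes the GC fraction. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate whole codons from position 0 until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAATAAAC TCATCGAACT CAGACGCGCC AAAAGGTTGG CGCTCTCTTT ACTGCTTATC"
print(translate(first_row))  # MNKLIELRRAKRLALSLLLI
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.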