Gene EcDH1_3323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3323 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3573934 
End bp3575259 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content45% 
IMG OID 
Productintegrase family protein 
Protein accessionACX40947 
Protein GI260450525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTT CAAGACAAAA ATTTACCTTC GAAAGACTTC GCAGATTCAC CTTACCGGAA 
GGGAAAAAAC AAACTTTTCT TTGGGATGCA GATGTAACAA CCCTGGCATG CCGAGCAACT
AGCGGAGCAA AAGCCTTTGT ATTCCAAAGC GTATATGCGG GGAAAACCCT TCGCATGACT
ATTGGCAACA TTAACGACTG GAAGATTGAT GATGCGAGAG CCGAGGCAAG ACGGTTACAA
ACATTGATCG ATACAGGGAT AGATCCACGA ATTGCTAAGG CTGTAAAAAT CGCAGAAGCA
GAATCCCTGC AGGCAGAATC ACGTAAAACA AAAGTGACTT TCTCCGTCGC CTGGGAAGAC
TATCTTCAAG AATTGAGAAC CGGTATCAGT GCAAAAACTA AACGCCCATA TTCTACTCGA
TACATTGCCG ATCACATTAA CTTGTCCAGT CGTGGAGGCG AAAGTAAAAA AAGAGGCCAA
GGCCCGACTT CGGCTGGACC ATTGGCTAGT TTGCTCAACC TGCCGTTATC GGAGCTAACC
CCAGATTACA TAGCAGCGTG GCTGAGTACA GAAAGGCAAA ATAGACCTAC CGTCACTGCT
CACGCTTATC GCCTACTACG TGCTTTCATC AAATGGAGTA ATTATCAGAA AAAATATCAA
GGGATCATTC CTGGCGATCT GGCACAAGAT TACAACGTAA GAAAAATGGT TCCCGTGTCA
GCGAGTAAAG CTGATGATTG CCTGCAAAAG GAACAACTAA AAAGCTGGTT TAGTGCCGTG
CGTAGCCTCA ATAATCCTAT TGCATCGGCC TATCTCCAAG TACTTTTGCT CACTGGTGCT
CGGCGTGAAG AAATTGCGTC GCTTCGCTGG TCAGACGTAG ATTTCAAATG GTCAAGCATG
CGAATTAAAG ACAAGATCGA AGGTGAACGT ATCATCCCTC TCACTCCTTA TGTTTCTGAA
TTGTTAAATG TACTAGCGCA ATCCCCAAAT TCTGACGTAA ATAAGGAGGG TTGGGTTTTC
AGAAGTAACA GTAAAAGTGG CAAAATTATT GAGCCGCGTT CAGCGCACAA CAGAGCATTA
GTGCTGGCTG AGTTACCACA TATCAGCCTT CACGGTTTAC GTCGTAGTTT TGGTACTTTG
GCCGAGTGGG TTGAAGTTCC CACTGGTATT GTTGCTCAAA TTATGGGACA CAAACCCAGC
GCTCTTGCCG AAAAACACTA TCGCCGTCGT CCGTTAGATC TGTTACGAAA ATGGCACGAG
AAAATTGAGA CATGGATCTT AAATGAAGCA GGTATTACCA TAAAAAACAA CGTTGATATG
CGTTGA
 
Protein sequence
MALSRQKFTF ERLRRFTLPE GKKQTFLWDA DVTTLACRAT SGAKAFVFQS VYAGKTLRMT 
IGNINDWKID DARAEARRLQ TLIDTGIDPR IAKAVKIAEA ESLQAESRKT KVTFSVAWED
YLQELRTGIS AKTKRPYSTR YIADHINLSS RGGESKKRGQ GPTSAGPLAS LLNLPLSELT
PDYIAAWLST ERQNRPTVTA HAYRLLRAFI KWSNYQKKYQ GIIPGDLAQD YNVRKMVPVS
ASKADDCLQK EQLKSWFSAV RSLNNPIASA YLQVLLLTGA RREEIASLRW SDVDFKWSSM
RIKDKIEGER IIPLTPYVSE LLNVLAQSPN SDVNKEGWVF RSNSKSGKII EPRSAHNRAL
VLAELPHISL HGLRRSFGTL AEWVEVPTGI VAQIMGHKPS ALAEKHYRRR PLDLLRKWHE
KIETWILNEA GITIKNNVDM R