Gene EcDH1_3772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3772 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4069690 
End bp4071423 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content53% 
IMG OID 
Productsurface antigen (D15) 
Protein accessionACX41377 
Protein GI260450955 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCTATA TCCGACAGTT ATGCTGTGTA AGCTTACTCT GCTTAAGCGG ATCTGCCGTC 
GCCGCGAACG TCCGTCTACA GGTCGAGGGG TTATCGGGAC AGCTGGAAAA GAACGTTCGT
GCGCAGCTTT CTACGATTGA AAGTGATGAA GTGACGCCAG ACCGTCGCTT TCGCGCACGC
GTCGATGATG CCATCCGCGA AGGTCTGAAA GCGCTGGGTT ATTACCAGCC GACCATTGAA
TTTGATCTCC GTCCACCGCC AAAGAAAGGG CGGCAGGTAT TGATCGCCAA AGTCACGCCA
GGCGTGCCGG TGTTAATTGG CGGCACCGAT GTGGTATTGC GCGGCGGCGC GCGGACCGAT
AAAGACTATT TGAAATTGCT CGATACTCGC CCGGCTATTG GCACGGTACT GAACCAGGGC
GATTATGAAA ATTTCAAAAA GTCCTTAACC AGCATTGCGT TGCGTAAAGG TTATTTCGAT
AGCGAATTTA CCAAAGCGCA GCTGGGCATT GCGCTCGGCC TGCATAAAGC CTTCTGGGAT
ATTGATTATA ACAGTGGCGA ACGTTACCGC TTTGGGCATG TGACCTTTGA AGGATCACAA
ATCCGCGATG AATACCTGCA AAATCTGGTG CCGTTTAAAG AGGGCGATGA GTACGAATCG
AAAGATCTGG CAGAACTGAA CCGCCGACTT TCTGCTACCG GCTGGTTTAA CTCGGTGGTG
GTGGCTCCAC AATTTGATAA AGCGCGCGAA ACGAAAGTAT TACCATTGAC GGGCGTGGTT
TCGCCGCGAA CAGAAAACAC CATCGAAACC GGGGTCGGTT ACTCTACGGA CGTGGGACCG
CGCGTGAAAG CGACGTGGAA AAAGCCGTGG ATGAACTCTT ATGGTCACAG TCTGACCACC
AGTACTAGTA TTTCCGCGCC GGAACAGACC CTCGACTTCA GCTATAAAAT GCCGCTGCTG
AAGAATCCAC TGGAACAATA TTATTTGGTG CAGGGCGGTT TTAAGCGCAC TGACCTGAAC
GATACCGAAT CTGACTCCAC TACGCTGGTG GCTTCTCGCT ACTGGGATCT CTCCAGCGGC
TGGCAGCGTG CCATTAACCT GCGCTGGAGT CTCGACCACT TTACTCAGGG TGAAATTACC
AATACCACGA TGCTGTTTTA TCCTGGGGTG ATGATTAGCC GCACGCGTTC TCGTGGTGGC
CTGATGCCAA CCTGGGGCGA CTCGCAACGC TACTCTATCG ACTACTCCAA CACGGCCTGG
GGTTCAGATG TCGATTTCTC CGTTTTCCAG GCGCAGAACG TCTGGATCCG CACACTGTAC
GATCGCCATC GTTTTGTTAC ACGCGGCACG CTGGGCTGGA TTGAAACCGG TGATTTCGAC
AAAGTACCGC CGGATCTGCG TTTCTTCGCC GGGGGCGACC GCAGTATTCG TGGCTACAAA
TACAAATCTA TCGCTCCGAA ATACGCCAAC GGTGACCTGA AAGGGGCCTC GAAGTTGATA
ACCGGATCGC TGGAATACCA GTACAACGTG ACCGGAAAAT GGTGGGGCGC GGTGTTTGTC
GATAGTGGCG AAGCGGTAAG CGATATTCGC CGCAGCGACT TTAAAACCGG TACCGGGGTC
GGCGTGCGCT GGGAATCGCC GGTCGGGCCA ATCAAACTCG ATTTTGCCGT ACCGGTCGCG
GATAAAGACG AACACGGGTT ACAGTTTTAC ATCGGTCTGG GGCCAGAATT ATGA
 
Protein sequence
MRYIRQLCCV SLLCLSGSAV AANVRLQVEG LSGQLEKNVR AQLSTIESDE VTPDRRFRAR 
VDDAIREGLK ALGYYQPTIE FDLRPPPKKG RQVLIAKVTP GVPVLIGGTD VVLRGGARTD
KDYLKLLDTR PAIGTVLNQG DYENFKKSLT SIALRKGYFD SEFTKAQLGI ALGLHKAFWD
IDYNSGERYR FGHVTFEGSQ IRDEYLQNLV PFKEGDEYES KDLAELNRRL SATGWFNSVV
VAPQFDKARE TKVLPLTGVV SPRTENTIET GVGYSTDVGP RVKATWKKPW MNSYGHSLTT
STSISAPEQT LDFSYKMPLL KNPLEQYYLV QGGFKRTDLN DTESDSTTLV ASRYWDLSSG
WQRAINLRWS LDHFTQGEIT NTTMLFYPGV MISRTRSRGG LMPTWGDSQR YSIDYSNTAW
GSDVDFSVFQ AQNVWIRTLY DRHRFVTRGT LGWIETGDFD KVPPDLRFFA GGDRSIRGYK
YKSIAPKYAN GDLKGASKLI TGSLEYQYNV TGKWWGAVFV DSGEAVSDIR RSDFKTGTGV
GVRWESPVGP IKLDFAVPVA DKDEHGLQFY IGLGPEL