Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3772 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4069690 |
End bp | 4071423 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | surface antigen (D15) |
Protein accession | ACX41377 |
Protein GI | 260450955 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCTATA TCCGACAGTT ATGCTGTGTA AGCTTACTCT GCTTAAGCGG ATCTGCCGTC GCCGCGAACG TCCGTCTACA GGTCGAGGGG TTATCGGGAC AGCTGGAAAA GAACGTTCGT GCGCAGCTTT CTACGATTGA AAGTGATGAA GTGACGCCAG ACCGTCGCTT TCGCGCACGC GTCGATGATG CCATCCGCGA AGGTCTGAAA GCGCTGGGTT ATTACCAGCC GACCATTGAA TTTGATCTCC GTCCACCGCC AAAGAAAGGG CGGCAGGTAT TGATCGCCAA AGTCACGCCA GGCGTGCCGG TGTTAATTGG CGGCACCGAT GTGGTATTGC GCGGCGGCGC GCGGACCGAT AAAGACTATT TGAAATTGCT CGATACTCGC CCGGCTATTG GCACGGTACT GAACCAGGGC GATTATGAAA ATTTCAAAAA GTCCTTAACC AGCATTGCGT TGCGTAAAGG TTATTTCGAT AGCGAATTTA CCAAAGCGCA GCTGGGCATT GCGCTCGGCC TGCATAAAGC CTTCTGGGAT ATTGATTATA ACAGTGGCGA ACGTTACCGC TTTGGGCATG TGACCTTTGA AGGATCACAA ATCCGCGATG AATACCTGCA AAATCTGGTG CCGTTTAAAG AGGGCGATGA GTACGAATCG AAAGATCTGG CAGAACTGAA CCGCCGACTT TCTGCTACCG GCTGGTTTAA CTCGGTGGTG GTGGCTCCAC AATTTGATAA AGCGCGCGAA ACGAAAGTAT TACCATTGAC GGGCGTGGTT TCGCCGCGAA CAGAAAACAC CATCGAAACC GGGGTCGGTT ACTCTACGGA CGTGGGACCG CGCGTGAAAG CGACGTGGAA AAAGCCGTGG ATGAACTCTT ATGGTCACAG TCTGACCACC AGTACTAGTA TTTCCGCGCC GGAACAGACC CTCGACTTCA GCTATAAAAT GCCGCTGCTG AAGAATCCAC TGGAACAATA TTATTTGGTG CAGGGCGGTT TTAAGCGCAC TGACCTGAAC GATACCGAAT CTGACTCCAC TACGCTGGTG GCTTCTCGCT ACTGGGATCT CTCCAGCGGC TGGCAGCGTG CCATTAACCT GCGCTGGAGT CTCGACCACT TTACTCAGGG TGAAATTACC AATACCACGA TGCTGTTTTA TCCTGGGGTG ATGATTAGCC GCACGCGTTC TCGTGGTGGC CTGATGCCAA CCTGGGGCGA CTCGCAACGC TACTCTATCG ACTACTCCAA CACGGCCTGG GGTTCAGATG TCGATTTCTC CGTTTTCCAG GCGCAGAACG TCTGGATCCG CACACTGTAC GATCGCCATC GTTTTGTTAC ACGCGGCACG CTGGGCTGGA TTGAAACCGG TGATTTCGAC AAAGTACCGC CGGATCTGCG TTTCTTCGCC GGGGGCGACC GCAGTATTCG TGGCTACAAA TACAAATCTA TCGCTCCGAA ATACGCCAAC GGTGACCTGA AAGGGGCCTC GAAGTTGATA ACCGGATCGC TGGAATACCA GTACAACGTG ACCGGAAAAT GGTGGGGCGC GGTGTTTGTC GATAGTGGCG AAGCGGTAAG CGATATTCGC CGCAGCGACT TTAAAACCGG TACCGGGGTC GGCGTGCGCT GGGAATCGCC GGTCGGGCCA ATCAAACTCG ATTTTGCCGT ACCGGTCGCG GATAAAGACG AACACGGGTT ACAGTTTTAC ATCGGTCTGG GGCCAGAATT ATGA
|
Protein sequence | MRYIRQLCCV SLLCLSGSAV AANVRLQVEG LSGQLEKNVR AQLSTIESDE VTPDRRFRAR VDDAIREGLK ALGYYQPTIE FDLRPPPKKG RQVLIAKVTP GVPVLIGGTD VVLRGGARTD KDYLKLLDTR PAIGTVLNQG DYENFKKSLT SIALRKGYFD SEFTKAQLGI ALGLHKAFWD IDYNSGERYR FGHVTFEGSQ IRDEYLQNLV PFKEGDEYES KDLAELNRRL SATGWFNSVV VAPQFDKARE TKVLPLTGVV SPRTENTIET GVGYSTDVGP RVKATWKKPW MNSYGHSLTT STSISAPEQT LDFSYKMPLL KNPLEQYYLV QGGFKRTDLN DTESDSTTLV ASRYWDLSSG WQRAINLRWS LDHFTQGEIT NTTMLFYPGV MISRTRSRGG LMPTWGDSQR YSIDYSNTAW GSDVDFSVFQ AQNVWIRTLY DRHRFVTRGT LGWIETGDFD KVPPDLRFFA GGDRSIRGYK YKSIAPKYAN GDLKGASKLI TGSLEYQYNV TGKWWGAVFV DSGEAVSDIR RSDFKTGTGV GVRWESPVGP IKLDFAVPVA DKDEHGLQFY IGLGPEL
|
| |