Gene EcDH1_3452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3452 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3701650 
End bp3703893 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent siderophore receptor 
Protein accessionACX41067 
Protein GI260450645 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.380409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGTT CCAAAACTGC TCAGCCAAAA CACTCACTGC GTAAAATCGC AGTTGTAGTA 
GCCACAGCGG TTAGCGGCAT GTCTGTTTAT GCACAGGCAG CGGTTGAACC GAAAGAAGAC
ACTATCACCG TTACCGCTGC ACCTGCGCCG CAAGAAAGCG CATGGGGGCC TGCTGCAACT
ATTGCGGCGC GACAGTCTGC TACCGGCACT AAAACCGATA CGCCGATTCA AAAAGTGCCA
CAGTCTATTT CTGTTGTGAC CGCCGAAGAG ATGGCGCTGC ATCAGCCGAA GTCGGTAAAA
GAAGCGCTTA GCTACACGCC GGGTGTCTCT GTTGGTACGC GTGGCGCATC CAACACCTAT
GACCACCTGA TCATTCGCGG CTTTGCGGCA GAAGGCCAAA GCCAGAATAA CTATCTGAAT
GGCCTGAAGT TGCAGGGCAA CTTCTATAAC GATGCGGTCA TTGACCCGTA TATGCTGGAA
CGCGCTGAAA TTATGCGTGG CCCGGTTTCC GTGCTTTACG GTAAAAGCAG TCCTGGCGGC
CTGTTGAATA TGGTCAGCAA GCGTCCGACC ACCGAACCGC TGAAAGAAGT TCAGTTTAAA
GCCGGTACTG ACAGCCTGTT CCAGACTGGT TTTGACTTTA GCGATTCGTT GGATGATGAC
GGTGTTTACT CTTATCGCCT GACCGGTCTT GCGCGTTCTG CCAATGCCCA GCAGAAAGGG
TCAGAAGAGC AGCGTTATGC TATTGCACCG GCGTTCACCT GGCGTCCGGA TGATAAAACC
AATTTTACCT TCCTTTCTTA CTTCCAGAAC GAGCCGGAAA CCGGTTATTA CGGCTGGTTG
CCGAAAGAGG GAACCGTTGA GCCGCTGCCG AACGGTAAGC GTCTGCCGAC AGACTTTAAT
GAAGGGGCGA AGAACAACAC CTATTCTCGT AATGAGAAGA TGGTCGGCTA CAGCTTCGAT
CACGAATTTA ACGACACCTT TACTGTGCGT CAGAACCTGC GCTTTGCTGA AAACAAAACC
TCGCAAAACA GCGTTTATGG TTACGGCGTC TGCTCCGATC CGGCGAATGC TTACAGCAAA
CAGTGTGCGG CATTAGCGCC AGCGGATAAA GGCCATTATC TGGCACGTAA ATACGTCGTT
GATGATGAGA AGCTGCAAAA CTTCTCCGTT GATACCCAGT TGCAGAGCAA GTTTGCCACT
GGCGATATCG ACCACACCCT GCTGACCGGT GTCGACTTTA TGCGTATGCG TAATGACATC
AACGCCTGGT TTGGTTACGA CGACTCTGTG CCACTGCTCA ATCTGTACAA TCCGGTGAAT
ACCGATTTCG ACTTCAATGC CAAAGATCCG GCAAACTCCG GCCCTTACCG CATTCTGAAT
AAACAGAAAC AAACGGGCGT TTATGTTCAG GATCAGGCGC AGTGGGATAA AGTGCTGGTC
ACCCTAGGCG GTCGTTATGA CTGGGCAGAT CAAGAATCTC TTAACCGCGT TGCCGGGACG
ACCGATAAAC GTGATGACAA ACAGTTTACC TGGCGTGGTG GTGTTAACTA CCTGTTTGAT
AATGGTGTAA CACCTTACTT CAGCTATAGC GAATCGTTTG AACCTTCTTC GCAAGTTGGG
AAGGATGGTA ATATTTTCGC ACCGTCTAAA GGTAAGCAGT ATGAAGTCGG CGTGAAATAT
GTACCGGAAG ATCGTCCGAT TGTAGTTACT GGTGCCGTGT ATAATCTCAC TAAAACCAAC
AACCTGATGG CGGACCCTGA GGGTTCCTTC TTCTCGGTTG AAGGTGGCGA GATCCGCGCA
CGTGGCGTAG AAATCGAAGC GAAAGCGGCG CTGTCGGCGA GTGTTAACGT AGTCGGTTCT
TATACTTACA CCGATGCGGA ATACACCACC GATACTACCT ATAAAGGCAA TACGCCTGCA
CAGGTGCCAA AACACATGGC TTCGTTGTGG GCTGACTACA CCTTCTTTGA CGGTCCGCTT
TCAGGTCTGA CGCTGGGCAC CGGTGGTCGT TATACTGGCT CCAGTTATGG TGATCCGGCT
AACTCCTTTA AAGTGGGAAG TTATACGGTC GTGGATGCGT TAGTACGTTA TGATCTGGCG
CGAGTCGGCA TGGCTGGCTC CAACGTGGCG CTGCATGTTA ACAACCTGTT CGATCGTGAA
TACGTCGCCA GCTGCTTTAA CACTTATGGC TGCTTCTGGG GCGCAGAACG TCAGGTCGTT
GCAACCGCAA CCTTCCGTTT CTAA
 
Protein sequence
MARSKTAQPK HSLRKIAVVV ATAVSGMSVY AQAAVEPKED TITVTAAPAP QESAWGPAAT 
IAARQSATGT KTDTPIQKVP QSISVVTAEE MALHQPKSVK EALSYTPGVS VGTRGASNTY
DHLIIRGFAA EGQSQNNYLN GLKLQGNFYN DAVIDPYMLE RAEIMRGPVS VLYGKSSPGG
LLNMVSKRPT TEPLKEVQFK AGTDSLFQTG FDFSDSLDDD GVYSYRLTGL ARSANAQQKG
SEEQRYAIAP AFTWRPDDKT NFTFLSYFQN EPETGYYGWL PKEGTVEPLP NGKRLPTDFN
EGAKNNTYSR NEKMVGYSFD HEFNDTFTVR QNLRFAENKT SQNSVYGYGV CSDPANAYSK
QCAALAPADK GHYLARKYVV DDEKLQNFSV DTQLQSKFAT GDIDHTLLTG VDFMRMRNDI
NAWFGYDDSV PLLNLYNPVN TDFDFNAKDP ANSGPYRILN KQKQTGVYVQ DQAQWDKVLV
TLGGRYDWAD QESLNRVAGT TDKRDDKQFT WRGGVNYLFD NGVTPYFSYS ESFEPSSQVG
KDGNIFAPSK GKQYEVGVKY VPEDRPIVVT GAVYNLTKTN NLMADPEGSF FSVEGGEIRA
RGVEIEAKAA LSASVNVVGS YTYTDAEYTT DTTYKGNTPA QVPKHMASLW ADYTFFDGPL
SGLTLGTGGR YTGSSYGDPA NSFKVGSYTV VDALVRYDLA RVGMAGSNVA LHVNNLFDRE
YVASCFNTYG CFWGAERQVV ATATFRF