Gene EcDH1_3709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3709 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3997128 
End bp3999452 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content59% 
IMG OID 
ProductTonB-dependent siderophore receptor 
Protein accessionACX41317 
Protein GI260450895 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0483354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCGT TACGCGTTTT TCGTAAAACA ACACCTTTGG TTAACACCAT TCGCCTGAGC 
CTGCTGCCGC TGGCCGGTCT CTCGTTTTCC GCTTTTGCTG CACAGGTTAA TATCGCACCG
GGATCGCTCG ATAAAGCGCT CAATCAGTAT GCCGCACACA GCGGATTTAC CCTCTCGGTT
GACGCCAGCC TGACGCGCGG CAAGCAGAGC AACGGCCTGC ACGGCGATTA CGACGTCGAG
AGCGGCCTGC AACAACTGCT GGACGGCAGC GGACTGCAGG TAAAACCGCT GGGAAATAAC
AGCTGGACGC TGGAGCCCGC GCCCGCACCA AAAGAAGATG CCCTGACCGT GGTCGGCGAC
TGGCTGGGTG ATGCGCGTGA AAACGACGTA TTTGAACATG CTGGCGCGCG TGACGTGATC
CGCCGTGAGG ATTTCGCCAA AACCGGCGCA ACCACCATGC GTGAGGTACT TAACCGCATC
CCTGGCGTCA GCGCGCCGGA AAACAACGGC ACCGGCAGCC ACGACCTGGC GATGAACTTT
GGCATCCGGG GCCTGAACCC GCGCCTCGCC AGCCGCTCGA CCGTCCTGAT GGACGGCATC
CCCGTCCCCT TCGCCCCTTA CGGTCAGCCG CAGCTTTCAC TGGCTCCCGT TTCGCTCGGC
AACATGGATG CCATTGACGT GGTACGCGGT GGTGGTGCGG TGCGTTACGG ACCGCAGAGC
GTGGGCGGCG TGGTGAACTT TGTTACCCGT GCCATTCCGC AGGACTTTGG TATCGAGGCG
GGCGTGGAAG GTCAGCTCAG CCCAACCTCT TCACAAAACA ACCCGAAAGA GACGCACAAC
CTGATGGTGG GCGGCACAGC GGACAACGGT TTTGGCACCG CGCTGCTCTA CTCCGGCACG
CGCGGCAGTG ACTGGCGCGA GCACAGCGCC ACCCGCATCG ACGACCTGAT GCTGAAAAGC
AAATATGCGC CGGATGAGGT GCACACCTTC AACAGCCTGC TGCAATATTA CGACGGTGAA
GCCGACATGC CCGGTGGCCT GTCTCGCGCG GATTACGACG CCGATCGCTG GCAATCCACC
CGCCCGTATG ACCGCTTCTG GGGTCGTCGC AAGCTGGCGA GCCTGGGCTA CCAGTTCCAG
CCAGACAGCC AGCATAAATT CAACATTCAG GGGTTCTACA CCCAAACCCT GCGCAGCGGC
TACCTGGAGC AAGGCAAACG CATCACCCTC TCGCCGCGTA ACTACTGGGT GCGCGGTATT
GAGCCACGCT ACAGCCAGAT CTTTATGATC GGCCCTTCCG CGCACGAAGT GGGCGTGGGC
TATCGCTATT TGAATGAATC AACGCATGAA ATGCGTTACT ACACCGCCAC CAGCAGCGGG
CAGTTGCCGT CCGGCTCAAG CCCTTACGAC CGCGATACGC GTTCCGGCAC CGAGGCGCAC
GCCTGGTATC TGGATGACAA AATCGACATC GGCAACTGGA CCATCACGCC GGGTATGCGT
TTCGAACATA TCGAGTCATA CCAGAACAAC GCCATCACAG GCACGCACGA AGAAGTGAGC
TATAACGCAC CGCTTCCGGC GTTGAACGTG CTCTATCACC TGACTGACAG CTGGAATCTT
TATGCAAACA CTGAAGGCTC GTTCGGCACC GTACAGTACA GCCAGATTGG CAAGGCTGTG
CAAAGCGGCA ATGTTGAACC GGAAAAAGCG CGAACCTGGG AACTCGGTAC CCGCTACGAC
GACGGCGCGC TGACGGCGGA AATGGGGCTG TTCCTGATTA ACTTTAACAA TCAGTACGAC
TCCAACCAGA CCAACGACAC CGTCACTGCA CGTGGCAAAA CGCGCCATAC CGGGCTGGAA
ACGCAGGCAC GTTACGATCT GGGTACGCTA ACGCCAACGC TTGATAACGT TTCCATCTAC
GCCAGCTATG CGTATGTGAA CGCGGAAATC CGCGAGAAAG GCGACACCTA CGGCAATCTG
GTACCATTCT CCCCGAAACA TAAAGGCACG CTGGGCGTGG ACTACAAGCC AGGAAACTGG
ACGTTCAATC TGAACAGCGA TTTCCAGTCC AGCCAGTTTG CGGATAACGC CAATACGGTG
AAAGAGAGCG CCGACGGCAG TACCGGCCGC ATTCCCGGCT TCATGCTCTG GGGCGCACGC
GTGGCGTATG ACTTTGGCCC GCAGATGGCA GATCTGAACC TGGCATTCGG TGTGAAAAAC
ATCTTCGACC AGGACTACTT CATCCGCTCT TATGACGACA ACAACAAAGG CATCTATGCA
GGCCAGCCGC GCACGCTGTA TATGCAGGGG TCGTTGAAGT TCTGA
 
Protein sequence
MTPLRVFRKT TPLVNTIRLS LLPLAGLSFS AFAAQVNIAP GSLDKALNQY AAHSGFTLSV 
DASLTRGKQS NGLHGDYDVE SGLQQLLDGS GLQVKPLGNN SWTLEPAPAP KEDALTVVGD
WLGDARENDV FEHAGARDVI RREDFAKTGA TTMREVLNRI PGVSAPENNG TGSHDLAMNF
GIRGLNPRLA SRSTVLMDGI PVPFAPYGQP QLSLAPVSLG NMDAIDVVRG GGAVRYGPQS
VGGVVNFVTR AIPQDFGIEA GVEGQLSPTS SQNNPKETHN LMVGGTADNG FGTALLYSGT
RGSDWREHSA TRIDDLMLKS KYAPDEVHTF NSLLQYYDGE ADMPGGLSRA DYDADRWQST
RPYDRFWGRR KLASLGYQFQ PDSQHKFNIQ GFYTQTLRSG YLEQGKRITL SPRNYWVRGI
EPRYSQIFMI GPSAHEVGVG YRYLNESTHE MRYYTATSSG QLPSGSSPYD RDTRSGTEAH
AWYLDDKIDI GNWTITPGMR FEHIESYQNN AITGTHEEVS YNAPLPALNV LYHLTDSWNL
YANTEGSFGT VQYSQIGKAV QSGNVEPEKA RTWELGTRYD DGALTAEMGL FLINFNNQYD
SNQTNDTVTA RGKTRHTGLE TQARYDLGTL TPTLDNVSIY ASYAYVNAEI REKGDTYGNL
VPFSPKHKGT LGVDYKPGNW TFNLNSDFQS SQFADNANTV KESADGSTGR IPGFMLWGAR
VAYDFGPQMA DLNLAFGVKN IFDQDYFIRS YDDNNKGIYA GQPRTLYMQG SLKF