Gene EcDH1_3298 details


Gene Information

Locus tag: EcDH1_3298
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3547190
End bp: 3548617
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: iron-sulfur cluster binding protein
Protein accession: ACX40922
Protein GI: 260450500
COG category:
COG ID:
TIGRFAM ID:
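
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both are consistent with the listed values, but neither is stated in the record). The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3547190
END_BP = 3548617


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1428
print(protein_length(length_bp))  # 475
```

Both results match the Gene Length (1428 bp) and Protein Length (475 aa) fields.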


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGATCA AAACCAGTAA TACAGATTTT AAGACACGCA TCCGTCAGCA AATTGAAGAT 
CCGATCATGC GCAAAGCGGT GGCAAACGCG CAGCAGCGTA TTGGGGCAAA TCGGCAAAAA
ATGGTCGATG AATTGGGGCA CTGGGAGGAG TGGCGCGATC GGGCCGCCCA GATACGTGAT
CATGTTCTGA GTAATCTCGA CGCTTATCTG TACCAGCTCT CAGAAAAAGT GACGCAAAAC
GGCGGTCACG TCTATTTTGC AAGAACCAAA GAAGACGCTA CCCGCTACAT TTTACAGGTT
GCCCAACGCA AAAATGCCCG GAAGGTGGTG AAATCTAAAT CGATGGTGAC CGAAGAGATT
GGTGTCAATC ATGTGTTGCA GGATGCTGGC ATTCAGGTGA TTGAAACCGA TCTGGGTGAA
TATATTCTCC AGCTGGATCA AGATCCGCCA TCTCATGTTG TGGTCCCGGC AATTCATAAA
GATCGCCATC AGATCCGTCG AGTGCTACAC GAACGTCTGG GCTATGAGGG GCCGGAAACG
CCTGAAGCGA TGACCTTATT CATCCGGCAA AAAATCCGCG AAGATTTCCT CAGTGCTGAA
ATAGGTATTA CCGGCTGTAA TTTCGCGGTG GCAGAGACCG GTTCGGTATG CCTGGTGACC
AATGAAGGTA ATGCGCGAAT GTGTACCACG CTGCCTAAAA CGCATATTGC AGTGATGGGA
ATGGAGCGTA TTGCCCCCAC GTTTGCCGAG GTAGATGTAT TGATCACCAT GCTGGCGCGC
AGTGCCGTTG GTGCACGTTT GACGGGATAC AACACCTGGC TGACAGGACC GCGCGAAGCT
GGGCACGTTG ATGGTCCTGA AGAGTTTCAT CTGGTTATTG TCGATAACGG GCGTTCTGAG
GTGCTGGCCT CTGAATTTCG GGATGTGCTG CGCTGTATTC GCTGCGGGGC TTGTATGAAT
ACTTGTCCGG CATATCGCCA TATTGGCGGT CATGGATATG GCTCTATTTA TCCAGGGCCA
ATTGGTGCGG TGATTTCTCC GCTACTTGGC GGCTATAAAG ATTTTAAAGA TTTACCCTAC
GCCTGCTCTT TATGCACAGC TTGTGACAAC GTGTGTCCGG TGCGTATTCC GCTGTCAAAA
CTGATTTTGC GTCATCGTCG GGTGATGGCT GAAAAAGGGA TCACCGCAAA AGCAGAGCAA
CGGGCGATAA AAATGTTCGC TTATGCCAAT AGTCATCCAG GATTGTGGAA AGTCGGGATG
ATGGCCGGTG CTCATGCGGC AAGCTGGTTT ATCAATGGCG GCAAAACACC ACTCAAATTT
GGCGCGATTA GCGACTGGAT GGAAGCACGC GATCTTCCTG AAGCTGACGG AGAGAGTTTC
CGTAGTTGGT TTAAGAAACA TCAGGCGCAG GAGAAAAAGA ATGGATAA
 
Protein sequence
MSIKTSNTDF KTRIRQQIED PIMRKAVANA QQRIGANRQK MVDELGHWEE WRDRAAQIRD 
HVLSNLDAYL YQLSEKVTQN GGHVYFARTK EDATRYILQV AQRKNARKVV KSKSMVTEEI
GVNHVLQDAG IQVIETDLGE YILQLDQDPP SHVVVPAIHK DRHQIRRVLH ERLGYEGPET
PEAMTLFIRQ KIREDFLSAE IGITGCNFAV AETGSVCLVT NEGNARMCTT LPKTHIAVMG
MERIAPTFAE VDVLITMLAR SAVGARLTGY NTWLTGPREA GHVDGPEEFH LVIVDNGRSE
VLASEFRDVL RCIRCGACMN TCPAYRHIGG HGYGSIYPGP IGAVISPLLG GYKDFKDLPY
ACSLCTACDN VCPVRIPLSK LILRHRRVMA EKGITAKAEQ RAIKMFAYAN SHPGLWKVGM
MAGAHAASWF INGGKTPLKF GAISDWMEAR DLPEADGESF RSWFKKHQAQ EKKNG
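
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a simple GC-fraction helper. The function names are hypothetical, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGTCGATCA AAACCAGTAA TACAGATTTT AAGACACGCA TCCGTCAGCA AATTGAAGAT"
last_line = "CGTAGTTGGT TTAAGAAACA TCAGGCGCAG GAGAAAAAGA ATGGATAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))                # 60
print(head.startswith("ATG"))   # True (start codon)
print(tail[-3:])                # TAA (stop codon)
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.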