Gene EcDH1_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3688 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3975040 
End bp3976146 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content44% 
IMG OID 
ProductKelch repeat-containing protein 
Protein accessionACX41298 
Protein GI260450876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA CAATAACGGC GCTTGCTATC ATGATGGCTT CATTTGCCGC AAACGCGTCT 
GTATTACCGG AAACTCCTGT GCCATTTAAA AGTGGTACCG GAGCAATTGA TAACGACACT
GTCTACATTG GTTTAGGTAG CGCAGGTACG GCATGGTACA AGCTGGATAC ACAGGCCAAA
GATAAAAAAT GGACAGCGTT AGCTGCATTC CCTGGCGGAC CAAGAGATCA AGCAACCTCT
GCATTTATTG ATGGCAATCT GTATGTGTTT GGCGGCATTG GCAAAAACAG CGAGGGCTTG
ACTCAGGTAT TTAATGACGT ACACAAATAC AACCCCAAAA CCAATAGTTG GGTTAAATTG
ATGTCGCACG CGCCGATGGG CATGGCGGGC CATGTGACTT TTGTACACAA CGGCAAGGCT
TATGTTACTG GTGGTGTTAA CCAGAATATC TTCAATGGCT ATTTTGAAGA TCTCAACGAG
GCTGGAAAAG ATTCAACCGC TATAGATAAA ATCAATGCTC ACTATTTTGA CAAAAAAGCA
GAAGATTATT TCTTCAATAA GTTTCTGTTG TCTTTTGATC CCTCAACACA GCAATGGAGT
TACGCTGGCG AATCGCCCTG GTACGGAACG GCTGGTGCGG CGGTTGTGAA TAAAGGTGAT
AAAACCTGGC TTATTAATGG CGAAGCCAAA CCAGGATTGC GAACGGATGC CGTATTTGAA
CTTGATTTCA CCGGTAATAA TTTAAAATGG AATAAGCTTG CTCCCGTCTC ATCACCAGAT
GGCGTAGCTG GCGGTTTTGC GGGGATAAGC AATGATTCTC TTATATTTGC CGGAGGGGCC
GGATTCAAAG GTTCACGAGA AAATTACCAG AACGGTAAGA ACTATGCGCA TGAAGGCCTG
AAAAAATCAT ATAGCACTGA TATTCATCTT TGGCATAACG GGAAATGGGA TAAATCGGGT
GAATTATCGC AAGGTCGGGC CTACGGAGTA TCATTGCCCT GGAATAATAG TCTATTGATT
ATTGGCGGTG AAACTGCAGG CGGCAAAGCG GTGACGGATT CAGTTTTGAT CACTGTGAAG
GATAATAAAG TCACAGTACA AAACTAA
 
Protein sequence
MNKTITALAI MMASFAANAS VLPETPVPFK SGTGAIDNDT VYIGLGSAGT AWYKLDTQAK 
DKKWTALAAF PGGPRDQATS AFIDGNLYVF GGIGKNSEGL TQVFNDVHKY NPKTNSWVKL
MSHAPMGMAG HVTFVHNGKA YVTGGVNQNI FNGYFEDLNE AGKDSTAIDK INAHYFDKKA
EDYFFNKFLL SFDPSTQQWS YAGESPWYGT AGAAVVNKGD KTWLINGEAK PGLRTDAVFE
LDFTGNNLKW NKLAPVSSPD GVAGGFAGIS NDSLIFAGGA GFKGSRENYQ NGKNYAHEGL
KKSYSTDIHL WHNGKWDKSG ELSQGRAYGV SLPWNNSLLI IGGETAGGKA VTDSVLITVK
DNKVTVQN