Gene EcDH1_1502 details

Gene Information

Locus tag: EcDH1_1502
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1628327
End bp: 1629796
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: amino acid permease-associated region
Protein accession: ACX39172
Protein GI: 260448750
COG category:
COG ID:
TIGRFAM ID:
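
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1628327
END_BP = 1629796
GENE_LENGTH_BP = 1470
PROTEIN_LENGTH_AA = 489


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose last codon is a stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates span 1470 bp, which corresponds to 490 codons, or 489 amino acids plus one stop codon.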


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000366881
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTTCCG AAACTAAAAC CACAGAAGCG CCGGGCTTAC GCCGTGAATT AAAGGCGCGT 
CACCTGACGA TGATTGCCAT TGGCGGTTCC ATCGGTACAG GTCTTTTTGT TGCCTCTGGC
GCAACGATTT CTCAGGCAGG TCCGGGCGGG GCATTGCTCT CGTATATGCT GATTGGCCTG
ATGGTTTACT TCCTGATGAC CAGTCTCGGT GAACTGGCTG CATATATGCC GGTTTCCGGT
TCGTTTGCCA CTTACGGTCA GAACTATGTT GAAGAAGGCT TTGGCTTCGC GCTGGGCTGG
AACTACTGGT ACAACTGGGC GGTGACTATC GCCGTTGACC TGGTTGCAGC TCAGCTGGTC
ATGAGCTGGT GGTTCCCGGA TACACCGGGC TGGATCTGGA GTGCGTTGTT CCTCGGCGTT
ATCTTCCTGC TGAACTACAT CTCAGTTCGT GGCTTTGGTG AAGCGGAATA CTGGTTCTCA
CTGATCAAAG TCACGACAGT TATTGTCTTT ATCATCGTTG GCGTGCTGAT GATTATCGGT
ATCTTCAAAG GCGCGCAGCC TGCGGGCTGG AGCAACTGGA CAATCGGCGA AGCGCCGTTT
GCTGGTGGTT TTGCGGCGAT GATCGGCGTA GCTATGATTG TCGGCTTCTC TTTCCAGGGA
ACCGAGCTGA TCGGTATTGC TGCAGGCGAG TCCGAAGATC CGGCGAAAAA CATTCCACGC
GCGGTACGTC AGGTGTTCTG GCGAATCCTG TTGTTCTATG TGTTCGCGAT CCTGATTATC
AGCCTGATTA TTCCGTACAC CGATCCGAGC CTGCTGCGTA ACGATGTTAA AGACATCAGC
GTTAGTCCGT TCACCCTGGT GTTCCAGCAC GCGGGTCTGC TCTCTGCGGC GGCGGTGATG
AACGCAGTTA TTCTGACGGC GGTGCTGTCA GCGGGTAACT CCGGTATGTA TGCGTCTACT
CGTATGCTGT ACACCCTGGC GTGTGACGGT AAAGCGCCGC GCATTTTCGC TAAACTGTCG
CGTGGTGGCG TGCCGCGTAA TGCGCTGTAT GCGACGACGG TGATTGCCGG TCTGTGCTTC
CTGACCTCCA TGTTTGGCAA CCAGACGGTA TACCTGTGGC TGCTGAACAC CTCCGGGATG
ACGGGTTTTA TCGCCTGGCT GGGGATTGCC ATTAGCCACT ATCGCTTCCG TCGCGGTTAC
GTATTGCAGG GACACGACAT TAACGATCTG CCGTACCGTT CAGGTTTCTT CCCACTGGGG
CCGATCTTCG CATTCATTCT GTGTCTGATT ATCACTTTGG GCCAGAACTA CGAAGCGTTC
CTGAAAGATA CTATTGACTG GGGCGGCGTA GCGGCAACGT ATATTGGTAT CCCGCTGTTC
CTGATTATTT GGTTCGGCTA CAAGCTGATT AAAGGAACTC ACTTCGTACG CTACAGCGAA
ATGAAGTTCC CGCAGAACGA TAAGAAATAA
 
Protein sequence
MVSETKTTEA PGLRRELKAR HLTMIAIGGS IGTGLFVASG ATISQAGPGG ALLSYMLIGL 
MVYFLMTSLG ELAAYMPVSG SFATYGQNYV EEGFGFALGW NYWYNWAVTI AVDLVAAQLV
MSWWFPDTPG WIWSALFLGV IFLLNYISVR GFGEAEYWFS LIKVTTVIVF IIVGVLMIIG
IFKGAQPAGW SNWTIGEAPF AGGFAAMIGV AMIVGFSFQG TELIGIAAGE SEDPAKNIPR
AVRQVFWRIL LFYVFAILII SLIIPYTDPS LLRNDVKDIS VSPFTLVFQH AGLLSAAAVM
NAVILTAVLS AGNSGMYAST RMLYTLACDG KAPRIFAKLS RGGVPRNALY ATTVIAGLCF
LTSMFGNQTV YLWLLNTSGM TGFIAWLGIA ISHYRFRRGY VLQGHDINDL PYRSGFFPLG
PIFAFILCLI ITLGQNYEAF LKDTIDWGGV AATYIGIPLF LIIWFGYKLI KGTHFVRYSE
MKFPQNDKK
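
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be cleaned, translated and summarised for GC content. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not model alternative start codons. All function names are hypothetical, and only the first and last lines of the gene sequence are used as examples.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, with the
# first base varying slowest. Amino acid assignments for table 11
# (assumed identical to the standard code; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_line = "ATGGTTTCCG AAACTAAAAC CACAGAAGCG"
    last_line = "ATGAAGTTCC CGCAGAACGA TAAGAAATAA"
    print(translate(first_line))  # MVSETKTTEA
    print(translate(last_line))   # MKFPQNDKK
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.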