Gene EcDH1_3532 details

Gene Information

Locus tag: EcDH1_3532
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3796856
End bp: 3798466
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: thiamine ABC transporter, inner membrane subunit
Protein accession: ACX41146
Protein GI: 260450724
COG category:
COG ID:
TIGRFAM ID:
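
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3796856
end_bp = 3798466
gene_length_bp = 1611
protein_length_aa = 536

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode an amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # coordinates agree with Gene Length
print(span_bp % 3 == 0)                              # CDS is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks pass for this record.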


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCACC 
ACGCTGGTGG TAGCGGTTGC GCTGGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GATGACTGGG TGGCAGTCTG GCAGGACAGC TATCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC ACTGCTCTCT GTCATACCCG CGATATTCCT CGCCCGCGCG
CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGTGC AATGACCTTG
ATCCTCCCGG TGTTGGTCGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG CCAGGGCTGG
CTGGCAACAC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGTATTTTGC TGGCCCATGT GTTTTTTAAT CTGCCGATGG CGAGCCGCTT ATTACTCCAG
GCACTGGAAA ACATCCCCGG CGAACAGCGT CAACTTGCCG CCCAGCTTGG GATGCGTAGC
TGGCATTTTT TCCGCTTCGT CGAATGGCCG TGGTTACGGC GACAAATCCC GCCGGTTGCT
GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTATCGCT GGGGGGCGGT
CCGCAGGCGA CCACTATCGA GCTGGCAATC TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CAATGCTGGC GCTGCTCCAG ATGGTGTGCT GCCTCGGGCT GGTGCTGTTG
AGTCAGCGAT TGAGTAAGGC CATTGCGCCC GGCACCACGC TGCTGCAAGG CTGGCGCGAC
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATTGTGCT GGCGCTGCTG
CTGTTGCTGC CACCGTTACT GGCGGTGATC GTCGATGGGG TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTGCGG
GCGCGGCAGA AAATGCTGGC GGGTCAGGTG CTGGAGATGA GCGGCATGTT GATCCTCGCC
ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC TATCGGCCTG
CCACAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTGC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGCAT GTTATGTCAG
TCGCTGGGGA TTGAAGGCTG GTCACGCTTA AAAGTGGTGG AGCTGCGCGC CCTGAAACGT
CCACTGGCGC AGGCGCTGGC CTTTGCATGC GTGCTGTCGA TTGGTGATTT TGGCGTGGTG
GCGTTGTTCG GTAACGATGA TTTCCGCACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG TGCGGTCACC GCGTTAATTC TGCTGCTGCT CTGTTTTCTG
CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
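
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of one way to do that and to compute a GC fraction; `gc_content` is a hypothetical helper name, and the example input is only the first ten-base block of the sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence, used as a small worked example.
first_block = "ATGGCAACGC"
print(f"{gc_content(first_block):.0%}")  # prints 60%
```

Passing the whole sequence block (all lines joined into one string) to the same function gives the gene-wide value to compare with the GC content field.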
 
Protein sequence
MATRRQPLIP GWLIPGVSAT TLVVAVALAA FLALWWNAPQ DDWVAVWQDS YLWHVVRFSF 
WQAFLSALLS VIPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LATLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRS
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALLQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGVNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQV LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RNVKTD
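
The relationship between the two sequences can be illustrated by translating the start of the gene. This is a sketch only: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third position).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGGCAACGC GCCGTCAGCC GTTAATTCCC"))  # prints MATRRQPLIP
```

The output matches the first ten residues of the protein sequence above.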