Gene EcDH1_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3559 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3829446 
End bp3830960 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content52% 
IMG OID 
Productcholine/carnitine/betaine transporter 
Protein accessionACX41173 
Protein GI260450751 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATG AAAAGAGAAA AACGGGAATA GAACCGAAGG TTTTCTTTCC GCCGTTAATA 
ATCGTCGGCA TACTTTGTTG GCTTACAGTC AGAGATCTGG ATGCAGCGAA TGTCGTTATT
AATGCTGTAT TCAGTTACGT CACCAATGTA TGGGGATGGG CATTTGAATG GTATATGGTG
GTGATGCTTT TCGGTTGGTT CTGGCTGGTG TTTGGCCCGT ATGCCAAAAA GCGTTTAGGT
AACGAACCGC CAGAATTTAG CACCGCCAGT TGGATCTTTA TGATGTTCGC CTCCTGTACG
TCTGCTGCCG TACTGTTCTG GGGATCGATT GAGATCTACT ACTACATCTC CACCCCGCCG
TTTGGCTTAG AACCGAACTC GACAGGGGCG AAAGAGTTGG GGCTGGCTTA CAGCTTGTTC
CACTGGGGAC CTCTGCCGTG GGCCACTTAC AGCTTCCTTT CAGTCGCCTT CGCTTACTTC
TTCTTTGTCC GCAAAATGGA AGTGATTCGC CCCAGCTCGA CACTGGTGCC GCTGGTAGGT
GAAAAACACG CCAAAGGGTT GTTCGGCACT ATCGTCGACA ACTTCTATCT CGTCGCCTTG
ATCTTCGCGA TGGGTACCAG TCTGGGCCTT GCCACGCCGC TGGTGACCGA GTGTATGCAA
TGGTTGTTTG GCATTCCGCA TACCCTGCAA CTGGACGCTA TCATCATTAC CTGCTGGATT
ATCCTCAACG CCATTTGCGT CGCTTGCGGT CTGCAAAAAG GGGTACGTAT CGCCAGTGAC
GTGCGTAGTT ACCTGAGCTT CCTGATGCTG GGTTGGGTGT TCATTGTCAG CGGTGCCAGC
TTCATCATGA ACTACTTCAC CGATTCGGTG GGGATGTTGC TGATGTATCT GCCGCGCATG
TTGTTCTATA CCGATCCCAT CGCTAAAGGC GGCTTCCCGC AGGGCTGGAC CGTGTTCTAC
TGGGCATGGT GGGTGATTTA TGCTATCCAG ATGAGTATCT TCCTCGCCCG CATCTCCCGT
GGTCGTACTG TGCGTGAACT GTGCTTCGGC ATGGTGCTGG GGCTGACAGC GTCAACCTGG
ATCCTGTGGA CTGTACTCGG TAGTAACACT CTGCTGTTGA TAGATAAAAA CATCATCAAC
ATTCCAAATC TGATCGAACA GTACGGTGTG GCGCGCGCCA TCATTGAAAC CTGGGCCGCT
CTGCCACTCA GCACCGCCAC CATGTGGGGC TTCTTCATCC TCTGCTTTAT TGCCACCGTT
ACGCTGGTTA ACGCCTGCTC TTATACCCTG GCGATGTCCA CTTGCCGCGA AGTACGCGAT
GGTGAAGAAC CACCTCTGCT GGTGCGTATC GGTTGGTCAA TTCTGGTTGG CATTATCGGT
ATTGTTCTGC TGGCGCTCGG CGGCCTGAAA CCGATTCAAA CCGCCATTAT CGCCGGAGGA
TGCCCGCTGT TCTTCGTCAA CATTATGGTG ACGCTCTCCT TTATTAAAGA CGCGAAACAG
AACTGGAAAG ATTAA
 
Protein sequence
MKNEKRKTGI EPKVFFPPLI IVGILCWLTV RDLDAANVVI NAVFSYVTNV WGWAFEWYMV 
VMLFGWFWLV FGPYAKKRLG NEPPEFSTAS WIFMMFASCT SAAVLFWGSI EIYYYISTPP
FGLEPNSTGA KELGLAYSLF HWGPLPWATY SFLSVAFAYF FFVRKMEVIR PSSTLVPLVG
EKHAKGLFGT IVDNFYLVAL IFAMGTSLGL ATPLVTECMQ WLFGIPHTLQ LDAIIITCWI
ILNAICVACG LQKGVRIASD VRSYLSFLML GWVFIVSGAS FIMNYFTDSV GMLLMYLPRM
LFYTDPIAKG GFPQGWTVFY WAWWVIYAIQ MSIFLARISR GRTVRELCFG MVLGLTASTW
ILWTVLGSNT LLLIDKNIIN IPNLIEQYGV ARAIIETWAA LPLSTATMWG FFILCFIATV
TLVNACSYTL AMSTCREVRD GEEPPLLVRI GWSILVGIIG IVLLALGGLK PIQTAIIAGG
CPLFFVNIMV TLSFIKDAKQ NWKD