Gene EcDH1_3292 details


Gene Information

Locus tag: EcDH1_3292
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3539459
End bp: 3541492
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: choline/carnitine/betaine transporter
Protein accession: ACX40916
Protein GI: 260450494
COG category:
COG ID:
TIGRFAM ID:
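
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3539459, 3541492)
print(length_bp, protein_length(length_bp))  # prints "2034 677"
```

Under these assumptions the computed values agree with the 2034 bp and 677 aa listed in the record.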


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGACC TTTCACACAG CAGGGAAAAG GACAAAATCA ATCCGGTGGT GTTTTACACC 
TCCGCCGGAC TGATTTTGTT GTTTTCCCTG ACAACGATCC TGTTTCGCGA CTTCTCGGCC
CTGTGGATTG GCCGCACGCT GGACTGGGTT TCTAAAACCT TCGGTTGGTA CTATCTGCTG
GCGGCAACGC TCTATATTGT CTTTGTGGTC TGTATCGCTT GTTCGCGTTT TGGTTCGGTG
AAGCTCGGGC CAGAACAATC CAAACCGGAA TTCAGCCTGC TGAGTTGGGC GGCGATGCTG
TTTGCTGCCG GGATCGGTAT CGACCTGATG TTCTTCTCCG TAGCCGAACC GGTAACGCAG
TATATGCAGC CGCCGGAAGG CGCGGGACAG ACGATTGAGG CCGCGCGTCA GGCGATGGTC
TGGACGCTGT TTCACTACGG CTTAACCGGC TGGTCGATGT ATGCGCTGAT GGGCATGGCG
CTCGGATACT TTAGCTATCG TTATAATTTG CCGCTCACCA TCCGCTCGGC GCTGTACCCG
ATCTTCGGTA AACGGATTAA CGGGCCGATA GGTCACTCAG TGGATATTGC AGCGGTGATC
GGCACTATCT TCGGTATTGC CACTACGCTC GGTATCGGTG TGGTGCAGCT TAACTATGGC
TTGAGCGTAC TGTTTGATAT TCCCGATTCG ATGGCGGCGA AAGCGGCACT GATCGCCTTG
TCGGTGATAA TCGCCACGAT CTCTGTCACC TCCGGTGTCG ATAAGGGCAT TCGCGTGTTA
TCGGAGCTTA ATGTCGCGCT GGCGCTGGGA TTGATCCTGT TCGTATTGTT TATGGGCGAC
ACTTCGTTCC TGCTTAATGC ACTGGTGCTG AATGTTGGCG ACTATGTGAA TCGCTTTATG
GGCATGACGC TCAACAGTTT TGCCTTCGAC CGTCCGGTTG AGTGGATGAA TAACTGGACG
CTCTTCTTCT GGGCATGGTG GGTGGCATGG TCGCCGTTTG TCGGCTTGTT CCTGGCGCGT
ATCTCGCGTG GGCGTACCAT TCGCCAGTTC GTGCTGGGCA CGTTGATTAT TCCGTTTACC
TTCACGCTGT TATGGCTCTC GGTGTTCGGC AATAGCGCGC TGTATGAAAT CATCCACGGC
GGCGCGGCAT TTGCCGAGGA AGCGATGGTC CATCCGGAGC GCGGCTTCTA CAGCCTGCTG
GCGCAGTATC CGGCGTTTAC CTTTAGCGCC TCCGTCGCCA CCATTACTGG CCTGCTGTTT
TATGTGACCT CGGCGGACTC CGGGGCGCTG GTGCTGGGGA ATTTCACCTC GCAGCTTAAA
GATATCAACA GCGACGCCCC CGGCTGGCTG CGCGTCTTCT GGTCGGTGGC GATTGGCCTG
CTGACGCTCG GCATGCTGAT GACTAACGGG ATATCCGCGC TGCAAAACAC CACGGTGATT
ATGGGGCTGC CGTTCAGCTT TGTGATCTTC TTCGTGATGG CGGGGTTGTA TAAATCTCTG
AAGGTAGAAG ATTACCGCCG TGAAAGTGCC AACCGCGATA CCGCACCGCG ACCGCTGGGG
CTTCAGGATC GCCTGAGCTG GAAAAAACGT CTCTCGCGCC TGATGAATTA TCCGGGCACG
CGTTACACTA AACAGATGAT GGAGACGGTC TGTTACCCGG CAATGGAAGA AGTGGCGCAG
GAGTTGCGGT TGCGCGGCGC GTACGTGGAG CTAAAAAGCC TGCCACCGGA AGAGGGACAG
CAGTTGGGTC ATCTGGATTT GTTGGTGCAT ATGGGCGAAG AGCAAAACTT TGTCTATCAG
ATTTGGCCGC AGCAATATTC GGTGCCGGGC TTTACCTACC GCGCACGCAG CGGTAAATCG
ACCTACTACC GGCTGGAAAC CTTCCTGTTA GAAGGCAGCC AGGGCAACGA CCTGATGGAC
TACAGCAAAG AGCAGGTGAT CACCGATATT CTTGACCAGT ACGAGCGGCA CCTTAACTTT
ATTCATCTCC ATCGTGAAGC GCCGGGCCAT AGCGTGATGT TCCCGGACGC GTGA
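
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before any calculation. As a minimal sketch, the snippet below computes a GC percentage, assuming GC content is defined as (G + C) / total bases. How the database rounds its reported value is not stated in the record, and the helper names are hypothetical.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases, assuming GC% = (G + C) / length * 100."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# A toy input: two groups with four G/C bases out of eight.
print(gc_percent("GGCC AATT"))  # prints 50.0
```

Passing the full listing above as one multi-line string gives the gene-wide figure to compare with the GC content field.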
 
Protein sequence
MTDLSHSREK DKINPVVFYT SAGLILLFSL TTILFRDFSA LWIGRTLDWV SKTFGWYYLL 
AATLYIVFVV CIACSRFGSV KLGPEQSKPE FSLLSWAAML FAAGIGIDLM FFSVAEPVTQ
YMQPPEGAGQ TIEAARQAMV WTLFHYGLTG WSMYALMGMA LGYFSYRYNL PLTIRSALYP
IFGKRINGPI GHSVDIAAVI GTIFGIATTL GIGVVQLNYG LSVLFDIPDS MAAKAALIAL
SVIIATISVT SGVDKGIRVL SELNVALALG LILFVLFMGD TSFLLNALVL NVGDYVNRFM
GMTLNSFAFD RPVEWMNNWT LFFWAWWVAW SPFVGLFLAR ISRGRTIRQF VLGTLIIPFT
FTLLWLSVFG NSALYEIIHG GAAFAEEAMV HPERGFYSLL AQYPAFTFSA SVATITGLLF
YVTSADSGAL VLGNFTSQLK DINSDAPGWL RVFWSVAIGL LTLGMLMTNG ISALQNTTVI
MGLPFSFVIF FVMAGLYKSL KVEDYRRESA NRDTAPRPLG LQDRLSWKKR LSRLMNYPGT
RYTKQMMETV CYPAMEEVAQ ELRLRGAYVE LKSLPPEEGQ QLGHLDLLVH MGEEQNFVYQ
IWPQQYSVPG FTYRARSGKS TYYRLETFLL EGSQGNDLMD YSKEQVITDI LDQYERHLNF
IHLHREAPGH SVMFPDA
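
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one way to check that correspondence. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons). The table is built by hand here rather than taken from a library, and the names are illustrative.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with their
# one-letter amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate an in-frame DNA listing, dropping any trailing stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein.rstrip("*")


# First 30 bases of the gene sequence listed above.
print(translate("ATGACAGACC TTTCACACAG CAGGGAAAAG"))  # prints MTDLSHSREK
```

The result matches the first ten residues of the protein sequence. The final line of the gene listing is also in frame (it starts at base 1981) and can be checked against the end of the protein in the same way.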