Gene EcDH1_2006 details


Gene Information

Locus tag: EcDH1_2006
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2164207
End bp: 2165709
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: amino acid/peptide transporter
Protein accession: ACX39663
Protein GI: 260449241
COG category:
COG ID:
TIGRFAM ID:
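
The length fields above agree with the coordinates if positions are read as 1-based and inclusive: 2165709 - 2164207 + 1 = 1503 bp, and 1503 / 3 = 501 codons, the last of which is the stop codon, leaving 500 aa. A minimal Python sketch of that arithmetic follows. The function names are illustrative only, and the inclusive-coordinate convention is an assumption that the record itself does not state.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2164207, 2165709)
print(length, protein_length(length))  # prints: 1503 500
```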


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0749107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCACTG CAAACCAAAA ACCAACTGAA AGCGTCAGTT TGAACGCTTT CAAACAACCG 
AAGGCGTTCT ATCTCATCTT CTCGATTGAG TTATGGGAAC GTTTTGGTTA TTACGGCCTA
CAAGGAATTA TGGCTGTTTA CCTGGTTAAA CAACTGGGTA TGTCTGAAGC GGATTCAATC
ACCCTTTTCT CTTCCTTTAG TGCCCTGGTT TATGGTCTGG TCGCTATCGG CGGCTGGTTA
GGTGACAAGG TACTGGGTAC TAAACGCGTA ATTATGCTCG GCGCTATTGT GCTGGCGATT
GGTTATGCTC TGGTTGCCTG GTCTGGTCAC GACGCCGGTA TCGTTTATAT GGGTATGGCG
GCTATTGCGG TCGGTAACGG CCTGTTTAAA GCTAACCCGT CTTCTCTGCT TTCTACATGC
TATGAGAAAA ACGACCCGCG TCTGGACGGT GCATTCACCA TGTACTACAT GTCCGTCAAC
ATCGGCTCTT TCTTCTCTAT GATTGCTACG CCGTGGCTGG CCGCGAAATA CGGCTGGAGT
GTTGCGTTTG CGTTGAGCGT TGTAGGCCTG CTGATCACTA TCGTTAACTT CGCCTTCTGC
CAACGCTGGG TTAAACAGTA CGGTTCAAAA CCAGACTTCG AGCCTATCAA CTACCGTAAC
CTGCTGCTGA CCATTATTGG TGTTGTGGCA CTGATCGCTA TCGCCACCTG GCTGCTGCAC
AATCAGGAAG TTGCGCGTAT GGCGCTGGGC GTTGTTGCCT TCGGTATCGT GGTTATCTTC
GGTAAAGAAG CCTTCGCGAT GAAAGGTGCT GCGCGTCGTA AAATGATCGT TGCCTTCATC
CTGATGCTCG AAGCCATTAT CTTCTTCGTG CTGTACAGCC AGATGCCAAC GTCACTGAAC
TTCTTTGCGA TTCGTAACGT TGAGCACTCC ATTCTGGGTC TGGCCGTAGA ACCTGAGCAG
TATCAGGCAC TGAACCCGTT CTGGATCATC ATCGGTAGTC CGATTCTGGC CGCTATCTAT
AACAAGATGG GCGATACCCT GCCGATGCCA ACCAAGTTTG CAATCGGCAT GGTGATGTGT
TCTGGTGCGT TCCTGATTCT GCCGCTGGGT GCGAAATTCG CGTCTGACGC TGGTATCGTG
TCTGTAAGCT GGCTGGTCGC AAGCTATGGC CTGCAGAGCA TCGGGGAACT GATGATCTCT
GGTCTGGGTC TGGCAATGGT TGCTCAACTC GTTCCGCAGC GTCTGATGGG CTTCATTATG
GGTAGCTGGT TCCTGACCAC TGCCGGTGCA AACCTGATTG GTGGTTATGT TGCGGGTATG
ATGGCTGTGC CGGATAACGT TACCGATCCG CTGATGTCAC TGGAAGTCTA TGGTCGCGTA
TTCTTGCAGA TTGGTGTCGC TACTGCCGTT ATTGCAGTAC TGATGCTGCT GACCGCGCCG
AAACTGCACC GCATGACGCA GGATGACGCT GCAGACAAAG CGGCGAAAGC AGCCGTAGCG
TAA
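
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any per-base statistic, such as the GC content reported in the Gene Information section, can be recomputed. The sketch below is one possible way to do that in Python. The helper names are hypothetical, and how the record rounds its percentage is not stated, so the function returns the unrounded value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the sequence (unrounded)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence above: 3 G + 3 C out of 10 bases.
print(gc_percent("GTGTCCACTG"))  # prints: 60.0
```

Passing the full text of the gene sequence, line breaks included, to `gc_percent` gives the value for the whole gene.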
 
Protein sequence
MSTANQKPTE SVSLNAFKQP KAFYLIFSIE LWERFGYYGL QGIMAVYLVK QLGMSEADSI 
TLFSSFSALV YGLVAIGGWL GDKVLGTKRV IMLGAIVLAI GYALVAWSGH DAGIVYMGMA
AIAVGNGLFK ANPSSLLSTC YEKNDPRLDG AFTMYYMSVN IGSFFSMIAT PWLAAKYGWS
VAFALSVVGL LITIVNFAFC QRWVKQYGSK PDFEPINYRN LLLTIIGVVA LIAIATWLLH
NQEVARMALG VVAFGIVVIF GKEAFAMKGA ARRKMIVAFI LMLEAIIFFV LYSQMPTSLN
FFAIRNVEHS ILGLAVEPEQ YQALNPFWII IGSPILAAIY NKMGDTLPMP TKFAIGMVMC
SGAFLILPLG AKFASDAGIV SVSWLVASYG LQSIGELMIS GLGLAMVAQL VPQRLMGFIM
GSWFLTTAGA NLIGGYVAGM MAVPDNVTDP LMSLEVYGRV FLQIGVATAV IAVLMLLTAP
KLHRMTQDDA ADKAAKAAVA
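
The protein sequence starts with M although the gene sequence starts with GTG rather than ATG. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG is an accepted alternative start codon and is read as methionine at the initiation position. The following Python sketch translates a block-formatted CDS on that basis. The function and constant names are illustrative, the codon-to-amino-acid mapping is that of the standard code (which table 11 shares), and the input is assumed to be unambiguous A/C/G/T in the coding orientation.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break  # stop codon ends the protein
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "GTGTCCACTG CAAACCAAAA ACCAACTGAA AGCGTCAGTT TGAACGCTTT CAAACAACCG"
print(translate_cds(first_line))  # prints: MSTANQKPTESVSLNAFKQP
```

The printed residues match the first two blocks of the protein sequence (MSTANQKPTE SVSLNAFKQP).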