Gene EcDH1_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3345 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3595426 
End bp3596853 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content54% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX40965 
Protein GI260450543 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCATAC GTTTAGAAGG TTATATGCAA ACAACACAAC AAAATGCGCC ACTGAAGCGC 
ACAATGAAAA CGCGTCACCT GATTATGCTT TCCTTGGGCG GCGTGATTGG CACAGGATTA
TTCTTCAATA CCGGGTACAT CATTTCCACC ACTGGAGCGG CGGGAACGCT GCTGGCCTAT
CTGATTGGTG CGCTGGTGGT CTGGCTGGTT ATGCAGTGTC TGGGCGAGCT GTCGGTCGCG
ATGCCGGAGA CCGGAGCGTT TCACGTTTAT GCCGCGCGCT ATCTTGGTCC GGCTACCGGG
TATACCGTGG CCTGGCTTTA CTGGCTGACC TGGACCGTGG CGCTGGGTTC GAGCTTTACC
GCCGCTGGAT TCTGTATGCA GTACTGGTTT CCACAGGTGC CGGTATGGGT CTGGTGCGTG
GTGTTCTGCG CGATTATTTT TGGTCTGAAT GTTATCTCCA CGCGCTTTTT TGCCGAAGGG
GAGTTCTGGT TCTCGCTGGT CAAAGTGGTC ACTATCATCG CCTTTATCAT CCTCGGTGGG
GCGGCGATTT TCGGCTTTAT TCCGATGCAG GATGGCTCGC CCGCGCCGGG GCTGAGTAAT
ATCACGGCAG AAGGCTGGTT CCCGCACGGT GGCTTACCGA TTTTGATGAC TATGGTGGCA
GTGAACTTTG CTTTTTCGGG TACCGAGCTT ATCGGCATTG CCGCCGGTGA AACGGAAAAC
CCGCGCAAAG TTATCCCGGT AGCGATTCGT ACTACCATCG CGCGACTGAT TATTTTCTTT
ATCGGCACCG TGTTTGTGCT GGCAGCGCTG ATCCCGATGC AGCAGGTGGG CGTGGAGAAA
AGCCCGTTTG TGCTGGTATT TGAGAAAGTA GGGATCCCGT ACGCCGCTGA TATTTTTAAC
TTCGTGATCC TGACGGCTAT TCTTTCTGCA GCGAACTCCG GGTTATATGC CTCCGGGCGC
ATGCTGTGGT CGTTGTCGAA TGAACGTACG CTACCGGCCT GTTTTGCGCG AGTAACGAAA
AACGGCGTGC CACTGACGGC GCTGTCGGTC AGTATGCTCG GTGGTGTGCT GGCGCTGTTT
TCCAGCGTGG TGGCCCCGGA CACGGTATTT GTTGCGCTGT CGGCAATCTC CGGGTTTGCG
GTGGTAGCGG TGTGGCTGAG TATCTGCGCC TCGCATTTTG TTTTTCGTCG CCGTCATCTG
CAACAAGGTA AGGCATTGAG TGAATTACAT TATCGCGCGC CGTGGTATCC GCTGGTGCCA
GTATTAGGTT TTGTGCTGTG CCTGGTGGCC TGTGTTGGGC TGGCATTCGA TCCAGCGCAG
AGAATTGCGT TGTGGTGCGG GTTACCGTTT GTTGCGTTGT GCTATGGTGC TTATTTCCTT
ACTCAACCCC GAAACGCAAA ACAGGAGCCA GAACATGTCG CAGAATAA
 
Protein sequence
MSIRLEGYMQ TTQQNAPLKR TMKTRHLIML SLGGVIGTGL FFNTGYIIST TGAAGTLLAY 
LIGALVVWLV MQCLGELSVA MPETGAFHVY AARYLGPATG YTVAWLYWLT WTVALGSSFT
AAGFCMQYWF PQVPVWVWCV VFCAIIFGLN VISTRFFAEG EFWFSLVKVV TIIAFIILGG
AAIFGFIPMQ DGSPAPGLSN ITAEGWFPHG GLPILMTMVA VNFAFSGTEL IGIAAGETEN
PRKVIPVAIR TTIARLIIFF IGTVFVLAAL IPMQQVGVEK SPFVLVFEKV GIPYAADIFN
FVILTAILSA ANSGLYASGR MLWSLSNERT LPACFARVTK NGVPLTALSV SMLGGVLALF
SSVVAPDTVF VALSAISGFA VVAVWLSICA SHFVFRRRHL QQGKALSELH YRAPWYPLVP
VLGFVLCLVA CVGLAFDPAQ RIALWCGLPF VALCYGAYFL TQPRNAKQEP EHVAE