Gene EcDH1_1256 details

Gene Information

Locus tag: EcDH1_1256
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1352987
End bp: 1354243
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: nucleoside transporter
Protein accession: ACX38930
Protein GI: 260448508
COG category:
COG ID:
TIGRFAM ID:
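
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that does not count toward the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1352987, 1354243)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Both results match the Gene Length (1257 bp) and Protein Length (418 aa) fields of this record.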


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00000485515
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATCG CGATGCGCTT AAAGGTAATG TCCTTTTTGC AATATTTTAT CTGGGGGAGC 
TGGCTGGTTA CCCTCGGCTC TTACATGATT AATACTCTTC ATTTCACCGG CGCTAATGTT
GGCATGGTTT ACAGTTCCAA AGGGATCGCC GCGATTATTA TGCCTGGTAT AATGGGGATC
ATCGCAGACA AATGGCTGCG CGCAGAACGT GCATACATGC TGTGTCACCT GGTGTGTGCG
GGCGTACTTT TTTATGCGGC ATCCGTAACT GATCCGGATA TGATGTTTTG GGTGATGTTA
GTCAATGCGA TGGCGTTTAT GCCGACTATT GCGTTATCGA ACAGCGTCTC TTATTCCTGT
CTTGCCCAGG CAGGGCTTGA CCCGGTGACC GCTTTCCCGC CCATTCGCGT TTTTGGTACG
GTGGGGTTCA TTGTCGCGAT GTGGGCAGTA AGCCTGCTGC ATCTGGAATT GAGTAGTCTG
CAGCTGTATA TCGCGTCCGG TGCGTCATTG CTGCTGTCGG CTTATGCGCT GACTTTGCCG
AAGATTCCGG TTGCGGAGAA AAAAGCGACC ACATCGCTTG CCAGCAAGCT GGGTCTGGAT
GCCTTCGTGC TGTTTAAAAA TCCACGCATG GCCATCTTTT TCCTCTTTGC CATGATGCTG
GGTGCGGTAC TGCAAATTAC CAACGTTTTT GGTAATCCGT TCCTACATGA TTTCGCCCGT
AACCCGGAGT TTGCTGACAG TTTTGTGGTG AAATATCCCT CCATTTTACT GTCAGTTTCA
CAGATGGCAG AAGTGGGCTT TATACTGACT ATCCCATTCT TTTTAAAGCG ATTTGGCATT
AAAACCGTCA TGCTGATGAG TATGGTGGCC TGGACGCTGC GCTTTGGCTT CTTCGCCTAT
GGCGATCCGT CAACAACCGG ATTTATTTTG CTGCTGCTGT CGATGATTGT TTATGGCTGT
GCATTCGATT TCTTCAATAT TTCTGGTTCG GTATTTGTCG AACAGGAAGT TGATTCCAGC
ATTCGTGCCA GCGCGCAGGG GCTCTTTATG ACCATGGTAA ATGGTGTCGG CGCATGGGTT
GGCTCGATTC TGAGTGGCAT GGCAGTAGAT TACTTTTCGG TGGATGGCGT AAAAGACTGG
CAAACTATCT GGCTGGTGTT TGCAGGATAT GCTCTTTTTC TCGCAGTGAT ATTTTTCTTT
GGGTTTAAAT ATAATCATGA CCCTGAAAAG ATAAAGCATC GAGCGGTGAC TCATTAA
 
Protein sequence
MSIAMRLKVM SFLQYFIWGS WLVTLGSYMI NTLHFTGANV GMVYSSKGIA AIIMPGIMGI 
IADKWLRAER AYMLCHLVCA GVLFYAASVT DPDMMFWVML VNAMAFMPTI ALSNSVSYSC
LAQAGLDPVT AFPPIRVFGT VGFIVAMWAV SLLHLELSSL QLYIASGASL LLSAYALTLP
KIPVAEKKAT TSLASKLGLD AFVLFKNPRM AIFFLFAMML GAVLQITNVF GNPFLHDFAR
NPEFADSFVV KYPSILLSVS QMAEVGFILT IPFFLKRFGI KTVMLMSMVA WTLRFGFFAY
GDPSTTGFIL LLLSMIVYGC AFDFFNISGS VFVEQEVDSS IRASAQGLFM TMVNGVGAWV
GSILSGMAVD YFSVDGVKDW QTIWLVFAGY ALFLAVIFFF GFKYNHDPEK IKHRAVTH
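
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any computation. The following is a minimal sketch, not part of the original record, of how one might strip the block formatting and compute length, GC percentage, and a stop-codon check. The helper names are hypothetical. The stop codons used are TAA, TAG, and TGA, as in translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases are TAA, TAG, or TGA."""
    return seq[-3:] in {"TAA", "TAG", "TGA"}


# First line of the gene sequence from this record, used only as sample input.
first_line = "ATGAGCATCG CGATGCGCTT AAAGGTAATG TCCTTTTTGC AATATTTTAT CTGGGGGAGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` gives a value that can be compared against the GC content field of the record.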