Gene EcDH1_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0425 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp447268 
End bp448557 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content50% 
IMG OID 
Productsun protein 
Protein accessionACX38115 
Protein GI260447693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC AACGTAATTT ACGTAGCATG GCGGCCCAGG CCGTTGAACA AGTCGTCGAG 
CAAGGGCAAT CATTAAGCAA CATTCTGCCA CCGCTCCAGC AAAAAGTTTC CGATAAAGAC
AAAGCACTTC TTCAAGAGTT GTGCTTTGGC GTACTGCGTA CGCTTTCGCA GTTAGACTGG
CTGATTAATA AGTTAATGGC CCGTCCGATG ACCGGCAAAC AGCGGACCGT GCATTACCTG
ATTATGGTTG GTTTGTATCA ACTGCTTTAT ACCCGCATTC CACCTCATGC TGCGCTGGCT
GAAACGGTTG AAGGCGCTAT CGCAATTAAG CGTCCGCAAC TTAAAGGGTT GATAAACGGT
GTATTACGCC AGTTCCAGCG TCAGCAAGAA GAGTTATTAG CCGAGTTTAA TGCCAGTGAT
GCACGTTATC TGCATCCTTC CTGGTTGCTG AAGCGTCTGC AAAAAGCGTA TCCAGAGCAG
TGGCAATCCA TCGTCGAAGC CAATAACCAG CGTCCGCCAA TGTGGCTGCG TATTAATCGT
ACGCATCATT CCCGCGACAG CTGGCTTGCA TTGCTGGATG AAGCAGGAAT GAAAGGTTTC
CCGCATGCGG ATTACCCTGA TGCTGTACGT CTGGAAACAC CTGCACCTGT TCATGCGCTA
CCTGGTTTTG AAGACGGATG GGTTACCGTT CAGGATGCAT CAGCACAAGG TTGCATGACC
TGGCTTGCGC CACAAAACGG TGAACACATT TTGGATCTTT GTGCCGCCCC CGGCGGTAAA
ACAACGCATA TCCTTGAGGT GGCACCAGAA GCGCAGGTTG TTGCGGTTGA TATCGACGAA
CAGCGCCTCT CTCGGGTTTA CGACAATTTA AAACGCCTTG GTATGAAGGC GACCGTGAAA
CAAGGTGATG GCCGTTACCC TTCTCAATGG TGTGGCGAGC AACAGTTTGA TCGCATTTTA
TTAGATGCGC CTTGTTCAGC AACCGGTGTG ATTCGTCGCC ATCCAGATAT TAAATGGTTA
CGTCGCGATC GCGATATCCC GGAACTCGCG CAATTGCAGT CTGAAATTCT CGACGCCATT
TGGCCGCATT TAAAAACCGG TGGAACTCTG GTCTATGCCA CCTGTTCGGT GTTACCGGAA
GAGAATAGCC TGCAGATTAA AGCCTTTTTG CAACGTACCG CTGATGCCGA ACTTTGCGAA
ACAGGAACAC CAGAGCAACC GGGTAAACAA AATCTACCTG GTGCCGAAGA GGGCGACGGC
TTCTTTTACG CTAAGCTAAT CAAAAAGTGA
 
Protein sequence
MKKQRNLRSM AAQAVEQVVE QGQSLSNILP PLQQKVSDKD KALLQELCFG VLRTLSQLDW 
LINKLMARPM TGKQRTVHYL IMVGLYQLLY TRIPPHAALA ETVEGAIAIK RPQLKGLING
VLRQFQRQQE ELLAEFNASD ARYLHPSWLL KRLQKAYPEQ WQSIVEANNQ RPPMWLRINR
THHSRDSWLA LLDEAGMKGF PHADYPDAVR LETPAPVHAL PGFEDGWVTV QDASAQGCMT
WLAPQNGEHI LDLCAAPGGK TTHILEVAPE AQVVAVDIDE QRLSRVYDNL KRLGMKATVK
QGDGRYPSQW CGEQQFDRIL LDAPCSATGV IRRHPDIKWL RRDRDIPELA QLQSEILDAI
WPHLKTGGTL VYATCSVLPE ENSLQIKAFL QRTADAELCE TGTPEQPGKQ NLPGAEEGDG
FFYAKLIKK