Gene EcDH1_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1995 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2154345 
End bp2156357 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content53% 
IMG OID 
ProductFusaric acid resistance protein conserved region 
Protein accessionACX39652 
Protein GI260449230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000064393 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAT CGTCATGGTC CTTGCGCAAT TTGCCCTGGT TCAGGGCCAC GCTGGCGCAA 
TGGCGTTATG CGTTACGCAA TACCATTGCC ATGTGTCTGG CGCTGACGGT TGCCTATTAT
TTAAATCTGG ATGAACCCTA TTGGGCGATG ACCTCGGCTG CAGTGGTTAG CTTTCCCACC
GTTGGCGGTG TTATCAGCAA AAGCCTCGGA CGCATCGCTG GCAGTTTGCT CGGAGCCATT
GCGGCACTGC TTCTTGCCGG GCATACGCTC AATGAGCCGT GGTTTTTTCT ATTGAGCATG
TCGGCGTGGC TTGGCTTTTG TACCTGGGCC TGTGCGCACT TCACGAATAA CGTCGCGTAT
GCATTTCAAC TGGCGGGCTA CACGGCTGCC ATCATCGCCT TTCCGATGGT TAATATTACT
GAGGCCAGCC AGCTGTGGGA TATCGCTCAG GCGCGCGTTT GCGAGGTAAT AGTCGGTATT
TTGTGCGGCG GCATGATGAT GATGATCCTG CCGAGCAGTT CCGATGCTAC TGCCCTTTTA
ACCGCATTGA AAAACATGCA CGCCCGATTA CTGGAACATG CCAGTTTACT CTGGCAGCCT
GAAACAACCG ATGCCATTCG TGCAGCACAT GAAGGGGTGA TTGGGCAGAT ACTGACCATG
AATTTGCTGC GTATCCAGGC TTTCTGGAGC CACTATCGTT TTCGCCAGCA AAACGCGCGC
CTTAATGCGC TGCTCCACCA GCAATTACGT ATGACCAGTG TCATCTCCAG CCTGCGACGT
ATGTTGCTCA ACTGGCCCTC ACCGCCAGGT GCCACACGAG AAATTCTCGA ACAGTTGCTG
ACGGCGCTCG CCAGTTCGCA AACAGATGTT TACACCGTCG CACGTATTAT CGCCCCGCTA
CGCCCGACCA ACGTCGCCGA CTATCGGCAC GTCGCCTTCT GGCAGCGACT ACGTTATTTT
TGCCGCCTTT ATCTGCAAAG TAGTCAGGAA TTACATCGTC TGCAAAGCGG TGTAGATGAT
CATACCAGAC TCCCACGGAC ATCCGGCCTG GCTCGTCATA CCGATAACGC CGAAGCTATG
TGGAGCGGGC TGCGTACATT TTGTACGTTG ATGATGATTG GCGCATGGAG TATTGCTTCG
CAATGGGATG CCGGTGCCAA TGCATTAACG CTGGCAGCAA TTAGCTGCGT ACTCTACTCC
GCCGTCGCAG CACCGTTTAA GTCGTTGTCA CTTCTGATGC GCACGCTGGT GTTACTTTCG
CTATTCAGCT TTGTGGTCAA ATTTGGTCTG ATGGTCCAGA TTAGCGATCT GTGGCAATTT
TTACTGTTTC TCTTTCCACT GCTGGCGACA ATGCAGCTTC TTAAATTGCA GATGCCAAAA
TTTGCCGCAT TGTGGGGGCA ACTGATTGTT TTTATGGGTT CTTTTATCGC TGTCACTAAT
CCCCCGGTGT ATGATTTTGC TGATTTTCTT AACGATAATC TGGCAAAAAT CGTTGGCGTC
GCGTTGGCGT GGTTAGCGTT CGCCATTCTG CGTCCAGGAT CGGATGCTCG TAAAAGCCGC
CGCCATATTC GCGCGCTGCG CCGGGATTTT GTCGATCAGC TAAGCCGCCA TCCAACACTG
AGTGAAAGCG AATTTGAATC GCTCACTTAT CATCACGTCA GTCAGTTGAG TAACAGCCAG
GATGCGCTGG CTCGCCGTTG GTTATTACGC TGGGGTGTAG TGCTGCTGAA CTGTTCTCAT
GTTGTCTGGC AATTGCGCGA CTGGGAATCG CGTTCCGATC CGTTATCGCG AGTACGGGAT
AACTGTATTT CACTGTTGCG GGGAGTGATG AGTGAGCGTG GCGTTCAGCA AAAATCACTG
GCGGCCACAC TTGAAGAATT ACAGCGGATT TGCGACAGCC TTGCCCGTCA TCATCAACCT
GCCGCCCGTG AGCTGGCGGC AATTGTCTGG CGGCTGTACT GCTCGCTTTC GCAACTTGAG
CAAGCACCAC CGCAAGGTAC GCTGGCCTCT TAA
 
Protein sequence
MNASSWSLRN LPWFRATLAQ WRYALRNTIA MCLALTVAYY LNLDEPYWAM TSAAVVSFPT 
VGGVISKSLG RIAGSLLGAI AALLLAGHTL NEPWFFLLSM SAWLGFCTWA CAHFTNNVAY
AFQLAGYTAA IIAFPMVNIT EASQLWDIAQ ARVCEVIVGI LCGGMMMMIL PSSSDATALL
TALKNMHARL LEHASLLWQP ETTDAIRAAH EGVIGQILTM NLLRIQAFWS HYRFRQQNAR
LNALLHQQLR MTSVISSLRR MLLNWPSPPG ATREILEQLL TALASSQTDV YTVARIIAPL
RPTNVADYRH VAFWQRLRYF CRLYLQSSQE LHRLQSGVDD HTRLPRTSGL ARHTDNAEAM
WSGLRTFCTL MMIGAWSIAS QWDAGANALT LAAISCVLYS AVAAPFKSLS LLMRTLVLLS
LFSFVVKFGL MVQISDLWQF LLFLFPLLAT MQLLKLQMPK FAALWGQLIV FMGSFIAVTN
PPVYDFADFL NDNLAKIVGV ALAWLAFAIL RPGSDARKSR RHIRALRRDF VDQLSRHPTL
SESEFESLTY HHVSQLSNSQ DALARRWLLR WGVVLLNCSH VVWQLRDWES RSDPLSRVRD
NCISLLRGVM SERGVQQKSL AATLEELQRI CDSLARHHQP AARELAAIVW RLYCSLSQLE
QAPPQGTLAS