Gene EcDH1_3814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3814 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4105486 
End bp4107927 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content55% 
IMG OID 
ProductVacB and RNase II family 3'-5' exoribonuclease 
Protein accessionACX41416 
Protein GI260450994 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAAG ATCCTTTCCA GGAACGCGAA GCTGAAAAAT ACGCGAATCC CATCCCTAGT 
CGGGAATTTA TCCTCGAACA TTTAACCAAA CGTGAAAAAC CGGCCAGCCG TGATGAGCTG
GCGGTAGAAC TGCACATTGA AGGCGAAGAG CAGCTTGAAG GCCTGCGTCG CCGCCTGCGC
GCGATGGAGC GCGATGGTCA ACTGGTCTTC ACTCGTCGTC AGTGCTATGC GCTGCCGGAA
CGCCTCGACC TGGTGAAAGG TACCGTTATT GGCCACCGTG ATGGCTACGG CTTTCTGCGG
GTTGAAGGGC GTAAAGATGA TTTGTATCTC TCCAGCGAGC AGATGAAAAC CTGCATTCAT
GGCGATCAGG TGCTGGCTCA GCCGCTGGGT GTTGACCGTA AAGGTCGTCG TGAAGCGCGT
ATTGTCCGCG TACTGGTGCC AAAAACCAGC CAGATTGTTG GTCGCTACTT TACCGAAGCG
GGCGTCGGCT TTGTGGTTCC TGACGACAGC CGTCTGAGCT TCGATATCTT AATCCCGCCC
GATCAGATCA TGGGCGCGCG GATGGGCTTT GTGGTCGTAG TCGAACTGAC TCAGCGTCCG
ACTCGCCGCA CCAAAGCGGT GGGTAAAATC GTCGAAGTGC TGGGCGACAA TATGGGCACC
GGCATGGCGG TTGATATCGC TCTGCGTACC CATGAAATTC CGTACATCTG GCCGCAGGCT
GTTGAGCAAC AGGTTGCCGG GCTGAAAGAA GAAGTGCCGG AAGAAGCAAA AGCGGGCCGT
GTTGATCTGC GCGATTTACC GCTGGTCACC ATTGATGGCG AAGACGCCCG TGACTTTGAC
GATGCAGTTT ACTGCGAGAA AAAACGCGGC GGCGGCTGGC GTTTATGGGT CGCGATTGCC
GACGTCAGCT ACTATGTGCG TCCGTCAACG CCGCTGGACA GAGAAGCGCG TAACCGTGGC
ACGTCGGTGT ACTTCCCTTC GCAGGTTATC CCGATGCTGC CGGAAGTGCT CTCTAACGGC
CTGTGTTCGC TCAACCCGCA GGTAGACCGC CTGTGTATGG TGTGCGAGAT GACGGTTTCG
TCGAAAGGCC GCCTGACGGG CTACAAATTC TACGAAGCGG TGATGAGCTC TCACGCGCGT
CTGACCTACA CCAAAGTCTG GCATATTCTG CAGGGCGATC AGGATCTGCG CGAGCAGTAC
GCCCCGCTGG TTAAGCATCT CGAAGAGTTG CATAACCTCT ATAAAGTGCT GGATAAAGCC
CGTGAAGAAC GCGGTGGGAT CTCATTTGAG AGCGAAGAAG CGAAGTTCAT TTTCAACGCT
GAACGCCGTA TTGAACGTAT CGAACAGACC CAGCGTAACG ACGCGCACAA ATTAATTGAA
GAGTGCATGA TTCTGGCGAA TATCTCGGCG GCGCGTTTCG TTGAGAAAGC GAAAGAACCG
GCACTGTTCC GTATTCACGA CAAGCCGAGC ACCGAAGCGA TTACCTCTTT CCGTTCAGTG
CTGGCGGAGC TGGGGCTGGA ACTGCCGGGC GGTAACAAGC CGGAACCGCG TGACTACGCG
GAGCTGCTGG AGTCGGTTGC CGATCGTCCT GATGCAGAAA TGCTGCAAAC CATGCTGCTG
CGCTCGATGA AACAGGCGAT TTACGATCCA GAAAACCGTG GTCACTTTGG CCTGGCATTG
CAGTCCTATG CGCACTTTAC TTCGCCGATT CGTCGTTATC CAGACCTGAC GCTGCACCGC
GCCATTAAAT ATCTGCTGGC GAAAGAGCAG GGGCATCAGG GCAACACCAC TGAAACCGGC
GGCTACCATT ATTCGATGGA AGAGATGCTG CAACTGGGTC AGCACTGTTC GATGGCGGAA
CGTCGTGCCG ACGAAGCAAC GCGCGATGTG GCTGACTGGC TGAAGTGTGA CTTCATGCTC
GACCAGGTAG GTAACGTCTT TAAAGGCGTA ATTTCCAGCG TCACTGGCTT TGGCTTCTTC
GTCCGTCTGG ACGACTTGTT CATTGATGGT CTGGTCCATG TCTCTTCGCT GGACAATGAC
TACTATCGCT TTGACCAGGT AGGGCAACGC CTGATGGGGG AATCCAGCGG CCAGACTTAT
CGCCTGGGCG ATCGCGTGGA AGTTCGCGTC GAAGCGGTTA ATATGGACGA GCGCAAAATC
GACTTTAGCC TGATCTCCAG CGAACGCGCA CCGCGCAACG TCGGTAAAAC GGCGCGCGAG
AAAGCGAAAA AAGGCGATGC AGGTAAAAAA GGCGGCAAGC GTCGTCAGGT CGGTAAAAAG
GTAAACTTTG AGCCAGACAG CGCCTTCCGC GGTGAGAAAA AAACGAAGCC GAAAGCGGCG
AAGAAAGACG CGAGAAAAGC GAAAAAGCCA TCGGCGAAAA CGCAGAAAAT AGCTGCAGCG
ACCAAAGCGA AGCGTGCGGC GAAGAAAAAA GTGGCAGAGT GA
 
Protein sequence
MSQDPFQERE AEKYANPIPS REFILEHLTK REKPASRDEL AVELHIEGEE QLEGLRRRLR 
AMERDGQLVF TRRQCYALPE RLDLVKGTVI GHRDGYGFLR VEGRKDDLYL SSEQMKTCIH
GDQVLAQPLG VDRKGRREAR IVRVLVPKTS QIVGRYFTEA GVGFVVPDDS RLSFDILIPP
DQIMGARMGF VVVVELTQRP TRRTKAVGKI VEVLGDNMGT GMAVDIALRT HEIPYIWPQA
VEQQVAGLKE EVPEEAKAGR VDLRDLPLVT IDGEDARDFD DAVYCEKKRG GGWRLWVAIA
DVSYYVRPST PLDREARNRG TSVYFPSQVI PMLPEVLSNG LCSLNPQVDR LCMVCEMTVS
SKGRLTGYKF YEAVMSSHAR LTYTKVWHIL QGDQDLREQY APLVKHLEEL HNLYKVLDKA
REERGGISFE SEEAKFIFNA ERRIERIEQT QRNDAHKLIE ECMILANISA ARFVEKAKEP
ALFRIHDKPS TEAITSFRSV LAELGLELPG GNKPEPRDYA ELLESVADRP DAEMLQTMLL
RSMKQAIYDP ENRGHFGLAL QSYAHFTSPI RRYPDLTLHR AIKYLLAKEQ GHQGNTTETG
GYHYSMEEML QLGQHCSMAE RRADEATRDV ADWLKCDFML DQVGNVFKGV ISSVTGFGFF
VRLDDLFIDG LVHVSSLDND YYRFDQVGQR LMGESSGQTY RLGDRVEVRV EAVNMDERKI
DFSLISSERA PRNVGKTARE KAKKGDAGKK GGKRRQVGKK VNFEPDSAFR GEKKTKPKAA
KKDARKAKKP SAKTQKIAAA TKAKRAAKKK VAE