Gene EcDH1_3408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3408 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3652603 
End bp3654321 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content55% 
IMG OID 
Productprolyl-tRNA synthetase 
Protein accessionACX41024 
Protein GI260450602 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.132012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG 
ATCAGCCATC AGCTGATGCT GCGCGCCGGG ATGATCCGCA AGCTGGCCTC CGGGTTATAT
ACCTGGCTGC CGACCGGCGT GCGCGTTCTG AAAAAAGTCG AAAACATCGT GCGTGAAGAG
ATGAACAACG CCGGTGCGAT CGAGGTGTCG ATGCCGGTGG TTCAGCCAGC CGATTTGTGG
CAAGAGAGTG GTCGTTGGGA ACAGTACGGT CCGGAACTGC TGCGTTTTGT TGACCGTGGC
GAGCGTCCGT TCGTACTCGG CCCAACTCAT GAAGAAGTTA TCACTGACCT GATTCGTAAC
GAGCTTAGCT CTTACAAACA GCTGCCGCTG AACTTCTATC AGATCCAGAC CAAGTTCCGC
GACGAAGTGC GTCCGCGTTT CGGCGTCATG CGTTCCCGCG AATTCCTGAT GAAAGATGCT
TACTCTTTCC ATACTTCTCA GGAATCCCTG CAGGAAACCT ACGATGCAAT GTATGCGGCC
TACAGCAAAA TCTTCAGCCG CATGGGGCTG GATTTCCGCG CCGTACAAGC CGACACCGGT
TCTATCGGCG GCAGCGCCTC TCACGAATTC CAGGTGCTGG CGCAGAGCGG TGAAGACGAT
GTGGTCTTCT CCGACACCTC TGACTATGCA GCGAACATTG AACTGGCAGA AGCTATCGCG
CCGAAAGAAC CGCGCGCTGC TGCTACCCAG GAAATGACGC TGGTTGATAC GCCGAACGCG
AAAACCATCG CGGAACTGGT TGAACAGTTC AATCTGCCGA TTGAGAAAAC GGTTAAGACT
CTGCTGGTTA AAGCGGTTGA AGGCAGCAGC TTCCCGCAGG TTGCGCTGCT GGTGCGCGGT
GATCACGAGC TGAACGAAGT TAAAGCAGAA AAACTGCCGC AGGTTGCAAG CCCGCTGACT
TTCGCGACCG AAGAAGAAAT TCGTGCCGTG GTTAAAGCCG GTCCGGGTTC ACTGGGTCCG
GTAAACATGC CGATTCCGGT GGTGATTGAC CGTACCGTTG CGGCGATGAG TGATTTCGCT
GCTGGTGCTA ACATCGATGG TAAACACTAC TTCGGCATCA ACTGGGATCG CGATGTCGCT
ACCCCGGAAG TTGCAGATAT CCGTAACGTG GTGGCTGGCG ATCCAAGCCC GGATGGCCAG
GGTAGGCTGC TGATCAAACG TGGTATCGAA GTTGGTCACA TCTTCCAGCT GGGTACCAAG
TACTCCGAAG CACTGAAAGC CTCCGTACAG GGTGAAGATG GCCGTAACCA AATCCTGACG
ATGGGTTGCT ACGGTATCGG GGTAACGCGT GTGGTAGCTG CGGCGATTGA GCAGAACTAC
GACGAACGAG GCATCGTATG GCCTGACGCT ATCGCGCCGT TCCAGGTGGC GATTCTGCCG
ATGAACATGC ACAAATCCTT CCGCGTACAA GAGCTTGCTG AGAAACTGTA CAGCGAACTG
CGTGCACAAG GTATCGAAGT GCTGCTGGAT GACCGCAAAG AGCGTCCGGG CGTGATGTTT
GCTGATATGG AACTGATCGG TATTCCGCAC ACTATTGTGC TGGGCGACCG TAACCTCGAC
AACGACGATA TCGAATATAA ATATCGTCGC AACGGCGAGA AACAGTTAAT TAAGACTGGT
GACATCGTCG AATATCTGGT GAAACAGATT AAAGGCTGA
 
Protein sequence
MRTSQYLLST LKETPADAEV ISHQLMLRAG MIRKLASGLY TWLPTGVRVL KKVENIVREE 
MNNAGAIEVS MPVVQPADLW QESGRWEQYG PELLRFVDRG ERPFVLGPTH EEVITDLIRN
ELSSYKQLPL NFYQIQTKFR DEVRPRFGVM RSREFLMKDA YSFHTSQESL QETYDAMYAA
YSKIFSRMGL DFRAVQADTG SIGGSASHEF QVLAQSGEDD VVFSDTSDYA ANIELAEAIA
PKEPRAAATQ EMTLVDTPNA KTIAELVEQF NLPIEKTVKT LLVKAVEGSS FPQVALLVRG
DHELNEVKAE KLPQVASPLT FATEEEIRAV VKAGPGSLGP VNMPIPVVID RTVAAMSDFA
AGANIDGKHY FGINWDRDVA TPEVADIRNV VAGDPSPDGQ GRLLIKRGIE VGHIFQLGTK
YSEALKASVQ GEDGRNQILT MGCYGIGVTR VVAAAIEQNY DERGIVWPDA IAPFQVAILP
MNMHKSFRVQ ELAEKLYSEL RAQGIEVLLD DRKERPGVMF ADMELIGIPH TIVLGDRNLD
NDDIEYKYRR NGEKQLIKTG DIVEYLVKQI KG