Gene EcDH1_3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3087 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3314959 
End bp3316344 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content54% 
IMG OID 
Productcysteinyl-tRNA synthetase 
Protein accessionACX40713 
Protein GI260450291 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000869328 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAA TCTTCAATAC TCTGACACGC CAAAAAGAGG AATTTAAGCC TATTCACGCC 
GGGGAAGTCG GCATGTACGT GTGTGGAATC ACCGTTTACG ATCTCTGTCA TATCGGTCAC
GGGCGTACCT TTGTTGCTTT TGACGTGGTT GCGCGCTATC TGCGTTTCCT CGGCTATAAA
CTGAAGTATG TGCGCAACAT TACCGATATC GACGACAAAA TCATCAAACG CGCCAATGAA
AATGGCGAAA GCTTTGTGGC GATGGTGGAT CGCATGATCG CCGAAATGCA CAAAGATTTT
GATGCTTTGA ACATTCTGCG CCCGGATATG GAGCCGCGCG CGACGCACCA TATCGCAGAA
ATTATTGAAC TCACTGAACA ACTGATCGCC AAAGGTCACG CTTATGTGGC GGACAACGGC
GACGTGATGT TCGACGTCCC GACCGATCCA ACTTATGGCG TGCTGTCGCG TCAGGATCTC
GACCAGCTGC AGGCAGGCGC GCGCGTTGAC GTGGTCGACG ACAAACGCAA CCCAATGGAC
TTCGTTCTGT GGAAGATGTC GAAAGAGGGC GAACCGAGCT GGCCGTCTCC GTGGGGCGCG
GGTCGTCCTG GCTGGCACAT TGAATGTTCG GCAATGAACT GCAAGCAGCT GGGTAACCAC
TTTGATATCC ACGGCGGCGG TTCAGACCTG ATGTTCCCGC ACCACGAAAA CGAAATCGCG
CAGTCCACCT GTGCCCATGA TGGTCAGTAT GTGAACTACT GGATGCACTC GGGGATGGTG
ATGGTTGACC GCGAGAAGAT GTCCAAATCG CTGGGTAACT TCTTTACCGT GCGCGATGTG
CTGAAATACT ACGACGCGGA AACCGTGCGT TACTTCCTGA TGTCGGGCCA CTATCGCAGC
CAGTTGAACT ACAGCGAAGA GAACCTGAAG CAGGCGCGTG CGGCGCTGGA GCGTCTCTAC
ACTGCGCTGC GCGGCACAGA TAAAACCGTT GCGCCTGCCG GTGGCGAAGC GTTTGAAGCG
CGCTTTATTG AAGCGATGGA CGACGATTTC AACACCCCGG AAGCCTATTC CGTACTGTTT
GATATGGCGC GTGAAGTAAA CCGTCTGAAA GCAGAAGATA TGGCAGCGGC GAATGCAATG
GCATCTCACC TGCGTAAACT TTCCGCTGTA TTGGGCCTGC TGGAGCAAGA ACCGGAAGCG
TTCCTGCAAA GCGGCGCGCA GGCAGACGAC AGCGAAGTGG CTGAGATTGA AGCGTTAATT
CAACAGCGTC TGGATGCCCG TAAAGCGAAA GACTGGGCGG CGGCGGATGC GGCGCGTGAT
CGTCTTAACG AGATGGGGAT CGTGCTGGAA GATGGCCCGC AAGGGACCAC CTGGCGTCGT
AAGTAA
 
Protein sequence
MLKIFNTLTR QKEEFKPIHA GEVGMYVCGI TVYDLCHIGH GRTFVAFDVV ARYLRFLGYK 
LKYVRNITDI DDKIIKRANE NGESFVAMVD RMIAEMHKDF DALNILRPDM EPRATHHIAE
IIELTEQLIA KGHAYVADNG DVMFDVPTDP TYGVLSRQDL DQLQAGARVD VVDDKRNPMD
FVLWKMSKEG EPSWPSPWGA GRPGWHIECS AMNCKQLGNH FDIHGGGSDL MFPHHENEIA
QSTCAHDGQY VNYWMHSGMV MVDREKMSKS LGNFFTVRDV LKYYDAETVR YFLMSGHYRS
QLNYSEENLK QARAALERLY TALRGTDKTV APAGGEAFEA RFIEAMDDDF NTPEAYSVLF
DMAREVNRLK AEDMAAANAM ASHLRKLSAV LGLLEQEPEA FLQSGAQADD SEVAEIEALI
QQRLDARKAK DWAAADAARD RLNEMGIVLE DGPQGTTWRR K