Gene EcDH1_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0903 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp968763 
End bp970064 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content51% 
IMG OID 
ProductRNA methyltransferase, TrmA family 
Protein accessionACX38586 
Protein GI260448164 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000019395 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAT TCTACTCTGC AAAACGACGC ACGACGACGC GTCAGATCAT AACCGTTTCA 
GTCAACGACC TCGACTCTTT TGGTCAGGGC GTGGCGCGAC ATAACGGCAA AACGCTATTT
ATCCCCGGAT TATTGCCGCA GGAAAACGCG GAAGTTACTG TTACTGAAGA TAAAAAACAG
TATGCCCGCG CTAAAGTCGT ACGCCGGTTA AGCGATAGCC CGGAACGCGA AACGCCACGC
TGTCCTCATT TTGGCGTATG CGGTGGCTGT CAGCAACAAC ACGCCAGCGT GGATTTACAG
CAGCGAAGCA AAAGTGCGGC ACTCGCCCGA TTAATGAAAC ACGATGTCTC TGAAGTGATC
GCCGATGTTC CCTGGGGCTA TCGCCGTCGC GCGCGTTTAA GTCTGAACTA CTTACCGAAA
ACACAGCAAC TTCAGATGGG GTTTCGCAAA GCGGGCTCCA GTGACATTGT CGACGTCAAA
CAATGCCCCA TTTTAGCGCC CCAACTTGAA GCATTGCTGC CCAAAGTCAG GGCATGTCTG
GGCAGCTTAC AAGCTATGCG CCATCTTGGT CATGTTGAAC TGGTACAGGC AACCAGCGGC
ACGCTGATGA TTTTGCGCCA TACCGCACCG CTAAGTTCGG CAGATCGCGA AAAACTGGAA
CGCTTTTCGC ATTCTGAAGG CCTGGATCTG TATCTCGCCC CCGATAGTGA GATACTCGAA
ACCGTCTCTG GTGAGATGCC CTGGTATGAC TCAAACGGGT TGCGCTTAAC TTTTAGCCCG
CGCGATTTTA TTCAGGTCAA TGCGGGTGTG AACCAAAAAA TGGTAGCGCG TGCGTTGGAA
TGGCTGGATG TGCAACCTGA AGATCGCGTA CTGGATCTGT TCTGCGGTAT GGGCAACTTT
ACACTGCCAT TGGCGACACA AGCTGCCAGT GTGGTCGGTG TAGAAGGTGT TCCGGCGCTG
GTGGAAAAAG GCCAGCAGAA TGCGCGTCTT AATGGCTTAC AGAATGTGAC GTTTTATCAC
GAAAATCTTG AAGAAGATGT CACAAAGCAG CCGTGGGCGA AAAACGGCTT CGATAAAGTG
TTGCTGGACC CGGCGCGAGC AGGTGCCGCA GGTGTTATGC AGCAAATTAT AAAACTGGAA
CCTATTCGTA TAGTTTATGT ATCCTGTAAC CCTGCAACGC TGGCTCGGGA TAGCGAAGCG
TTATTAAAAG CAGGATATAC CATTGCGCGA CTGGCGATGC TGGATATGTT CCCACACACG
GGACATCTGG AATCGATGGT ACTTTTCTCG CGCGTTAAAT AG
 
Protein sequence
MAQFYSAKRR TTTRQIITVS VNDLDSFGQG VARHNGKTLF IPGLLPQENA EVTVTEDKKQ 
YARAKVVRRL SDSPERETPR CPHFGVCGGC QQQHASVDLQ QRSKSAALAR LMKHDVSEVI
ADVPWGYRRR ARLSLNYLPK TQQLQMGFRK AGSSDIVDVK QCPILAPQLE ALLPKVRACL
GSLQAMRHLG HVELVQATSG TLMILRHTAP LSSADREKLE RFSHSEGLDL YLAPDSEILE
TVSGEMPWYD SNGLRLTFSP RDFIQVNAGV NQKMVARALE WLDVQPEDRV LDLFCGMGNF
TLPLATQAAS VVGVEGVPAL VEKGQQNARL NGLQNVTFYH ENLEEDVTKQ PWAKNGFDKV
LLDPARAGAA GVMQQIIKLE PIRIVYVSCN PATLARDSEA LLKAGYTIAR LAMLDMFPHT
GHLESMVLFS RVK