Gene EcHS_A3244 details


Gene Information

Locus tag: EcHS_A3244
Symbol: gcp
ID: 5592217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3252976
End bp: 3253989
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 57%
IMG OID: 640922362
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_001459858
Protein GI: 157162540
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
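
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, and if the stop codon is counted in the gene length but not in the protein length. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
# Values copied from the record above.
START_BP = 3252976
END_BP = 3253989


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1014
print(protein_length(length_bp))  # 337
```

Under these assumptions the results match the reported 1014 bp and 337 aa.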


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 2.43858e-17
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT 
GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC
GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CACGTGCGCA AAACCGTACC GTTGATCCAG
GCGGCGCTAA AGGAGTCTGG CTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA
GGCCCTGGAT TAGTCGGCGC ACTGCTGGTT GGCGCAACCG TGGGGCGTTC TCTGGCGTTT
GCCTGGAACG TTCCGGCAAT CCCGGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG
CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTCGCGCTGC TGGTGTCCGG CGGTCATACG
CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT
GCCGCCGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGA
CCGTTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTCTT CCCGCGTCCG
ATGACCGACC GTCCGGGGCT GGATTTCAGT TTCTCCGGTC TGAAAACCTT CGCGGCAAAT
ACCATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA
GATGCGGTGG TCGATACGCT GATGATTAAG TGCAAGCGAG CGTTGGATCA GACTGGCTTT
AAGCGACTGG TCATGGCAGG CGGCGTGAGT GCTAACCGTA CGTTACGGGC GAAGCTGGCT
GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAGTT TTGTACTGAT
AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT
CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCCGC GTAA
 
Protein sequence
MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM
LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE
DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD
NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
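
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following Python sketch is one way to strip the spacing, compute a GC percentage, and translate codons. The function names are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (the two tables differ in the start codons they permit). Only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; stop codons become '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence listed above.
sample = clean("ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT")
print(len(sample))         # 60
print(translate(sample))   # MRVLGIETSCDETGIAIYDD
print(gc_content(sample))  # GC percentage of this 60-base sample only
```

The translated sample matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_content` would allow a comparison with the reported GC content field.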