Gene EcHS_A4100 details


Gene Information

Locus tag: EcHS_A4100
Symbol:
ID: 5592770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4091304
End bp: 4092284
Gene length: 981 bp
Protein length: 326 aa
Translation table: 11
GC content: 44%
IMG OID: 640923204
Product: AP endonuclease
Protein accession: YP_001460663
Protein GI: 157163345
COG category:
COG ID:
TIGRFAM ID:
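
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4091304, 4092284)
print(length)                      # 981
print(protein_length_aa(length))   # 326
```

Both values agree with the Gene length and Protein length fields in the record.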


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGACAA TAAATAACGC AAGAAAGATT CTACAACGTG TCGATACTCT TCCTCTTTAT 
TTACATGCCT ATGCCTTTCA TTTAAATATG CGGCTGGAAA GAGTATTGCC TGCTGATTTA
CTTGATATCG CGAGTGAAAA TAATCTGCGG GGCGTCAAAA TTCATGTTCT GGATGGAGAG
CGTTTTTCTC TTGGTAATAT GGACGATAAA GAACTCTCTG CCTTTGGTGA TAAAGCCCGC
CGTCTGAACC TTGATATTCA TATTGAAACC AGCGCCTCAG ATAAGGCATC TATTGACGAA
GCCGTCGCCA TCGCGTTGAA AACTGGGGCA TCGTCCGTAC GTTTTTATCC ACGTTATGAA
GGTAATTTGC GCGACGTATT ATCGATCATC GCTAACGACA TTGCCTATGT ACGGGAAACG
TATCAGGACA GCGGCCTGAC TTTTACGATC GAGCAGCATG AAGATTTAAA AAGCCATGAG
CTGGTGTCGT TGGTCAAAGA AAGTGAGATG GAGTCTCTTT CCTTACTGTT TGATTTTGCG
AACATGATCA ATGCAAATGA GCATCCCATC GACGCTTTAA AAACGATGGC GCCGCATATT
ACCCAGGTGC ATATCAAAGA TGCATTGATT GTTAAAGAAC AGGGTGGTCT GGGCCATAAA
GCCTGTATTT CAGGTCAGGG AGATATGCCC TTCAAAGCGT TATTAACGCA CCTTATCTGC
CTGGGCGATG ATGAGCCGCA GGTGACGGCA TATGGCCTGG AAGAAGAGGT TGATTATTAT
GCTCCGGCGT TCCGCTTTGA AGACGAAGAT GATAATCCGT GGATCCCTTA TCGCCAGATG
AGTGAAACAC CACTACCAGA AAATCATTTA CTGGATGCGC GGTTACGTAA AGAAAAAGAA
GATGCGATTA ATCAGATAAA TCATGTGCGT AACGTACTAC AACAAATCAA ACAAGAGGCA
AGCCATCTTC TGAACCACTA A
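
A GC content figure like the one reported in the record can be recomputed from a sequence laid out in this way. The following is a minimal sketch, assuming the sequence is pasted as text with spaces and line breaks between the 10-base blocks. The helper name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustration on the first line of the gene sequence above.
first_line = "ATGGTGACAA TAAATAACGC AAGAAAGATT CTACAACGTG TCGATACTCT TCCTCTTTAT"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of a single line gives the value to compare with the GC content field.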
 
Protein sequence
MVTINNARKI LQRVDTLPLY LHAYAFHLNM RLERVLPADL LDIASENNLR GVKIHVLDGE 
RFSLGNMDDK ELSAFGDKAR RLNLDIHIET SASDKASIDE AVAIALKTGA SSVRFYPRYE
GNLRDVLSII ANDIAYVRET YQDSGLTFTI EQHEDLKSHE LVSLVKESEM ESLSLLFDFA
NMINANEHPI DALKTMAPHI TQVHIKDALI VKEQGGLGHK ACISGQGDMP FKALLTHLIC
LGDDEPQVTA YGLEEEVDYY APAFRFEDED DNPWIPYRQM SETPLPENHL LDARLRKEKE
DAINQINHVR NVLQQIKQEA SHLLNH
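
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates the idea with a compact codon table. It assumes the sequence is read in frame from its first base, and it relies on table 11 using the same amino acid assignments as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGTGACAA TAAATAACGC AAGAAAGATT"))  # MVTINNARKI
```

The output matches the first ten residues of the protein sequence listed above.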