Gene EcHS_A4527 details


Gene Information

Locus tag: EcHS_A4527
Symbol:
ID: 5592199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4532707
End bp: 4534656
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 47%
IMG OID: 640923623
Product: phage integrase family site specific recombinase
Protein accession: YP_001461063
Protein GI: 157163745
COG category:
COG ID:
TIGRFAM ID:
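
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4532707
end_bp = 4534656
gene_length_bp = 1950
protein_length_aa = 649

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under those two assumptions, all three checks should agree with the values listed in the record.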


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.259982
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCAGC GTTATAACCT CTATCGTCGA ACCAGCGGCA TTTATGTTGT CCGCATTAGC 
GTTCCCCAAC GTTTTCGTCG ATACGCAGGA CAGTGTGAGA TTCACACTTC AACAGGCACT
CATGATCTTC ATGAAGCAAA GCAGAAGTCC GCGCTCCTGT TGGCTGTCTG GTATCAGACC
TTACAAGAGT ATGAACAATT GGATTACCGA ACTTTAAGTG ACTGCGCCCC GCTGCTTGCT
GGTGAGGGGA TGATCTCGCT TTCTAACTTT GCTCAGTCAA TCGAGTTGCC TATATCGCAA
TTGATTCGAG AGGTGATTAA TCGTAACCTC CCGGTATTCT GGCTGGCGAC TGGTCAGTTC
GGTTTCTATG TTGATGAATT TAATGCAGTA GAGCGGGAAC CCGGTGCAAA ACGAGAAAAA
CAGTCTGATG ATGAAAAGGA TCAACCTAAA GAAGTCATCA TTCTCAATAG CGCGTTTGAG
CTGGGTATCG AGAGCTTCGC AAATGGTTAT CTCCGCCCCT TCAATCCCCG GCATACTTTA
GATTGTCTGT TGAGCGCTGG AGTATCCGAA GGAGAGGCTG CATTTCGAAC TAGTGGTGAT
AACCAAAGTG GAGGTTGGTT CTTCGATTTA CCCGGCGTAG ATATAACTGC TGATAGCCTC
TTGATTAGCA AAGTTCATGC TGAAGGCCTT CGACTTACAT GGCTGGTTAA GACCACGCCA
CCAGCAGTTA GCATTCACCC TGCCGTGCCT CTTGTCGCCC CTGTTATCGC TAACGAATAT
GTTCACCGCA AACATTACAA TGAAAACTTG TCATGGCTTC GTGAAGAGTA TTTGAAACAT
CGGCGTAAGG GCAAGGTATC AGAAGCGGCG CTCCGCGATA TTCGCTATTA CTTCGATTTG
ATGATCGAAG TGATGGGGGA TATTCAGTTG GAAGATTTCG ACCGTGATTT CCTCCGGGCT
TATGAGAGCA AGTTGCGCAC AATTCCTGCT AACCGTAATT TGATGAAAGG TAAGCACGGG
GTTAAGACGC TGGATGAGTT AATCGCCAAA GCGGCAGAAT GTGGCGATAA ACTGATGACA
GAAGAGTCTG TCAAAAAGTA TATCAACGGC CTTTATGGTG CAATGGAGTG GGCTGTTGAT
GACGGTAAGT TTCTGAAATC GCCATGCGAC AACTTTTTCC CTCCCGATGA CAAAGGTGAG
CGAGAGCAGG ATCACACTGA CATATTTGAA CCGCATGAAA TTAAGGCAAT TTTTTCGCAA
CCGTGGTTTG TGGCTGGAAC TGTTGAACGT AATGCGCAAG GGCGATTCCA TCAATATTGC
CCGTTTCACT ATTGGGCGCC GTTGTTGGGC TTGATGACGG GGGCAAGGGT TAACGAGATT
GCACAGTTAA TGCTGGACGA TGTTCTGGCA GATGACGGCG TTTATTACCT GAACCTTGAA
AGCGATAGCG AAAACGGAAA GAAACTAAAA AACGCCCAAT CCCGCCGCAA GATTCCGGTT
CATTCTACGC TGATTGAACT CGGTTTTATC GAGTATGTGG ATGCGTTGAA AGCTGCCGGG
TATGACCGTC TTTTTCCCGA GCTTAAACCA CATAAAACTA AAGGCTATGG TAGGCCGGTT
TCCGCATGGT TCAATGAATC ATTGCTTGCG GGTCGATTAA AACTTGAAAG AGACAGAAGC
AAATCTTTCC ACTCTTTCCG GCATTCTGTT TCAACTTTGC TTAAAGAGAA GGGTGTTAGT
TCGGAACTGC GTGGGCAGCT ACTTGGGCAT GTGCGCGGCA AAACAGAAAC TGAAGTGCGA
TACAGCAAAG ATTTAAAACC GGTTCACATG GTTGAGGTTG TCGAAAAGAT TGATTTTTCT
TTGCCCGAGA TAGCGAGATT CAACATTCCT GATGGGCTGG ATGCTGTAAG TGATGCGCTG
CGAAGAAAGC GTGGCAAACA AACAGGTTGA
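
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal Python sketch, assuming GC content is defined as the fraction of G and C bases over all bases (the record does not say how its value was computed or rounded). The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group the sequence into blocks of 10.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 30 bases of the gene sequence above, used only as a short demonstration.
fragment = "ATGTCCCAGC GTTATAACCT CTATCGTCGA"
print(f"{gc_content(fragment):.0%}")
```

Passing the full gene sequence (all lines joined) instead of the fragment gives the value to compare against the GC content field.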
 
Protein sequence
MSQRYNLYRR TSGIYVVRIS VPQRFRRYAG QCEIHTSTGT HDLHEAKQKS ALLLAVWYQT 
LQEYEQLDYR TLSDCAPLLA GEGMISLSNF AQSIELPISQ LIREVINRNL PVFWLATGQF
GFYVDEFNAV EREPGAKREK QSDDEKDQPK EVIILNSAFE LGIESFANGY LRPFNPRHTL
DCLLSAGVSE GEAAFRTSGD NQSGGWFFDL PGVDITADSL LISKVHAEGL RLTWLVKTTP
PAVSIHPAVP LVAPVIANEY VHRKHYNENL SWLREEYLKH RRKGKVSEAA LRDIRYYFDL
MIEVMGDIQL EDFDRDFLRA YESKLRTIPA NRNLMKGKHG VKTLDELIAK AAECGDKLMT
EESVKKYING LYGAMEWAVD DGKFLKSPCD NFFPPDDKGE REQDHTDIFE PHEIKAIFSQ
PWFVAGTVER NAQGRFHQYC PFHYWAPLLG LMTGARVNEI AQLMLDDVLA DDGVYYLNLE
SDSENGKKLK NAQSRRKIPV HSTLIELGFI EYVDALKAAG YDRLFPELKP HKTKGYGRPV
SAWFNESLLA GRLKLERDRS KSFHSFRHSV STLLKEKGVS SELRGQLLGH VRGKTETEVR
YSKDLKPVHM VEVVEKIDFS LPEIARFNIP DGLDAVSDAL RRKRGKQTG
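
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal Python sketch of that translation, assuming the sequence contains only A, C, G and T and starts in frame. NCBI translation table 11 assigns the same amino acids to the 64 codons as the standard code and differs in which codons may act as start codons; since this gene starts with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks of 10.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGTCCCAGC GTTATAACCT CTATCGTCGA"))  # MSQRYNLYRR
print(translate("CGAAGAAAGC GTGGCAAACA AACAGGTTGA"))  # RRKRGKQTG
```

The two outputs match the first ten and last nine residues of the protein sequence above. The last line of the gene sequence happens to start on a codon boundary because every preceding line holds 60 bases.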