Gene EcHS_A0909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0909 
Symbol 
ID5594426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp917273 
End bp918304 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content47% 
IMG OID640920079 
Productphage integrase family site specific recombinase 
Protein accessionYP_001457646 
Protein GI157160328 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.046478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTAC GAAAACTCAC CACAGGAAAA TGGCTTTGCG AATGTTACCC CGCCGGACGT 
AGTGGGCGTC GTGTGCGTAA ACAATTCGCC ACCAAAGGCG AAGCACTGGC TTTTGAGCGT
CACACGATGG AAGAAACCGA AGCAAAGCCC TGGCTGGGTG AATCAGTGGA TCGTCGAACA
CTGAAAGACG TGGTTGAGCT ATGGTTCAAA CTACATGGTA AATCACTGAC TGCTGGGCAG
CATGTCTATG ACAAATTGCT GCTGATGGTT GACGCTCTGG GCAATCCCCT TGCAACTGAT
CTCACATCTA AAATGTTTGC CCACTATCGA GATAAACGCC TGACAGGTGA GATCTACTTC
AGCGAGAAAT GGAAGAAAGG AGCAAGCCCG GTCACCATTA ACCTGGAGCA AAGCTATCTA
AGTAGTGTTT TTAGCGAACT ATCCCGCCTG GGCGAATGGT CGTATCCGAA CCCACTGGAG
AACATGCGAA AATTCACCAT CGCAGAAAAA GAGATGGCAT GGCTTACCCA TGAGCAGATT
GTTGAATTGC TGGCTGATTG CAAACGTCAG GACCCAATTC TGGCACTGGT AGTTAAGATA
TGCTTAAGCA CAGGCGCACG TTGGCGTGAA GCCGTAAATC TTACCCGCTC ACAGGTGACC
AAATACCGAA TTACCTTTGT AAGAACGAAG GGGAAGAAAA ACAGAAGCAT CCCTATCAGT
AAAGAGCTTT ACGAAGAGAT CATGGCGCTT GATGGGTTCA ATTTCTTTAC AGACTGCTAT
TTTCAATTTT TATCCGTGAT GGAAAAAACG TCTATCGTGC TCCCTCGCGG TCAACTGACA
CACGTTCTGC GCCATACGTT TGCGGCGCAC TTCATGATGT CGGGTGGAAA TATCCTGGCC
TTACAAAAAA TTCTCGGGCA TCACGATATA AAAATGACTA TGCGTTACGC ACATCTGGCA
CCGGATCACC TGGAAACTGC ATTACGGTTT AATCCGCTGG CAACACTACC AACATCAATA
GCAAGTTTTT GA
 
Protein sequence
MAVRKLTTGK WLCECYPAGR SGRRVRKQFA TKGEALAFER HTMEETEAKP WLGESVDRRT 
LKDVVELWFK LHGKSLTAGQ HVYDKLLLMV DALGNPLATD LTSKMFAHYR DKRLTGEIYF
SEKWKKGASP VTINLEQSYL SSVFSELSRL GEWSYPNPLE NMRKFTIAEK EMAWLTHEQI
VELLADCKRQ DPILALVVKI CLSTGARWRE AVNLTRSQVT KYRITFVRTK GKKNRSIPIS
KELYEEIMAL DGFNFFTDCY FQFLSVMEKT SIVLPRGQLT HVLRHTFAAH FMMSGGNILA
LQKILGHHDI KMTMRYAHLA PDHLETALRF NPLATLPTSI ASF