Gene EcHS_A0329 details

Gene Information

Locus tag: EcHS_A0329
Symbol:
ID: 5590906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 333837
End bp: 335045
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 49%
IMG OID: 640919515
Product: site-specific recombinase, phage integrase family protein
Protein accession: YP_001457101
Protein GI: 157159783
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
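
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 333837
end_bp = 335045
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not counted in the protein.
codons = span // 3
expected_aa = codons - 1

print(span == gene_length_bp)            # span matches the reported gene length
print(span % 3 == 0)                     # span is a whole number of codons
print(expected_aa == protein_length_aa)  # protein length matches after dropping the stop
```

Under these assumptions all three checks agree with the reported values: 1209 bp, 403 codons, 402 amino acids.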


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.0019463
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTTA ATGCGCGACA GGTAGACGCA GCTAAACCCA GAGAGAAAGC CTATAAGCTG 
GCAGATGGTG CTGGTTTGTA TCTTGAGGTT GTTCCCTCTG GTTCCAGATA CTGGCGGATG
AAATATCGCT TCAATGGAAA AGAGAAGCGT ATGGCCTTTG GTGTCTATCC GGCAGTGTCC
CTTGCACAAG CGAGGGCACT GCGTGACGAC GCCAAGAAAA AGCTTGCCGA AGGTATCGAT
CCATCGCTTG CCAAGAAAGA AGAAAAGCTG GTTCGAGATG TGCAGCTAAA TAATACGTTT
CAGGCTGTGG CACTTGAATG GCATGGAACG AAGGTGAGCC GATGGTCAGA AGGTTATGCC
TCGGACATTA TCGAAGCCTT TAATAAAGAT ATTTTTCCCT ATATTGGCCA ACAACCGGTG
AATGAAATCA AACCGCTGGT TCTGCTTAAT GTGCTGCGTC GAATTGAAAG CCGTGGCGCG
ACAGAGAAGG CCAAGAAGGT TCGCCAGCGT TGCAGCGAAG TCTTTCGTTA CGCCATCGTA
ACCGGTCGTG CGGAATACAA TCCTGCAGCG GATCTTACCA GCGCGATGTC AGGGCATGAA
TCGAAGCATT ATCCCTTCCT TACTGTTGAG GAGTTACCAG ACTTTTTTAA AGCTCTCGCA
GGCTACACAG GAAGCCCGTT AGTTGTTCTT GCGGCACGTC TGCTGATCCT CACGGGAGTT
CGCACTGGCG AACTCCGAGG TGCTTTCTGG AGTGAGTTTG ATCTTGAAAA AGCGGTGTGG
GAGATACCTG CCGAGCGTAT GAAGATGAAA CGGCCTCACC TTGTCCCCCT CTCTACCCAA
GCGCTGGAAA TCGTACAGCA ACTCAAAGTG ATGTCAGGGC AATATCCACT GGTGTTCCCG
GGACGTAATG ATCCCCGCAA GACGATGAGT GAAGCGAGTA TTAATCAGGT TTTTAAGCGG
ATTGGATATA CGGGGAGGGT AACGGGGCAT GGTTTCCGTC ACACGATGAG TACGATTTTG
CATGAGGAAG GTTTCAATAC AGCGTGGATT GAAACCCAGC TGGCTCACGT TGATAAAAAT
GCGATTCGTG GGACGTATAA CCATGCGTTG TATTTGGAAG GACGTAGGGA GATGATGCAG
TGGTATGCGG ACTATATAAA CGCTAATCAT AATTCATGTA TTTCTTTGCT GGTGAACGCT
ACAAAGTAG
 
Protein sequence
MKLNARQVDA AKPREKAYKL ADGAGLYLEV VPSGSRYWRM KYRFNGKEKR MAFGVYPAVS 
LAQARALRDD AKKKLAEGID PSLAKKEEKL VRDVQLNNTF QAVALEWHGT KVSRWSEGYA
SDIIEAFNKD IFPYIGQQPV NEIKPLVLLN VLRRIESRGA TEKAKKVRQR CSEVFRYAIV
TGRAEYNPAA DLTSAMSGHE SKHYPFLTVE ELPDFFKALA GYTGSPLVVL AARLLILTGV
RTGELRGAFW SEFDLEKAVW EIPAERMKMK RPHLVPLSTQ ALEIVQQLKV MSGQYPLVFP
GRNDPRKTMS EASINQVFKR IGYTGRVTGH GFRHTMSTIL HEEGFNTAWI ETQLAHVDKN
AIRGTYNHAL YLEGRREMMQ WYADYINANH NSCISLLVNA TK
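
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon assignments are the standard ones that table 11 shares with the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Assumption: sense-codon assignments of table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, first three blocks only.
print(translate("ATGAAACTTA ATGCGCGACA GGTAGACGCA"))  # MKLNARQVDA
```

The printed fragment matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.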