Gene EcE24377A_0292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0292 
Symbol 
ID5587084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp319243 
End bp320565 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content50% 
IMG OID640924017 
Productphage integrase family protein 
Protein accessionYP_001461446 
Protein GI157155411 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAG TGATAACTCA GGATCGACAG ATAGCAAGCC TTGTTCCCCC TGTAGGAAAA 
AACCGGATTG TCGTCAGCGT CAAATCAAAA GCAGGTGCGG GGTTGTATAT CGAATCCCGA
TCAGGGAGCC AAACAAAGAG TTGGCTCTAC CGTCCCTACC TCAACGGCAA GCAGATAAAA
ATTACCCTGG GAGCTTATCC TGCAATGACG CTGGCCCAGG CTCGTGAAGC ACATGCTGAA
GCCATTGAGT TAGTGAAGCA GGGAATTGAC CCCAGATACG TTCGTAAAAC TGAAAAATCG
AATAATGAGC AGATGCCCAT ATTTTCTGAA TTGTGGGAAA ACTGGTTGTC ATTCAGAGAA
ACCAGCAAGC CAATAGGGGC TCGTACACTG GCTGATTATA AGGGAACGTA TCGTCGCCAT
CTTGAGCAGG GGCTAGGCTC CGTGAGAGTG TGCGATTTAT CGCGTTCGTA TTTGTTTGCA
CACCTGAGCA AAGTCCGACA ATCCAGCGCT GAAGGTGTGC GCAAAGGGCT TATCATCCTG
AATATGACAC TTGACCATGC CACACTTCAG GGATTAATCG AGCTCAACCC TGCCCGTCTG
TTAAAACCGG CGATGTTTGG TGCTTCCATG GCTAAACCGC GTGAGCGCTG GTTATCGCAG
GATGAGCTTC AACGGCTCTG GAAAACACTA GAAGAAGCTA CTGCTGGTGG CGGTTCAGTT
AGCACCGGTG GAAAAGGTAT CGCTTCCAGT GTCGTTTTAT CCCATGCGAT AGCTAACACA
CTGAAGCTGA TTATTCTGAC CGGCGTTCGC CGCTCTGAAG CGGTTAAAAT GCGCTGGGAT
CAAATCAACG GCGATCGCTG GACAATACCG GAAACAAAGA ATGGTAAAAG TCATGTAGTA
ACACTGCATC CGCTGGCACT GTCTATCCTG AAGAAACAGC GAATTATCTC CGAAGGCGCA
TTTGTCTTTG AGTCCACCAG CAACCAGGGA TTCCCTATAA CCGGAGACGC TGTCACACGA
GCACTGGAGC GCCTGCGTAA AAAATATCTG GCAGAGCTGG AGCCATTTTC TCCCCATGAT
TTACGCCGCA GTGTCGCTAC TGGTTGCGCT GAATATCTGG ATGCCCCGGA GCGCCTGATT
GAGCGATTGT TAAACCACAT TCCTAAAGAC CGCCTGATCA GAACCTATCA GGTCGGGCAG
CAGGCGGAAA AATTGCGTAG TTTGTTTTTG AGCTGGGGGG ATTTTGTGGA GCGGTATGTA
GCGCAGTCTG CCAGCCATGA GGTGACGGAT AATGTTGTTC AGGTTAAGTT TGGCGGCAGA
TAA
 
Protein sequence
MATVITQDRQ IASLVPPVGK NRIVVSVKSK AGAGLYIESR SGSQTKSWLY RPYLNGKQIK 
ITLGAYPAMT LAQAREAHAE AIELVKQGID PRYVRKTEKS NNEQMPIFSE LWENWLSFRE
TSKPIGARTL ADYKGTYRRH LEQGLGSVRV CDLSRSYLFA HLSKVRQSSA EGVRKGLIIL
NMTLDHATLQ GLIELNPARL LKPAMFGASM AKPRERWLSQ DELQRLWKTL EEATAGGGSV
STGGKGIASS VVLSHAIANT LKLIILTGVR RSEAVKMRWD QINGDRWTIP ETKNGKSHVV
TLHPLALSIL KKQRIISEGA FVFESTSNQG FPITGDAVTR ALERLRKKYL AELEPFSPHD
LRRSVATGCA EYLDAPERLI ERLLNHIPKD RLIRTYQVGQ QAEKLRSLFL SWGDFVERYV
AQSASHEVTD NVVQVKFGGR