Gene EcSMS35_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0299 
Symbol 
ID6143653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp306879 
End bp308063 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content49% 
IMG OID641615196 
Productphage integrase family site specific recombinase 
Protein accessionYP_001742404 
Protein GI170681995 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.000127187 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCTTA ATGCTCGACA GGTAGATGCT GCTAAACCCA GAGAGAAAGC CTACAAGCTA 
GCAGATGGTG CAGGCTTGTA TCTTGAAGTT GTTCCTTCTG GTTCTCGATA CTGGCGGATG
AAATATCGCT TCAATGGAAA AGAGAAACGT ATGGCTTTTG GTGTCTATCC GGCAGTGTCC
CTTGCACAAG CGAGGGCACT GCGTGATGAA GCCAAGAAAA AGCTGGCCGA AGGTATCGAC
CCATCGTTTG CCAAGAAAGA AGAAAAGTTG GTTCGCGATG TGCAGCTCAA TAATACGTTT
CAGGCTGTGG CACTTGAATG GCACGGAACG AAGGTGAGCC GGTGGTCAGA AGGTTATGCC
TCCGACATTA TCGAAGCCTT CAATAAAGAT ATTTTCCCTT ATATTGGCCA ACTGCCGGTG
AATGACATCA AGCCTTTGGT TCTGCTGAAT GTGCTACGTC GAATGGAAAG CCGTGGCGCG
ACAGAGAAGG CCAAGAAGGT TCGCCAGCGT TGCAGTGAAG TCTTTCGTTA CGCCATCGTT
ACCGGTCGTG CGGAATACAA TCCTGCAGCG GATCTAACCA GCGCAATGTC AGGGCATGAA
TCGAAGCATT ATCCCTTCCT TACTGTTGAG GAGTTACCAG ACTTCTTTAA AGCTCTCGCA
GGCTACACAG GAAGCCCGTT AGTTGTTCTT GCCGCTCGTC TGCTGATCCT TACAGGAGTT
CGTACTGGCG AGCTACGAGG TGCTTTCTGG AGTGAGTTTG ATCTTGAAAA AGCAGTGTGG
GAAATACCTG CAGAGCGTAT GAAGATGAAA CGGCCTCACC TTGTCCCCCT ATCTACCCAA
GCGCTGGAAA TCGTACAACA ACTCAAGGTG ATATCTGGGC AATATCCACT GGTATTCCCA
GGGCGAAATG ATCCCCGCAA GACGATGAGT GAAGCGAGTA TGAATCAGGT ATTCAAACGG
ATTGGGTATA CGGGGAAGGT AACGGGGCAT GGTTTCCGTC ACACGATGAG TACGATTTTG
CACGAGGAAG GGTTCAATAC GGCATGGATT GAAACCCAGC TTGCGCATGT CGATAAGAAT
GCGATTCGTG GGACGTACAA CCATGCTTTG TATCTGGAAG GGCGGAGGGA GATGATGCAG
TGGTATGCTG ATTGCATTGG AAGAATTGGT AATGATGTCA ATTGA
 
Protein sequence
MKLNARQVDA AKPREKAYKL ADGAGLYLEV VPSGSRYWRM KYRFNGKEKR MAFGVYPAVS 
LAQARALRDE AKKKLAEGID PSFAKKEEKL VRDVQLNNTF QAVALEWHGT KVSRWSEGYA
SDIIEAFNKD IFPYIGQLPV NDIKPLVLLN VLRRMESRGA TEKAKKVRQR CSEVFRYAIV
TGRAEYNPAA DLTSAMSGHE SKHYPFLTVE ELPDFFKALA GYTGSPLVVL AARLLILTGV
RTGELRGAFW SEFDLEKAVW EIPAERMKMK RPHLVPLSTQ ALEIVQQLKV ISGQYPLVFP
GRNDPRKTMS EASMNQVFKR IGYTGKVTGH GFRHTMSTIL HEEGFNTAWI ETQLAHVDKN
AIRGTYNHAL YLEGRREMMQ WYADCIGRIG NDVN