Gene EcSMS35_4753 details

Gene Information

Locus tag: EcSMS35_4753
Symbol:
ID: 6143639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4851586
End bp: 4852851
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 46%
IMG OID: 641619567
Product: phage integrase family site specific recombinase
Protein accession: YP_001746674
Protein GI: 170682520
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
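
The coordinates and lengths listed for this gene are mutually consistent, which a little arithmetic confirms. The sketch below assumes inclusive 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4851586, 4852851)
    print(length, protein_length_aa(length))  # prints "1266 421"
```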


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTAA CAGATATCAA AGTCAGAGCA GCCAAGCCAA CGGATAAGCA ATATAAGCTG 
ACTGATGGTG GCGGTATGCA TCTGCTTGTC CATCCAAATG GTTCTAAGTA CTGGCGTTTG
CAGTACCGTT ATGAGGGAAA GCAAAAAATG CTGGCACTTG GGGTTTATCC TGAAATCACA
CTAGCGGATG CCAGAGTACG TCGTGACGAG GCGCGTAAGC TGCTTGCGAA TGGCGTCGAT
CCGGGAGACA AAAAGAAAAA TGATAAGGTT GAACAGAGTA AAGCACGAAC CTTTAAAGAA
GTCGCGATTG AGTGGCATGG CACCAATAAA AAGTGGTCTG AAGATCACGC CCATCGTGTG
CTAAAAAGTC TGGAAGATAA TCTTTTTGCA GCGCTTGGTG AACGTAATAT CGCTGAGTTA
AAAACTCGAG ATTTATTAGC ACCCATTAAG GCCGTAGAAA TGTCTGGACG TCTTGAAGTG
GCCGCTCGTC TTCAGCAGCG CACTACAGCC ATCATGCGCT ATGCAGTGCA AAGTGGGTTA
ATTGATTATA ACCCGGCACA AGAGATGGCT GGGGCGGTTG CTTCCTGTAA TCGACAACAT
CGTCCCGCGC TTGAATTAAA GCGCATCCCT GAGTTGCTTA CAAAAATAGA TAGCTATACT
GGTAGGCCGC TAACCCGATG GGCGACAGAA CTCTCTTTGC TGATCTTTAT TCGGTCCAGT
GAGCTGCGTT TTGCTCGTTG GTCAGAGATC GATTTCGAAG CGTCTATATG GACTATCCCA
CCGGAGCGGG AGCCTATTCC TGGAGTGAAA CATTCCCATA GAGGCTCAAA AATGCGTACA
ACGCATCTAG TGCCTCTTTC AACGCAAGCT CTTGCAATTT TAAAGCAGAT AAAACAGTTT
TGTGGGGCCC ATGACTTGAT ATTTATTGGT GATCACGATT CGCACAAACC CATGAGTGAG
AATACGGTAA ATAGTGCGTT ACGGGTCATG GGGTATGATA CAAAAGTAGA GGTTTGTGGT
CATGGCTTTC GAACAATGGC CTGTAGTTCA TTGGTCGAAT CAGGTTTGTG GTCTCGTGAT
GCTGTTGAAC GTCAGATGAG CCACATGGAG CGAAATTCAG TGAGGGCCGC GTATATCCAT
AAAGCAGAGC ATCTGGAAGA ACGCCGCTTG ATGCTACAAT GGTGGGCCGA TTTTCTGGAT
GCAAACAGAG AAAAATTTAT CAGTCCATTT GAATATGCAA AGATTAATAA TCCATTAAAA
CAGTAA
 
Protein sequence
MALTDIKVRA AKPTDKQYKL TDGGGMHLLV HPNGSKYWRL QYRYEGKQKM LALGVYPEIT 
LADARVRRDE ARKLLANGVD PGDKKKNDKV EQSKARTFKE VAIEWHGTNK KWSEDHAHRV
LKSLEDNLFA ALGERNIAEL KTRDLLAPIK AVEMSGRLEV AARLQQRTTA IMRYAVQSGL
IDYNPAQEMA GAVASCNRQH RPALELKRIP ELLTKIDSYT GRPLTRWATE LSLLIFIRSS
ELRFARWSEI DFEASIWTIP PEREPIPGVK HSHRGSKMRT THLVPLSTQA LAILKQIKQF
CGAHDLIFIG DHDSHKPMSE NTVNSALRVM GYDTKVEVCG HGFRTMACSS LVESGLWSRD
AVERQMSHME RNSVRAAYIH KAEHLEERRL MLQWWADFLD ANREKFISPF EYAKINNPLK
Q
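
As a rough cross-check, the protein sequence can be re-derived from the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the compact TCAG-ordered amino acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons), strips the spacing used in the listing, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied with the listing's spacing.
    print(translate("ATGGCATTAA CAGATATCAA AGTCAGAGCA"))  # prints "MALTDIKVRA"
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the listed protein sequence and GC content.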