Gene EcSMS35_2277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2277 
Symbol 
ID6146630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2301700 
End bp2302908 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content49% 
IMG OID641617151 
Productphage integrase family site specific recombinase 
Protein accessionYP_001744324 
Protein GI170680457 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0249375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0747113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTAT TGACGGATAC TAAAGCAAGG CATATCAAAC CTGATGACAA ACCATTGCCC 
CATGGGGGAA TTACAGGACT GACTCTTCAT CCTTCTTCAG TAAAGGGGAG GGGTAAATGG
GTTTTTCGTT ATGTAAGTCC GGTGACACAA AAAAGGCGTA ATGCTGGATT GGGAACTTAT
CCTGAGGTCA GTATTGCTGA AGCTGCACGT ACTGCCCGGA TAATGCGAGA GCAACTTGCT
GCAGGTGATG ATCCTCTGGA GATTAAAAAG GCTGAAGCTG AAAAAGTTGT TATCCCAACA
TTTGCCGATG CAGCCAGGCG TGTACATGCA GAACTGTCTC CTGGATGGGA AAATCCAAAG
CATGTAAGGC AGTGGTTATC GACGCTTGAG AATTACGCGT TTCCTCAACT GGGAGCAAAA
ACGCTGGATT CGATTACGGC TGCGGACGTG GCAGAAACAC TGCGTCCAGT CTGGTTAACC
TTGTCAGAAA CGGCAAGCCG GGTTAAACAG CGCATTCATG TTGTTATGCA GTGGGGCTGG
GCGCATGGTT TTTGTGTGGC GAATCCTGTT GATGTGGTTG ATCATTTGCT TCCACAGCAA
TCAAGAGGAC GTGATGAACA CCAGCCGGCA ATGCCCTGGA GGCAGTTACC GCTTTTTGTG
GCGACCAGTG TGTATACAGA TGAACCTTAT AATGTTACCC GGGCACTGTT ATTAATGGTG
ATACTGACAG CAACCCGCTC GGGCGAAGCA AGGGGAATGC GCTGGGCTGA AATTGATTTT
CATAAGCGGA TATGGACGAT ACCCGCAGAA AGAATGAAAG CCAGGATACA GCATCGTGTT
CCTTTATCCC GACAGGCCAT TCACGTTCTG GAAAATATAC GTGGTCTGCA TGACGAACTG
GTGTTTCCTT CTCCCAGAAA GCAGCAGATC CTTTCAGATA TGGTGTTGAC GAGTTTTCTG
CGTAAAAAGA AGGCCATCAG TGATATACCC GGACGAGTGG CTACAGCACA TGGTTTTCGT
TCAACATTCA GGGACTGGTG TAGCGAACAG GGATATTCGC GGGATTTGGC GGAAAGGGCG
CTTGCCCATA CGCTGAAAAA TAAGGTTGAG GCGGCATATC ACCGGACTGA TCTGCTGGAT
CAGCGTATAC CGATGATGCA GGCATGGGCG GATTATGTGA TGTCTCAGAT TATGGAAAAC
CAGCGATGA
 
Protein sequence
MAVLTDTKAR HIKPDDKPLP HGGITGLTLH PSSVKGRGKW VFRYVSPVTQ KRRNAGLGTY 
PEVSIAEAAR TARIMREQLA AGDDPLEIKK AEAEKVVIPT FADAARRVHA ELSPGWENPK
HVRQWLSTLE NYAFPQLGAK TLDSITAADV AETLRPVWLT LSETASRVKQ RIHVVMQWGW
AHGFCVANPV DVVDHLLPQQ SRGRDEHQPA MPWRQLPLFV ATSVYTDEPY NVTRALLLMV
ILTATRSGEA RGMRWAEIDF HKRIWTIPAE RMKARIQHRV PLSRQAIHVL ENIRGLHDEL
VFPSPRKQQI LSDMVLTSFL RKKKAISDIP GRVATAHGFR STFRDWCSEQ GYSRDLAERA
LAHTLKNKVE AAYHRTDLLD QRIPMMQAWA DYVMSQIMEN QR