Gene EcSMS35_1150 details


Gene Information

Locus tag: EcSMS35_1150
Symbol:
ID: 6143687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1173551
End bp: 1174576
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 42%
IMG OID: 641616028
Product: phage integrase family site specific recombinase
Protein accession: YP_001743215
Protein GI: 170682438
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
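
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The sketch below is illustrative only: the function name cds_lengths is hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record above.
print(cds_lengths(1173551, 1174576))  # (1026, 341)
```

Both values agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields listed in the record.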


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000198353
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000000625111
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAAGAC GAAGGAAAAA TCCTGAACAC GAAAAATTAC CTCCAAATGT ATACCCAAAT 
AAATATAGTT ATGTATGGAA ACCAACATCC AGAGAATCTG TCACACTAAC CGCCATCAAG
GATGGTTTAG CTGCTTTATG GAAAAAGTAT GAGGAAACTG TAAATAATCG CGATCGTGCA
ATGACATTCG GTCGCTTGTG GGAAAAATTC CTCGCCAGCG CCTATTACAG TGACCTTAGT
CCAAGAACAC AAAAAGATTA TCTGCAACAT CAAAAAAAGT TGCTTGCCGT ATTCGGTAAG
GTACCAGCGG ATTCCATAAA ACCAGAACAC ATCCGTCGAT ACATGGACAA AAGAGGGGAG
CAGAGTAAAA CGCAAGCCAA CCATGAAAAA AGCAGTATGT CCCGTGTTTA CAGTTGGGGG
TATGAGCGAG GGTACGTGAA GGCTAACCCA TGTGCAGGTG TAAGTAAATT CAAGGCCAAA
AACCGCGAAC GATATGTAAC CGACAAAGAA TACCAGGCAG TATTAAGCGT TGCACCTCTT
CCTGTTTTTA TCGCAATGGA AATTGCCTAT CTGTGTGCAG CGAGGGTTTC CGATGTGTTA
TCGCTGAAAT GGGAACAGAT TGGAAACGAC GGGATATTCA TCCAGCAAGG GAAAACCGGA
AAAAAACAGA TAAAAGCATG GAGTCCACGA TTACAGGCAG CGATCGAAAA AGCAAAACAG
TTACCAAAAT CTGCCTATGT GATCAGCAAT CAATACGGCA ACCGATATAT GTACAAAGGC
TTTAACGAAA TGTGGGTAGA TGCAAGAAAT CGTGCTGGAA AAATTTCAGG TATTTTAACC
GACTTCACCT TTCATGATCT GAAGGCGAAA GGAATTTCAG ACTATGAAGG AAGCAGCCGG
GATAAGCAAC TTTTCTCTGG TCACAAAACC GAAGGGCAAG TGCTAATCTA TGACAGGAAG
GTTAAAGTTT CACCAACACT TGATGTCCCG TTACCTGAAA ATATTCCAAG AAAATATTCC
AAGTAA
 
Protein sequence
MGRRRKNPEH EKLPPNVYPN KYSYVWKPTS RESVTLTAIK DGLAALWKKY EETVNNRDRA 
MTFGRLWEKF LASAYYSDLS PRTQKDYLQH QKKLLAVFGK VPADSIKPEH IRRYMDKRGE
QSKTQANHEK SSMSRVYSWG YERGYVKANP CAGVSKFKAK NRERYVTDKE YQAVLSVAPL
PVFIAMEIAY LCAARVSDVL SLKWEQIGND GIFIQQGKTG KKQIKAWSPR LQAAIEKAKQ
LPKSAYVISN QYGNRYMYKG FNEMWVDARN RAGKISGILT DFTFHDLKAK GISDYEGSSR
DKQLFSGHKT EGQVLIYDRK VKVSPTLDVP LPENIPRKYS K
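
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch, not part of the original record: the helper names normalize, looks_like_complete_cds and gc_fraction are hypothetical, and the CDS check assumes an ATG start codon and one of the three stop codons of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq_text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(seq_text.split()).upper()


def looks_like_complete_cds(seq):
    """Check length, start codon and stop codon of a normalized CDS.

    Assumption: only ATG is accepted as a start codon here, although
    table 11 also permits alternative start codons.
    """
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] == "ATG"
        and seq[-3:] in STOP_CODONS
    )


def gc_fraction(seq):
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input in the same block layout as the record (not the real gene).
toy = normalize("ATGGGAAGAC GA\nTAA")
print(toy, looks_like_complete_cds(toy))
```

To apply this to the gene itself, paste the full "Gene sequence" block into normalize and compare the result of gc_fraction with the GC content field.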