Gene EcSMS35_2228 details

Gene Information

Locus tag: EcSMS35_2228
Symbol:
ID: 6147231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2247888
End bp: 2249231
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 53%
IMG OID: 641617104
Product: recombination factor protein RarA
Protein accession: YP_001744278
Protein GI: 170682500
COG category: [L] Replication, recombination and repair
COG ID: [COG2256] ATPase related to the helicase subunit of the Holliday junction resolvase
TIGRFAM ID:
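
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 2247888
END_BP = 2249231


def gene_length(start, end):
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(n_bp, includes_stop=True):
    """Number of amino acids encoded by a CDS of n_bp bases.

    Assumes an unspliced CDS whose length is a multiple of three.
    """
    codons, remainder = divmod(n_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1344 447
```

Both results agree with the Gene Length and Protein Length fields.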


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.331515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAATC TGTCGCTCGA TTTTTCGGAT AATACTTTTC AACCTCTGGC CGCGCGTATG 
CGGCCAGAAA ATTTAGCACA GTATATCGGC CAGCAACATT TGCTGGCTGC GGGGAAGCCG
TTGCCGCGCG CTATCGAAGC CGGGCATTTG CATTCTATGA TCCTCTGGGG GCCGCCGGGT
ACCGGCAAAA CAACCCTCGC TGAAGTGATT GCCCGCTATG CGAACGCTGA TGTGGAACGT
ATTTCTGCCG TCACCTCTGG TGTGAAAGAG ATTCGCGAGG CGATCGAGCG CGCTCGGCAA
AACCGCAATG CAGGTCGCCG CACTATTCTT TTTGTTGACG AAGTTCACCG TTTCAACAAA
AGCCAGCAGG ATGCATTTCT GCCACATATT GAAGACGGCA CCATCACTTT TATTGGCGCA
ACCACTGAAA ACCCGTCGTT TGAGCTTAAT TCGGCACTGC TTTCCCGTGC CCGTGTCTAT
CTGTTGAAAT CCCTGAGTAC AGAGGATATT GAGCAAGTAC TGACTCAGGC GATGGAAGAC
AAAACCCGTG GCTATGGTGG TCAGGATATT GTTCTGCCAG ATGAAACACG ACGCGCCATT
GCTGAACTGG TGAATGGCGA CGCGCGCCGG GCGTTAAATA CGCTGGAAAT GATGGCGGAT
ATGGCCGAAG TCGATGATAG CGGTAAGCGG GTCCTGAAAC CTGAATTACT GACCGAAATC
GCCGGTGAAC GTAGCGCCCG CTTTGATAAC AAAGGCGATC GCTTTTACGA TCTGATTTCC
GCATTGCATA AGTCGGTACG TGGTAGCGCA CCCGATGCGG CGCTGTACTG GTATGCGCGA
ATTATTACCG CCGGTGGCGA TCCGTTATAT GTTGCGCGTC GTTGCCTGGC AATTGCTTCC
GAAGATGTCG GCAATGCCGA CCCACGGGCG ATGCAGGTGG CAATTGCGGC CTGGGATTGC
TTTACTCGCG TTGGCCCGGC GGAAGGTGAA CGCGCCATTG CTCAGGCGAT TGTTTACCTG
GCCTGCGCGC CAAAAAGCAA CGCTGTCTAC ACCGCGTTTA AAGCCGCGCT GGCCGATGCT
CGCGAACGTC CCGATTATGA CGTGCCGGTT CATTTGCGTA ATGCGCCGAC GAAATTAATG
AAGGAAATGG GCTACGGGCA GGAATATCGT TACGCTCATG ATGAAGCAAA CGCTTATGCT
GCCGGTGAGG TTTATTTCCC GCCGGAAATA GCACAAACAC GCTATTATTT CCCGACAAAC
AGGGGCCTTG AAGGCAAGAT TGGCGAAAAG CTCGCCTGGC TGGCTGAACA GGATCAAAAT
AGCCCCATAA AACGCTACCG TTAA
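
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below join the blocks and compute a GC percentage that can be compared with the GC content field when applied to the full sequence. The names `join_blocks` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = join_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGAGCAATC TGTCGCTCGA TTTTTCGGAT AATACTTTTC AACCTCTGGC CGCGCGTATG"

print(len(join_blocks(first_line)))      # number of bases on the line
print(round(gc_percent(first_line), 1))  # GC percentage of that line only
```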
 
Protein sequence
MSNLSLDFSD NTFQPLAARM RPENLAQYIG QQHLLAAGKP LPRAIEAGHL HSMILWGPPG 
TGKTTLAEVI ARYANADVER ISAVTSGVKE IREAIERARQ NRNAGRRTIL FVDEVHRFNK
SQQDAFLPHI EDGTITFIGA TTENPSFELN SALLSRARVY LLKSLSTEDI EQVLTQAMED
KTRGYGGQDI VLPDETRRAI AELVNGDARR ALNTLEMMAD MAEVDDSGKR VLKPELLTEI
AGERSARFDN KGDRFYDLIS ALHKSVRGSA PDAALYWYAR IITAGGDPLY VARRCLAIAS
EDVGNADPRA MQVAIAAWDC FTRVGPAEGE RAIAQAIVYL ACAPKSNAVY TAFKAALADA
RERPDYDVPV HLRNAPTKLM KEMGYGQEYR YAHDEANAYA AGEVYFPPEI AQTRYYFPTN
RGLEGKIGEK LAWLAEQDQN SPIKRYR
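
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified translator, not the method used to produce this record: the codon table is built from the standard amino acid assignments, which table 11 shares, and `START_CODONS` lists only a subset of the initiators table 11 permits. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Subset of the initiation codons allowed by table 11 (assumption: partial list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds):
    """Translate a CDS, reading an alternative start codon as M."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("GTGAGCAATC TGTCGCTCGA TTTTTCGGAT"))  # prints: MSNLSLDFSD
```

The output matches the first block of the protein sequence.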