Gene EcSMS35_4315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4315 
Symbol 
ID6144994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4412447 
End bp4414732 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content53% 
IMG OID641619136 
Productreplication gene A protein 
Protein accessionYP_001746260 
Protein GI170681873 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.908807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000153162 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGTTA AAGCCTCCGG GCGTTTTGTC CCTCCGTCAG CATTTGCCGC AGGCACCGGT 
AAGATGTTTA CCGGTGCTTA TGCATGGAAC GCGCCACGCG AGGCCGTCGG GCGCGAAAGA
CCCCTTACAC GTGATGAGAT GCGTCAGGTG CAAGGTGTTT TATCCACGAT TAACCGCCTG
CCTTACTTTT TGCGCTCGCT GTTTACTTCA CGCTATGACT ACATCCGGCG CAATAAAAGC
CCGGTGCACG GGTTTTATTT CCTCACATCC ACTTTTCAGC GTCGTTTATG GCCGCGCATT
GAGCGTGTGA ATCAGCGCCA TGAAATGAAC ACCGACGCGT CGTTGCTGTT TCTGGCAGAG
CGTGACCACT ATGCGCGCCT GCCGGGAATG AATGACAAGG AGCTGAAAAA GTTTGCCGCC
CGTATCTCAT CGCAGCTTTT CATGATGTAT GAGGAACTCT GCGATGCCTG GGTGGATGCC
CATGGCGAAA AAGAATCGCT GTTTACGGAT GAGGCGCAGG CTCACCTGTA TGGTCATGTT
GCTGGCGCTG CACGTGCTTT CAATATTTCC CCGCTGTACT GGAGAAAATA CCGTAAAGGG
CAGATGACCA CGAGGCAGGC ATATTCTGCC ATTGCCCGCC TGTTTAACGA TGAGTGGTGG
ATTAGTCAGC TTAAAGGCCA GCGTATGCGC TGGCATGAGG CGTTACTGAT TGCTGTCGGG
GAGGTCAATA AAGACCGTTC ACCTTATGCC AGTAAACATG CCATTCGTGA TGTGCGTGCG
CGCCGCCAGG CAAATCTGGA ATTTCTTAAA TCGTGTGACC TTGAAAACAG GGAAACCGGC
GAGCGCATCG ACCTTATCAG TAAGGTGATG GGCAGTATTT CTAATCCTGA AATTCGCCGG
ATGGAGCTGA TGAACACCAT TGCCGGTATT GAGCGTTACG CCGCTGCAGA GGGTGATGTG
GGGATGTTTA TCACGCTGAC CGCGCCGTCA AAGTATCACC CGACACGTCA GGTAGGAAAA
GGCGAAAGTA AAACCGTGCA GCTTAATCAC GGCTGGAACG ATGAGGCATT TAATCCAAAG
GATGCGCAGC GTTATCTCTG CCGTATCTGG AGCCTGATGC GCACGGCATT CAAGGATAAT
GATTTACAGG TCTACGGTTT GCGTGTCGTC GAGCCACACC ACGACGGAAC GCCGCACTGG
CATATGATGC TTTTTTGTAA TTCGCGCCAG CGTAACCAGA TTATCGAAAT CATGCGTCGC
TATGCGCTCA AAGAGGATGG CGACGAAAGA GGAGCCGCGC GAAACCGTTT TCAGGCAAAA
CACCTTAATC GGGGCGGTGC TGCGGGGTAT ATCGCGAAAT ACATTTCAAA AAACATCGAC
GGCTATGCAC TGGATGGTCA GCTCGATAAC GATACCGGCA GGCCGCTGAA AGACACTGCC
GCGGCTGTTA CCGCATGGGC GTCAACGTGG CGCATTCCGC AATTTAAAAC GGTTGGCCTA
CCGACAATGG GGGCTTACCG TGAACTACGC AAATTACCTC GCGGCGTCAG TATTGCTGAT
GAGTTTGACG AACGCGTCGA GGCTGCTCGC GCTGCCGCAG ACAGTGGTGA TTTTGCGTTG
TATATCAGCG CGCAGGGTGG GGCAAATGTC CCGCGCGATT GTCAGACTGT CAGGGTTGCC
CGTAGCCCGT CGAGTGACGT TAACGAGTAC GAGGAAGAAG TCGAGAGAGT GGTCGGTATT
TACGCGCCGC ATCTCGGCGC GCGTCATATT CATATCACCA GAACGACGGA CTGGCGCATT
GTGCCGAAAG TTCCGGTCGT TGAGCCTTTG ACTTTAAAAA GCGGCATCGC CGCGCCTCGG
AGTCCTGTCA ATAACTGTGG AAAGCTCACC GGCAGTGATA CTTCGTTACC GGCTCCCACA
CCTTATGAAC ATGCCGCAGC CGTGCTTAAT CTGGTTGATG ACGGTGTTAT CGAATGGAAT
GAGCCAGAGG TCGTGAGGGC GCTCAGAGGT GCATTAAAAC ACGAACTGAG AACACCAAAT
CGTCAGCAGA GAAACGGAAG CCCGTTAAAA CCACATGAAA TAGCGCCATC GGCCAGACTG
ACCCGGTCGG AACGAACGCA AATTACCCGT ATCCGCGTTG ACCTTGCTCA GAACGGTATC
AGGCCGCAGC GATGGGAGCT TGAGGCGCTG GCGCGTGGCG CGACCGTAAA TTATGACGGG
AAAAAATTCA CGTATCCGGT CGCTGATGAG TGGCCGGGAT TCTCAACAGT AATGGAGTGG
ACATAA
 
Protein sequence
MAVKASGRFV PPSAFAAGTG KMFTGAYAWN APREAVGRER PLTRDEMRQV QGVLSTINRL 
PYFLRSLFTS RYDYIRRNKS PVHGFYFLTS TFQRRLWPRI ERVNQRHEMN TDASLLFLAE
RDHYARLPGM NDKELKKFAA RISSQLFMMY EELCDAWVDA HGEKESLFTD EAQAHLYGHV
AGAARAFNIS PLYWRKYRKG QMTTRQAYSA IARLFNDEWW ISQLKGQRMR WHEALLIAVG
EVNKDRSPYA SKHAIRDVRA RRQANLEFLK SCDLENRETG ERIDLISKVM GSISNPEIRR
MELMNTIAGI ERYAAAEGDV GMFITLTAPS KYHPTRQVGK GESKTVQLNH GWNDEAFNPK
DAQRYLCRIW SLMRTAFKDN DLQVYGLRVV EPHHDGTPHW HMMLFCNSRQ RNQIIEIMRR
YALKEDGDER GAARNRFQAK HLNRGGAAGY IAKYISKNID GYALDGQLDN DTGRPLKDTA
AAVTAWASTW RIPQFKTVGL PTMGAYRELR KLPRGVSIAD EFDERVEAAR AAADSGDFAL
YISAQGGANV PRDCQTVRVA RSPSSDVNEY EEEVERVVGI YAPHLGARHI HITRTTDWRI
VPKVPVVEPL TLKSGIAAPR SPVNNCGKLT GSDTSLPAPT PYEHAAAVLN LVDDGVIEWN
EPEVVRALRG ALKHELRTPN RQQRNGSPLK PHEIAPSARL TRSERTQITR IRVDLAQNGI
RPQRWELEAL ARGATVNYDG KKFTYPVADE WPGFSTVMEW T