Gene EcSMS35_0142 details

Gene Information

Locus tag: EcSMS35_0142
Symbol:
ID: 6143098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 157021
End bp: 157941
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 49%
IMG OID: 641615043
Product: ISNCY family transposase
Protein accession: YP_001742259
Protein GI: 170682409
COG category: [S] Function unknown
COG ID: [COG5464] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01784] conserved hypothetical protein (putative transposase or invertase)
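
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from the record above.
length = gene_length(157021, 157941)
print(length, protein_length(length))
```

Under these assumptions the script prints 921 and 306, which matches the Gene Length and Protein Length fields.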


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCAC CGAGTACCAC ACCGCATGAT GCGGTGTTTA AACAATTTTT AATGCATGCG 
GAAACGGCTC GTGACTTTCT GGATATCCAT TTGCCAGCGG AACTACGCGA ACTGTGTGAC
CTCGACACGC TGCATCTTGA GTCGGGGAGT TTTATTGAAG AAAGCCTGAA AGGGCACAGC
ACTGACGTGC TCTATTCCGT GCAAATGCAG GGTAATACGG GCTATCTACA TGTTGTAATT
GAACACCAAA GCAAGCCGGA CAAAAAAATG GCCTTTCGCA TGATGCGTTA TTCTATTGCT
GCCATGCACC GGCATCTGGA GGCAGATCAC GATAAGCTGC CGCTGGTGGT GCCGATTTTG
TTTTATCAGG GCGAGGCCAC GCCTTATCCA CTCTCAATGT GCTGGTTTGA TATGTTTTAC
TCGCCGGAGC TGGCGCGACG CGTCTATAAC AGTCCTTTCC CGCTGGTGGA TATCACTATC
ACACCGGATG ACGAAATCAT GCAACATCGG CGGATTGCGA TTCTCGAACT GCTGCAAAAA
CATATTCGCC AACGCGACTT AATGTTATTG CTGGAGCAAC TGGTCACGCT GATAGACGAA
GGGTACACTA GCGGAAGTCA GTTAGTTGCC ATGCAAAACT ATATGCTGCA ACGCGGTCAT
ACTGAACAAG CGGATTTGTT TTATGGTGTG CTGAGAGACA GGGAAACGGG AGGGGAGTCT
ATGATGACGC TGGCGCAGTG GTTTGAAGAG AAGGGAAGAC AGGAGGAAAG GCAGGAGGTA
AGACAGGAGG TAATACAAGA GGTTAGACAG GAAGTAAGAC AGGAATTCGC CCTGCGTTTT
CTGAGTAAAG GGATGTCTCG GGAAGACGTT GCAGAGATGG CAAATTTACC TCTTGCTGAG
GTTGATAAGC TGATTAGCTA A
 
Protein sequence
MDAPSTTPHD AVFKQFLMHA ETARDFLDIH LPAELRELCD LDTLHLESGS FIEESLKGHS 
TDVLYSVQMQ GNTGYLHVVI EHQSKPDKKM AFRMMRYSIA AMHRHLEADH DKLPLVVPIL
FYQGEATPYP LSMCWFDMFY SPELARRVYN SPFPLVDITI TPDDEIMQHR RIAILELLQK
HIRQRDLMLL LEQLVTLIDE GYTSGSQLVA MQNYMLQRGH TEQADLFYGV LRDRETGGES
MMTLAQWFEE KGRQEERQEV RQEVIQEVRQ EVRQEFALRF LSKGMSREDV AEMANLPLAE
VDKLIS
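
As a spot check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal stand-alone translator, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and
# translation table 11, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases and last 21 bases of the gene sequence above.
print(translate("ATGGATGCAC CGAGTACC"))
print(translate("GTTGATAAGC TGATTAGCTA A"))
```

The two calls print MDAPST and VDKLIS, which agree with the first and last six residues of the protein sequence. Passing the full gene sequence to `translate` allows the same comparison over all 306 residues.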