Gene EcSMS35_0792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0792 
Symbol 
ID6144585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp792770 
End bp793822 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content49% 
IMG OID641615680 
Producthypothetical protein 
Protein accessionYP_001742872 
Protein GI170683357 
COG category[S] Function unknown 
COG ID[COG2828] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.533425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TACCCTGCGT GATGATGCGA GGTGGAACCT CGAGGGGCGC GTTCCTGTTA 
GCGGAACATC TACCCGAAGA TCAAACGCAG CGCGATAAAA TATTGATGGC AATTATGGGT
TCCGGTAACG ATCTGGAAAT TGACGGTATT GGCGGCGGTA ATCCGCTGAC CAGTAAAGTC
GCCATTATTA GCCGCTCCAG CGATCCGCAT GCAGATGTCG ATTATCTGTT TGCTCAAGTT
ATCGTCCATG AGCAACGTGT CGATACCACG CCTAACTGCG GCAATATGCT GTCTGGCGTT
GGGGCATTCG CCATTGAAAA TGGTTTGATT GCAGCGACTT CGCCAGTTAC CCGCGTACGT
ATCCGCAACG TCAATACGGG TACGTTCATC GAAGCTGATG TGCAAACGCC AAATGGTGTT
GTCGAGTACG AGGGTAGCGC CAGAATTGAC GGCGTACCGG GTACTGCCGC ACCGGTTGCG
CTCACTTTCC TGAATGCCGC CGGAACGAAA ACCGGAAAAG TCTTCCCGAC AGATAATCAG
ATTGATTATT TTGACGATGT CCCGGTGACC TGTATCGATA TGGCGATGCC AATTGTCATT
ATTCCTGCTG AATATCTGGG AAAAACAGGC TATGAATTAC CGGCAGAGCT GGATGCCGAC
AAAGCATTAT TAGCCCGCAT TGAATCTATC CGTCTACAAG CGGGTAAAGC AATGGGCTTA
GGCGATGTCA GTAATATGGT TATCCCTAAA CCTGTACTTA TTTCTCCAGC GCAGAAAGGC
GGGGCAATTA ATGTGCGTTA TTTTATGCCG CATTCTTGCC ATCGCGCTTT GGCGATAACC
GGTGCTATTG CTATTTCCAG TAGTTGTGCA TTGGAAGGCA CCGTCACCCG ACAAATCGTC
CCTTCTGTAG GATACGGCAA TATCAATATT GAACACCCCA GTGGTGCGCT CGACGTTCAT
TTAAGTAATG AAGGTCAGGA TGCCACGACG TTACGCGCAT CTGTTATTCG GACGACCAGA
AAAATATTTT CCGGTGAAGT TTATCTTCCC TGA
 
Protein sequence
MKKIPCVMMR GGTSRGAFLL AEHLPEDQTQ RDKILMAIMG SGNDLEIDGI GGGNPLTSKV 
AIISRSSDPH ADVDYLFAQV IVHEQRVDTT PNCGNMLSGV GAFAIENGLI AATSPVTRVR
IRNVNTGTFI EADVQTPNGV VEYEGSARID GVPGTAAPVA LTFLNAAGTK TGKVFPTDNQ
IDYFDDVPVT CIDMAMPIVI IPAEYLGKTG YELPAELDAD KALLARIESI RLQAGKAMGL
GDVSNMVIPK PVLISPAQKG GAINVRYFMP HSCHRALAIT GAIAISSSCA LEGTVTRQIV
PSVGYGNINI EHPSGALDVH LSNEGQDATT LRASVIRTTR KIFSGEVYLP