Gene EcSMS35_4500 details


Gene Information

Locus tag: EcSMS35_4500
Symbol:
ID: 6143773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4598294
End bp: 4599832
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 32%
IMG OID: 641619316
Product: hypothetical protein
Protein accession: YP_001746428
Protein GI: 170680249
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
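
The coordinate and length fields above can be checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the function name `check_cds_record` is hypothetical and not part of any database API.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record.

    Assumes inclusive, 1-based coordinates and one stop codon that is
    not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # a complete CDS is a whole number of codons
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
result = check_cds_record(4598294, 4599832, 1539, 512)
print(result)
```

With the values listed in this record, all three checks hold: 4599832 - 4598294 + 1 = 1539 bp, and 1539 / 3 = 513 codons, one of which is the stop codon.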


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATG ATGTGGTGCA AGGAAATAAT AAATTAGATC TTGATTTACT ACGTAATTTT 
AATGGGGTGC CAGGTTTAAA TAGAGATAAC TTTATTTATA TCAGTGATAT ATTTTTAAAT
ATAAAACAAC GGAACGAAAA AAATCATGCA ATAAATATGT TTCGTGAAGT CTCAATCAGT
AATGATATTA TAAGCGTAAA ATTTTATAGA AATGAAGAAA TAGAATGTGC TTGTGATTTT
CTGATGGATA AAGATGCGCA GGGGTATACT GACCTGTCTG ATTTGGATTT AACAAGTTGT
CATTTTAAAG GTGACGTTAT TTCGAAGGTG TCTTTCATAT CATCAAATCT ACAACATGTA
ACATTCGAAT GTAAAGAAAT AGGGGATTGC AATTTTACTA CTGCAACAGT TGATAATGTC
ATATTTAAAT GTCGACGTTT ACACAATGTG ATTTTTATCA AAGCGACTGG CGAATATGTC
GATTTTAGCC AAAGTATTCT TGATACAGTT GACTTCTCGC GGAGCCAGCT TACTCATAGT
AATTTTCGTG AATGTCAGAT TAGAAATTCA AAGTTCAATA ATTGTTATCT TTATGCTTCG
CACTTCACCA GAGCAGAATT TCTTTCTACC AAAGAAATAT CATTTATTAA ATCGAATCTG
ACAGCTGTTA TGTTTGATCA TGTGCGAATG TCGACAGGGA ATTTTAAAGA TTGCATTACA
GAACGATTGG AATTAACTAT TGATTATTCA GATATATTTG GGAATGAAGA ACTTGATGGT
TATATCAATA ACATTATAAA AATGATTGAT ACATTGCCAG ATAATGCAAT GATATTGAAA
TCCGTTCTGG CAGTAAAACT GGTGATGCAA TTAAAAATTC TTAATATTGT TAATAAAAAC
TTTATTGAGA ATATGAAGAA AATATTTAGC CATTGTCCTT ATATAAAAGA TCCAATTATA
CGTAGTTATA TCCATCCTGA TGAAGATAAC AAGTTCGATA ATTTTATGCG TCAAAATCGA
TTCAGTAAGA TGCATTTCGA TACCCAACAG ATGATCGATT TTATTAACAG ATTTAATATG
AATAAATGGC TGATTGATCA AAATAACAAT TTTTTTATCC AACTTATCGA TCAGGCTCTA
CGATCAACGG ATGATACGAT CAAAGCAAAT GCCTGGCATC TTTATAAAGA GTGGATTCGT
AGTGATGATG TTTCACCTTT ATTTATAGAA ATTGAAGATA ATTTAAGAAC CTTTAACACG
AATGAATTAA CACGAAATGA TAATATCTTT ATCCTGTTCT CCTCTGTCGA TGATGGGCCA
GTTATGGTGG TAAGCTCCCA GCGCTTACAT GATATGTTGA ATCCTACAAA AGATACCAAT
TGGAATTCCA CGTATATCTA CAAATCCAGA CATGAGATGT TGCCTGTTAA TCTTACTCCG
GAAACACTTT TCGGCTCCAA ATCTCAGGAT AAACATGCGC TTTTCCCCAT TTTTACTGCG
AGTTGGCGAG CTAATCGTAT AAAGAATAAA GGTATTTAA
 
Protein sequence
MKHDVVQGNN KLDLDLLRNF NGVPGLNRDN FIYISDIFLN IKQRNEKNHA INMFREVSIS 
NDIISVKFYR NEEIECACDF LMDKDAQGYT DLSDLDLTSC HFKGDVISKV SFISSNLQHV
TFECKEIGDC NFTTATVDNV IFKCRRLHNV IFIKATGEYV DFSQSILDTV DFSRSQLTHS
NFRECQIRNS KFNNCYLYAS HFTRAEFLST KEISFIKSNL TAVMFDHVRM STGNFKDCIT
ERLELTIDYS DIFGNEELDG YINNIIKMID TLPDNAMILK SVLAVKLVMQ LKILNIVNKN
FIENMKKIFS HCPYIKDPII RSYIHPDEDN KFDNFMRQNR FSKMHFDTQQ MIDFINRFNM
NKWLIDQNNN FFIQLIDQAL RSTDDTIKAN AWHLYKEWIR SDDVSPLFIE IEDNLRTFNT
NELTRNDNIF ILFSSVDDGP VMVVSSQRLH DMLNPTKDTN WNSTYIYKSR HEMLPVNLTP
ETLFGSKSQD KHALFPIFTA SWRANRIKNK GI
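
The sequence blocks above can be checked against the protein listing and the GC content field with a few lines of code. This is a minimal sketch, assuming the 10-character spacing is display formatting only and the sequence is given in coding orientation. The helper names (`clean`, `gc_percent`, `translate`) are illustrative. Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon map is used here; the alternative start codons that table 11 permits are not handled, which does not matter for this gene because it begins with ATG.

```python
# Standard codon map, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGAAACATG ATGTGGTGCA AGGAAATAAT AAATTAGATC TTGATTTACT ACGTAATTTT"
print(translate(first_line))  # MKHDVVQGNNKLDLDLLRNF
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein listing and the GC content field.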