Gene EcSMS35_4822 details

Gene Information

Locus tag: EcSMS35_4822
Symbol:
ID: 6145041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4909284
End bp: 4910387
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 38%
IMG OID: 641619626
Product: hypothetical protein
Protein accession: YP_001746733
Protein GI: 170681364
COG category:
COG ID:
TIGRFAM ID:
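
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 4909284
END_BP = 4910387


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1104
print(protein_length(length_bp))  # 367
```

Both values agree with the Gene Length and Protein Length fields of this record.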


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAT CTATCGACAT TTCAGAACTT ATTCAATTAG GGAAGAAAAT GTTACCAGAA 
GGAGTCGATT TTTTTCTGGA TGAATCCCCT ATTGACTTTG ATCCTATAGA TATTGAGTTA
TCCACGGGTA AAGAAGTTAG TATCGAAGAT CTTGACCCTG GTAGCGGGCT AATCTCTTAT
CATGGCCGCC AGGTTCTTTT ATATATTAGG GACCATTCAG GGCGTTATGA TGCGGCTATC
ATCGATGGCG AAAAAGGAAA ACGTTTTCAT ATTGCCTGGT GCAGAACTCT TGATGAAATG
CGCCATAAAA ATCGATTTGA AAGGTATCAT GCAACTAACC GCATAGATGG TTTATTCGAA
ATTGATGATG GTTCAGGCCG GAGCCAGGAT GTTGATTTAC GGGTATGTAT GAATTGTCTG
GAACGACTTA ATTACAAAGG AAGTATTGAT AAACAAAGGA AAAGAGAGAT TTTTAAATCA
TTCTCATTAA ATGAGTTTTT TTCAGATTAT AGTACCTGTT TTCGTCATAT GCCTAAGGGT
ATCTATGACA AAACAAATAG TGGGTATGTC GAAAACTGGA AAGATATATC AAAATCAATA
CGAGAAAAGG CCAAGTATAC TTGTAATGAT TGTGGTGTGA ATTTATCAAC CGCCAAAAAC
TTGTGCCATG TCCATCATAA AAATGGCATC AAATATGATA ATCACCATGA AAACCTTCTT
GTTCTGTGTA AGGATTGCCA TCGTAAACAG CCCCTCCATG AAGGTATATT CGTTACCCAA
GCTGAGATGG CTATCATTCA ACGTTTACGT TCCCAACAAG GGTTATTAAA AGCCGAATCC
TGGAATGAAA TATATGACCT GACTGATCCA TCAGTACATG GTGATATTAA TATGATGCAA
CATAAAGGCT TTCAACCTCC TGTTCCTGGG TTAGATCTTC AAAACTCAGA ACATGAAATT
ATTGCAACCG TAGAAGCAGC ATGGCCAGGC CTTAAAATTG CAGTTAACCT TACTCCCGCC
GAAGTCGAAG GATGGAGAAT ATATACCGTG GGTGAGCTGG TTAAAGAAAT ACAAACAGGA
GCCTTTACGT CAGCAACGTT GTAA
 
Protein sequence
MKLSIDISEL IQLGKKMLPE GVDFFLDESP IDFDPIDIEL STGKEVSIED LDPGSGLISY 
HGRQVLLYIR DHSGRYDAAI IDGEKGKRFH IAWCRTLDEM RHKNRFERYH ATNRIDGLFE
IDDGSGRSQD VDLRVCMNCL ERLNYKGSID KQRKREIFKS FSLNEFFSDY STCFRHMPKG
IYDKTNSGYV ENWKDISKSI REKAKYTCND CGVNLSTAKN LCHVHHKNGI KYDNHHENLL
VLCKDCHRKQ PLHEGIFVTQ AEMAIIQRLR SQQGLLKAES WNEIYDLTDP SVHGDINMMQ
HKGFQPPVPG LDLQNSEHEI IATVEAAWPG LKIAVNLTPA EVEGWRIYTV GELVKEIQTG
AFTSATL
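
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal Python sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 is understood to share with the standard code (the two differ in permitted start codons). The helper name `translate` is hypothetical, and only the first 60-base row of the gene sequence is used as input.

```python
# Codon table built in TCAG order; the amino-acid string lists the
# standard assignments for TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGAAATTAT CTATCGACAT TTCAGAACTT ATTCAATTAG GGAAGAAAAT GTTACCAGAA"
print(translate(first_row))  # MKLSIDISELIQLGKKMLPE
```

The output matches the first 20 residues of the protein sequence listed above.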