Gene EcSMS35_4744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4744 
Symbol 
ID6145960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4843468 
End bp4844970 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content53% 
IMG OID641619559 
Producthypothetical protein 
Protein accessionYP_001746667 
Protein GI170681088 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0318785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.98435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTGTTTTT ACTGCCGGGA 
ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC TGTCACGTTG
CAAAAACTGG CAGAGTCATT GTCGGAAATC GGTGTTCCGG TGTTTATGGC CGATGTGAAA
GGCGATCTGA CCGGCGTCGA GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CGCAAGGCTT
AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCCAATC CGGTGGTGGT GTGGGATATC
TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCGG ATCTGGGGCC GCTGTTGCTG
GCACGACTGT TGAATCTCAA CGATGTGCAA TCTGGCGTGC TGAATATCAT TTTCCGCATT
GCTGACGATC AGGGATTGTT GCTGCTCGAC TTTAAAGATC TGCGGGCAAT TACCCAGTAC
ATCGGCGATA ACGCCAAATC CTTCCAGAAT CAGTACGGAA ATATCAGTAG CGCATCGGTT
GGTGCCATCC AGCGCGGATT ACTGTCGCTG GAACAGCAAG GCGCAGCACA CTTCTTTGGT
GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATG CCAACGGTAA AGGCGTTATC
AATATCCTCA GCGCCGAGAA ACTTTATCAG ATGCCGAAAC TGTACGCCGC CAGCCTGCTG
TGGATGCTTT CAGAATTGTA TGAACAATTG CCTGAAGCGG GCGATCTGGA GAAACCAAAA
CTGGTGTTTT TCTTCGACGA GGCACATCTG CTGTTTAACG ATGCACCGCA GGTACTGCTG
GATAAGATTG AGCAGGTGAT AAGGCTTATT CGCTCAAAAG GCGTGGGCGT CTGGTTCGTT
TCGCAAAACC CGTCTGATAT TCCGGATAAT GTGCTCGGGC AGCTAGGTAA TCGCGTTCAG
CACGCTTTGC GGGCTTTTAC GCCAAAAGAT CAGAAAGCAG TGAAGGCAGC GGCGCAAACC
ATGCGGGCCA ATCCGGCATT TGATACCGAA AAGGCAATCC AGGAACTGGG GACCGGCGAG
GCGTTAATCT CGTTTCTCGA TGCAAAAGGA AGTCCTTCTG TGGTGGAACG GGCGATGGTG
ATCGCACCTT GTTCGCGAAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CCTGATTAAT
CACTCTCCGG TGTATGGCAA ATATGAAGAT GAGGTGGACC GCGAGTCCGC CTATGAGATG
CTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCCCCCGT GAAAGGTAAA
GAGGTGGCGG TGGATGACGG TATTCTTGGT GGATTGAAGG ATATTTTGTT TGGCACTACC
GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCGAAAAG CGCCGCTCGC
CAGGTGACGA ATCAGATTGT ACGCGGGATG TTGGGGAGTT TGCTGGGGGG GAGAAAAAGG
TAA
 
Protein sequence
MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK 
GDLTGVEQAG TASEKLLARL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL
ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV
GAIQRGLLSL EQQGAAHFFG EPMLDIKDWM RTDANGKGVI NILSAEKLYQ MPKLYAASLL
WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV
SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE
ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM
LQKGFQASTE QQNNPPVKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR
QVTNQIVRGM LGSLLGGRKR