Gene EcSMS35_1174 details


Gene Information

Locus tag: EcSMS35_1174
Symbol:
ID: 6143132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1186534
End bp: 1187589
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 54%
IMG OID: 641616052
Product: hypothetical protein
Protein accession: YP_001743239
Protein GI: 170680695
COG category:
COG ID:
TIGRFAM ID:
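The start, end, gene length, and protein length reported above are mutually consistent, and a short sketch can make that arithmetic explicit. It assumes the coordinates are 1-based and inclusive (which the reported length suggests) and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1186534, 1187589)
print(length, protein_length(length))  # 1056 351
```

With the values from this record, the sketch reproduces the listed 1056 bp and 351 aa.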


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000255184
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000402769
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GGCTGGTGAT CGTTAAGCCG 
GGCCGTGAAT CCATGCCGGT ATTCCACAAT ACCCGGGTAC TGGTGGAGCC GGAACCGAAA
AGCATGCGCG GTCTGCCGTC CGGAGTCGTC CCTGCCGTTC GCCAGCCGCT GGCGGAGGAT
AAATCATTAC TGCCATTTTT CAGCGATGAG CGGGTGATTC GTGCTGCTGG CGGCGCTGGG
GCACTGTCTG ACTGGCTGTT GCGTCATGTC AAATCCTGCC AGTGGCCTCA TGGTGACTAT
CATCACAGTG AAATCGTCAT ACATCGTTAC GGTACCGGCG CGATGGTGTT GTGCTGGCAC
TGCGACAACC AGTTGCGTGA CCAGACATCC GAATCACTCG GGCAACTTGC TCATCAAAAC
CTGTCAGCAT GGATGATTGA CGTCATACGC CATGCAATGA ATGGCACGCA GGAGCGGGAA
TTGTCGCTGG CTGAATTATC CTGGTGGGCG GTCTGCAATC AGGTGGCGGA CGCGCTTCCG
GAGGCAGTAT TACGTCGTTC TCTGGGGTTA CGTGCGGAAA AAATCCGCTC CTTGTACCGC
GAAAGCGACA TCGTACCGGG AGAGCAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA
AATCTTGCGC CGTTGCCTCA TGCCCACCAG CAAAACCCGC CACAGGAAAA GACGGTGGTC
AGCATTGCCG TTGATCCGGA GTCACCGGCT CAGTATCTCC AGCGCCAGAA ACCACAACGG
GAAGAGATGC CTGTATACAC GCGTTGGGTA AAAACGCAGA AATGCATGAC GTGCGGTAAT
CAGGCAGATG ATCCGCATCA CATCATTGGT CATGGACTGG GAGGGATGGG AACAAAGGCT
GATGATTTGT TTGTTATTCC GCTGTGCCGT AAATGTCATA ACGAACTGCA CGCCGGGGTA
AAAGATTTTG AAGAAAAACA CGGCAGCCAG CTGTTGTTGC TGATTCGTTT TTTAATGCAC
GCGAGAAATT CGGGTGTCCT GAAGTGGAAA GCATGA
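The gene sequence is displayed in groups of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how a GC percentage such as the one in the Gene Information section could be computed. The helper names are hypothetical, and only the first displayed line is used here for illustration, so the printed percentage applies to that line alone, not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as displayed in the record.
first_line = "GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GGCTGGTGAT CGTTAAGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then `gc_percent` would give the figure for the entire gene.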
 
Protein sequence
MRVLLRPVLV PELGLVIVKP GRESMPVFHN TRVLVEPEPK SMRGLPSGVV PAVRQPLAED 
KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWPHGDY HHSEIVIHRY GTGAMVLCWH
CDNQLRDQTS ESLGQLAHQN LSAWMIDVIR HAMNGTQERE LSLAELSWWA VCNQVADALP
EAVLRRSLGL RAEKIRSLYR ESDIVPGEQT ATSILKQRTK NLAPLPHAHQ QNPPQEKTVV
SIAVDPESPA QYLQRQKPQR EEMPVYTRWV KTQKCMTCGN QADDPHHIIG HGLGGMGTKA
DDLFVIPLCR KCHNELHAGV KDFEEKHGSQ LLLLIRFLMH ARNSGVLKWK A
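The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this in plain Python. The function name and layout are assumptions made for illustration, and treating the first codon as an initiator is a simplification that fits this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and stopping at a stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGCGGGTAT TACTTCGACC TGTTCTGGTA"))  # MRVLLRPVLV
```

The output matches the first ten residues of the protein sequence listed above.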