Gene EcSMS35_4883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4883 
Symbol 
ID6144391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4999070 
End bp5000350 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content52% 
IMG OID641619687 
Producthypothetical protein 
Protein accessionYP_001746794 
Protein GI170681238 
COG category[S] Function unknown 
COG ID[COG2733] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.82212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.899072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC 
GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG
AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG
CTATTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT CCCGCGTAAT
AAAGACCGGA TTGGCGAAAA TCTCGGTCAG TTCGTGCAGG AAAAATTTCT CGATACCCAA
TCGCTGGTGG CATTGATTCG ACGTCACGAA CCGGCGTTGT TGATTGGCAA CTGGTTTAGT
CAGCCAGAAA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG TGGTTTTCTT
GAACTGACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCAGTCCA TCGGGCGATT
GATAAGGTCG ATCTTTCCGG CACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT
CGTCATCAGG TGCTGCTGGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT
AAATCGCGCA AGTTTATCGC CCAGCAGATT GTTCGCTGGC TGGAGAGTGA GCATCCACTG
AAAGCCAAAA TTTTGCCCAC TGAATGGCTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC
GCGGTGAATT CTTTGCTTGA TGATATTAGT CGCGATCGTG CGCATCAGAT CCGCCATGCG
TTTGATCGCG CCACCTTCGC CCTGATCGAC AAGCTGAAAA ACGATCCGGA AATGGCAGCG
CGAGCCGATG CCGTAAAAAG CTATCTGAAA GAAGATGAAG CTTTTAATCG CTATCTCAGT
GAATTGTGGG GGGATTTACG GGAATGGCTG AAAGTGGATA TCAACAGTGA AGATTCTCGT
GTGAAAGAAC GTATCGCACG AGCGGGTCAA TGGTTTGGTG AAACGTTAAT TGCCGATGAT
GCCTTGCGGG CGTCGTTAAA TGGTCACCTG GAACAAGCCG CGCACCGCGT CGCGCCTGAG
TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACAGTAA AAAGCTGGGA TGCGCGGGAT
ATGTCGCGGC AAATCGAGTT AAATATTGGC AAAGATCTGC AGTTTATCCG TGTCAACGGT
ACGCTAGTTG GCGGTTGTAT TGGGCTAATT TTGTATTTGC TGTCGCAGCT CCCGGCCTTG
TTCCCCCTCG GCAATTTTTA G
 
Protein sequence
MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA 
LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PALLIGNWFS
QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND
RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD
AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS
ELWGDLREWL KVDINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE
FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL
FPLGNF