Gene EcSMS35_3653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3653 
Symbol 
ID6146260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3712951 
End bp3714255 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content54% 
IMG OID641618480 
Producthypothetical protein 
Protein accessionYP_001745620 
Protein GI170680136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.874459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0000157987 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGATCTGT ATATTCAGAT TATCGTGGTG GCGTGCCTGA CGGGTATGAC ATCGCTTCTG 
GCGCATCGCT CGGCAGCCGT TTTTCATGAC GGTATCCGCC CGATCCTGCC GCAACTGATT
GAAGGCTATA TGAACCGTCG CGAGGCGGGG AGTATCGCTT TTGGTCTGAG CATTGGTTTT
GTGGCCTCGG TGGGGATCTC TTTTACCCTG AAAACCGGGC TGCTCAACGC ATGGTTACTC
TTTCTTCCTA CCGATATCCT CGGCGTACTG GCGATAAACA GCCTGATGGC GTTTGGTCTT
GGCGCTATCT GGGGCGTGTT GATCCTTACT TGCCTGTTGC CGGTAAACCA GCTGCTGACC
GCGCTACCGG TGGATGTATT AGGTAGCCTC GGGGAATTAA GCTCGCCGGT GGTTTCAGCT
TTTGCACTCT TCCCGCTGGT GGCGATTTTC TACCAGTTTG GCTGGAAGCA AAGTCTGATC
GCCGCCGTGG TGGTACTGAT GACCCGCGTG GTCGTCGTGC GCTATTTCCC ACATCTTAAC
CCTGAATCCA TCGAAATCTT TATTGGTATG GTGATGCTGC TGGGGATCGC GATAACTCAC
GACCTGCGTC ATCGTGATGA AAATGACATT GATGCCAGCG GGCTTTCGGT GTTTGAAGAA
CGCACGTCAC GGATTATCAA AAACTTACCG TATATCGCCA TCGTGGGAGC ATTGATTGCC
GCCGTTGCCA GCATGAAGAT TTTCGCCGGT AGTGAAGTGT CGATCTTCAC TCTGGAGAAA
GCGTACTCCG CAGGCGTAAC GCCGGAACAA TCGCAAACAC TAATCAACCA GGCAGCTCTG
GCGGAATTTA TGCGCGGGCT GGGTTTTGTG CCGATGATTG CCACCACCGC GCTAGCCACC
GGCGTGTATG CAGTTGCGGG CTTTACCTTT GTTTATGCGG TGGGCTATCT CTCGCCGAAT
CCGATGGTTG CAGCGGTATT AGGCGCAGTG GTTATTTCGG CGGAAGTCCT GCTGCTTCGT
TCGATCGGCA AATGGCTGGG GCGCTACCCG TCGGTGCGTA ATGCGTCGGA TAACATCCGT
AACGCCATGA ATATGCTGAT GGAAGTGGCG TTGCTGGTCG GTTCGATCTT TGCAGCAATC
AAAATGGCGG GCTATACCGG ATTCTCTATC GCAGTTGCCA TTTACTTCCT CAACGAATCC
CTGGGCCGTC CGGTACAGAA AATGGCGGCA CCGGTCGTGG CAGTAATGAT CACCGGTATT
CTGCTGAATG TTCTTTACTG GCTTGGCCTG TTCGTTCCGG CTTAA
 
Protein sequence
MDLYIQIIVV ACLTGMTSLL AHRSAAVFHD GIRPILPQLI EGYMNRREAG SIAFGLSIGF 
VASVGISFTL KTGLLNAWLL FLPTDILGVL AINSLMAFGL GAIWGVLILT CLLPVNQLLT
ALPVDVLGSL GELSSPVVSA FALFPLVAIF YQFGWKQSLI AAVVVLMTRV VVVRYFPHLN
PESIEIFIGM VMLLGIAITH DLRHRDENDI DASGLSVFEE RTSRIIKNLP YIAIVGALIA
AVASMKIFAG SEVSIFTLEK AYSAGVTPEQ SQTLINQAAL AEFMRGLGFV PMIATTALAT
GVYAVAGFTF VYAVGYLSPN PMVAAVLGAV VISAEVLLLR SIGKWLGRYP SVRNASDNIR
NAMNMLMEVA LLVGSIFAAI KMAGYTGFSI AVAIYFLNES LGRPVQKMAA PVVAVMITGI
LLNVLYWLGL FVPA