Gene EcSMS35_A0142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0142 
Symbol 
ID6106471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp109546 
End bp111684 
Gene Length2139 bp 
Protein Length712 aa 
Translation table11 
GC content50% 
IMG OID641614881 
Producthypothetical protein 
Protein accessionYP_001740022 
Protein GI170650811 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTG AAATACGCCA CTGCAATAAC ATCGTACGAG CACATATCAC CCTCACTGCC 
GATAAACTGA ACATTAAGTT TGCGCCAAAT GGTACTGGAA AAAGTACGCT GTCACGGGCC
ATAAGCTGTG CGGCGCGGGA CGACATTCAG GGACTACAGG CACTGATGCC CTTCCGGCTG
CGCGGAGAAA ACCCCGATAG CACCGGGCCC ATTGTCATCG GTGCTGACGG GATTGGGGAC
GTGATGTGCT TCAACGAGGA GTACGTCAGT CAGTTTACGT TCCAACCGGA CGAGCTCATC
AGTGATAGCT TTAATATTCT TATCCGCAAT CAAGCCCATG CGGAAAGAGA GCGTGAAATA
GAAGAAATGA CGCAAAAGAT CCGGGCTGTT TTCACGGATC ACACTGAGCT GAACTCTCTG
ATAGACCATC TACAGGAACT CAGTAATGCT TTCAGATCAA CCAGTTCCGG GATTTCACGT
TCTTCAACCG GCATGAGAGG TCTGTCAGGC GGAAATAAAA TACACCACAT TCCGGCTGGT
CTTGAGAACT ATCAACCCTA CATCCGCAGT GAACGCCGCG TGGAATGGAT TGACTGGCAG
ACAAAGGGAC TCGAGTTTTC GCCCCTTTCG GATGGCTGCT GTCCGTTCTG TACCGGTGAT
ATCACGGGAA AGGAAGCACA AATTCGCCAA GTCAGAGAGG AATACGATAA ATCCACCATT
AAAAACCTGA CCGCTATCAT CAGACTGGTG GAAAACCTCG GTAATTACCT GACGGAGAGC
GCCAGAGAAC GCCTTCTGGC CATTACTATG CTTCAGAACG GTCCGGAAGC CGAACATATC
GAGTACCTAG TGGCGTTGAA ACGCCAGACC GATACGCTGA CAGAGAAACT CACTGCGCTG
AGAGGCCTGA ATGTTTTCAG CCTGCAGGAA CAGCAGAACG TTCGGGAGGT GCTCACTGCC
AGGTTGATTG ACCTGCAGTT CTTCCCTGAT CTGCAATCCG AACTCATGCA GGGGATCACC
GACAGACTGA ACGCGGCCCT TATGGACCTG ATAAACCTTG CCGGACCGCT GCAGGGAAAA
ATTAACCGGC ACCGGGACAG CATGATCCGG CTGATCGCAC AGCACAAAAC AAACATCAAT
AATTTCCTCA CTTATGCAGG CTATAAATAC AGGGTGGATA TAGCCGGCGA GGGAGAGCAG
AGAAAACTCA GGCTACGACA CATAGATTTT GACGGGTACG TCAGCGGCGG TAGCCAGCAT
CTCAGCTATG GAGAACGGAA CGCCTTTGCG ATTGTGTTAT TCATGTATGA ATGCCTGTCA
AAAAACCCGG GACTGATTAT CCTTGATGAT CCCATATCTT CTTTTGATAA AAACAAGAAG
TTTGCCATTC TTGAAATGCT GTTCAGACGT GCAAGTGGTG AATGCCTGAA GAACCGGACT
GTACTTATGC TGACGCATGA CGTGGAGCCT GTGATTGACA CGCTGAAGTC AGTCAGAAGG
TTGTTCAGTA ATCAGGTGAC CGCCTCCTGC CTGCGCCTGT CTGCTGGAGT CATAGAAGAA
TTACCTGTTA ACGACGGCGA TATCATGACA TTCATGCAGA TCTGCAAATC CATCACCGCG
TCAGCAGACT GTGAGGAGAT CATTAAGCTC ATCTATCTTC GTCGCTACTT TGAAATCGTT
GACGAACGTG GTGATGCTTA TCAGCTTCTA TCCAATCTAT TCCATCGACG CGTCGTCCCG
CTGGATTATC GCGAGCCTGC CGCTGCGGGC TCGGGCTATC CCAAAATGGC TCCTGAGAAA
ATTCAGCAGG CCTTGCGGGA TATCCGGGAG TATGTGGACA GTTTTGATTA CCCGCGACTT
CAGGCTCTCG TCAGCAGCCC AGATGAAATA AAAAACCTGT ACCGTCGCTG TCGTAACTGC
TATGAAAAAC TGCAGGTATT CCGTTTACTG GAACTGGATC AGGACCATCC GGTAATACGA
AAATTCGTTA ACGAAACGTA CCACATTGAA AATGAGTTCA TCTGTCAACT AGATCCGTCC
AGATTCGATC TCATCCCGGA GTACGTGATT ATGGAATGTG ACAAGCTCAT CGCCCTGCCA
CCGGCAGCAA ACCAGAGCTC GGTTGCCCGT ATCGCTTGA
 
Protein sequence
MDIEIRHCNN IVRAHITLTA DKLNIKFAPN GTGKSTLSRA ISCAARDDIQ GLQALMPFRL 
RGENPDSTGP IVIGADGIGD VMCFNEEYVS QFTFQPDELI SDSFNILIRN QAHAEREREI
EEMTQKIRAV FTDHTELNSL IDHLQELSNA FRSTSSGISR SSTGMRGLSG GNKIHHIPAG
LENYQPYIRS ERRVEWIDWQ TKGLEFSPLS DGCCPFCTGD ITGKEAQIRQ VREEYDKSTI
KNLTAIIRLV ENLGNYLTES ARERLLAITM LQNGPEAEHI EYLVALKRQT DTLTEKLTAL
RGLNVFSLQE QQNVREVLTA RLIDLQFFPD LQSELMQGIT DRLNAALMDL INLAGPLQGK
INRHRDSMIR LIAQHKTNIN NFLTYAGYKY RVDIAGEGEQ RKLRLRHIDF DGYVSGGSQH
LSYGERNAFA IVLFMYECLS KNPGLIILDD PISSFDKNKK FAILEMLFRR ASGECLKNRT
VLMLTHDVEP VIDTLKSVRR LFSNQVTASC LRLSAGVIEE LPVNDGDIMT FMQICKSITA
SADCEEIIKL IYLRRYFEIV DERGDAYQLL SNLFHRRVVP LDYREPAAAG SGYPKMAPEK
IQQALRDIRE YVDSFDYPRL QALVSSPDEI KNLYRRCRNC YEKLQVFRLL ELDQDHPVIR
KFVNETYHIE NEFICQLDPS RFDLIPEYVI MECDKLIALP PAANQSSVAR IA