Gene EcSMS35_1553 details

Gene Information

Locus tag: EcSMS35_1553
Symbol:
ID: 6146271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1538723
End bp: 1540735
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 53%
IMG OID: 641616430
Product: fusaric acid resistance domain-containing protein
Protein accession: YP_001743608
Protein GI: 170682942
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
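
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention. The function names are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1538723
END_BP = 1540735
GENE_LENGTH_BP = 2013
PROTEIN_LENGTH_AA = 670


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

With these values, 1540735 - 1538723 + 1 = 2013 bp, and 2013 / 3 = 671 codons, which is 670 residues plus one stop codon.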


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.266646
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCAT CGTCATGGTC CTTGCGCAAT TTGCCCTGGT TCAGGGCCAC GCTGGCGCAA 
TGGCGTTATG CGTTACGCAA TACCATTGCC ATGTGTCTGG CGCTGACGGT TGCCTATTAT
TTAAATCTGG ATGAACCCTA TTGGGCGATG ACCTCGGCTG CCGTGGTTAG CTTTCCCACC
GTGGGCGGTG TCATCAGCAA AAGCCTCGGA CGCATCGCTG GCAGTTTGCT CGGAGCCATT
GCGGCACTGC TTCTTGCCGG ACATACACTT AATGAGCCGT GGTTTTTTCT TTTGAGCATG
GCGGCGTGGC TTGGCTTTTG TACCTGGGCC TGTGCGCACT TCACGAATAA TGTCGCGTAT
GCATTTCAAC TGGCGGGCTA CACGGCTGCC ATCATCGCCT TTCCGATGGT TAATATTACT
GAGGCCAGCC AGCTGTGGGA TATCGCTCAG GCGCGCGTTT GCGAGGTGAT TGTCGGTATT
TTGTGCGGCG GCATGATGAT GATGATCCTG CCGAGCAATT CCGATGCTAC AGCCCTTTTA
ACCGCATTGA AAAACATGCA CGCCCGATTG CTGGAACATG CCAGTTTACT CTGGCAGCCT
GAAACAACCG ATGCCATTCG TGCAGCACAT GAAGGGGTGA TTGGGCAGAT ACTGACCATG
AATTTGCTCC GTATCCAGGC TTTCTGGAGC CACTATCGTT TTCGCCAGCA AAACGCACGC
CTTAATGCGC TGCTCCACCA GCAATTACGT ATGACCAGTG TCATCTCCAG CCTGCGACGT
ATGTTGCTCA ACTGGCCCTC ACCGCCAGGT GCCACACGAG AAATTCTCGA ACAGCTGCTG
ACGGCGCTCG CCAGTTCGCA AACAGATGTT TACACCGTCG CACGTATTAT CGCCCCGCTA
CGCCCGACCA ACGTCGCCGA CTATCGGCAC GTCGCCTTCT GGCAGCGACT ACGTTATTTT
TGCCGACTTT ATCTGCAAAG TAGTCAGGAA TTACATCGTC TGCAAAGCGG TGTAGATGAT
CATACCAGAC TCCCACGGAC ATCCGGCCTG GCTCGTCATA CCGATAACGC CGAAGCTATG
TGGAGCGGGC TGCGTACATT TTGTACGTTG ATGATGATTG GCACATGGAG TATTGCTTCG
CAATGGGATG CCGGTGCCAA TGCATTAACG CTGGCAGCGA TTAGCTGCGT ACTCTACTCC
GCCGTCGCAG CTCCGTTTAA GTCGTTGTCA CTGCTGATGC GTACGCTGGT GTTACTTTCG
CTATTCAGCT TTGTGGTCAA ATTTGGCCTG ATGGTCCAGA TTAGCGATCT GTGGCAATTT
TTACTGTTTC TCTTTCCACT GCTGGCGACG ATGCAGCTTC TTAAATTGCA GATGCCCAAA
TTTGCCGCAC TGTGGGGGCA ACTGATTGTT TTTATGGGTT CGTTTATCGC TGTCACTAAT
CCCCCGGTGT ATGATTTTGC TGATTTTCTT AACGATAATC TGGCAAAAAT CGTTGGCGTT
GCGCTGGCGT GGTTAGCGTT CGCCATTCTG CGTCCAGGAT CGGATGCTCG TAAAAGTCGC
CGTCATATTC GCGCGCTGCG CCGGGATTTT GTCGATCAAC TAAGCCGCCA TCCATCACTG
AGTGAAAGCG AATTTGAATC GCTCACTTAT CATCACGTCA GTCAGTTGAG TAACAGCCAG
GATGCGATGG CTCGCCGTTG GTTATTACGC TGGGGTGTAG TGCTGCTGAA CTGTTCTCAT
GTTGTCTGGC AATTGCGCGA CTGGGAATCG CGTTCCGATC CGTTATCGCG AGTACGGGAT
AACTGTATTT CACTGTTGCG CGGAGTGATG AGTGAGCGTG GCGTTCAGCA AAAATCACTG
GCGGCCACAC TTGAAGAATT ACAGCGAATT TGCGACAGCC TTGCCCGTCA TCATCAACCT
GCCGCCCGTG AGCTGGCGGC AATTGTCTGG CGGCTGTACT GCTCGCTTTC GCAACTTGAG
CAAGCACCGC CGCAAGGTAC GCTGGCCTCT TAA
 
Protein sequence
MNASSWSLRN LPWFRATLAQ WRYALRNTIA MCLALTVAYY LNLDEPYWAM TSAAVVSFPT 
VGGVISKSLG RIAGSLLGAI AALLLAGHTL NEPWFFLLSM AAWLGFCTWA CAHFTNNVAY
AFQLAGYTAA IIAFPMVNIT EASQLWDIAQ ARVCEVIVGI LCGGMMMMIL PSNSDATALL
TALKNMHARL LEHASLLWQP ETTDAIRAAH EGVIGQILTM NLLRIQAFWS HYRFRQQNAR
LNALLHQQLR MTSVISSLRR MLLNWPSPPG ATREILEQLL TALASSQTDV YTVARIIAPL
RPTNVADYRH VAFWQRLRYF CRLYLQSSQE LHRLQSGVDD HTRLPRTSGL ARHTDNAEAM
WSGLRTFCTL MMIGTWSIAS QWDAGANALT LAAISCVLYS AVAAPFKSLS LLMRTLVLLS
LFSFVVKFGL MVQISDLWQF LLFLFPLLAT MQLLKLQMPK FAALWGQLIV FMGSFIAVTN
PPVYDFADFL NDNLAKIVGV ALAWLAFAIL RPGSDARKSR RHIRALRRDF VDQLSRHPSL
SESEFESLTY HHVSQLSNSQ DAMARRWLLR WGVVLLNCSH VVWQLRDWES RSDPLSRVRD
NCISLLRGVM SERGVQQKSL AATLEELQRI CDSLARHHQP AARELAAIVW RLYCSLSQLE
QAPPQGTLAS
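
The relationship between the two sequences can be illustrated by translating the first and last rows of the gene sequence and comparing them with the matching stretches of the protein sequence. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, not in codon meanings).

```python
from itertools import product

# Standard codon assignments, listed in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAACGCAT CGTCATGGTC CTTGCGCAAT TTGCCCTGGT TCAGGGCCAC GCTGGCGCAA"
last_row = "CAAGCACCGC CGCAAGGTAC GCTGGCCTCT TAA"

print(translate(first_row))  # MNASSWSLRNLPWFRATLAQ
print(translate(last_row))   # QAPPQGTLAS
```

Both outputs match the start and end of the protein sequence listed above. The final TAA codon is the stop codon, and it is not represented in the 670-residue protein.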