Gene EcSMS35_3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3406 
Symbol 
ID6145506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3485569 
End bp3486879 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content57% 
IMG OID641618235 
Producthypothetical protein 
Protein accessionYP_001745384 
Protein GI170681817 
COG category[S] Function unknown 
COG ID[COG3681] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA 
GTAAAACCGG CGCTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT
GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG
AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG
GCGCTGGGAG CGTTAGGTGG AAATGCTAAC GCCGGGCTGG AAGTGCTGAA AGATGCAACT
GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC AGTTAAGATC
CAGGAACCTT GCGATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG
GCGTGTGTCA CCATTGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACAATGGT
GTGGTGTTTA CCCAGCAGGC GTGTGTGACA GAGGGCGAGC AAGAGTCGCC GCTGACGGTG
CTTTCCAGGA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG
ATCCGCTTTA TTCTCGATTC CGCGAAGTTA AATTGCGCGT TATCGCAGGA AGGCTTGAGC
GGTAACTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGCGCG CGGCTTGCTG
GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG
GGCGGCGCTA CGCTTCCGGC AATGAGTAAC TCCGGCTCGG GTAACCAGGG GATCACTGCA
ACAATGCCTG TGGTGGTGGT AGCAGAACAC TTCGGGGCCG ATGATGAACG GCTGGCGCGT
GCGCTGATGC TTTCGCATTT GAGCGCGATT TACATCCATA ACCAGTTACC GCGTTTGTCT
GCGCTTTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG
GATGGGCGTT ATGAAACCAT TTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC
ATGATTTGCG ATGGTGCGTC GAACAGCTGC GCGATGAAGG TTTCGACCAG TGCTTCGGCT
GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCAG TGACCGGCAA TGAAGGGATT
GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG
CAGCAAACGG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
 
Protein sequence
MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM 
KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI
QEPCDEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHNG VVFTQQACVT EGEQESPLTV
LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GNWGLHIGAT LEKQCARGLL
AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMPVVVVAEH FGADDERLAR
ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG
MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM
QQTDRQIIEI MASKAR