Gene EcSMS35_A0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0076 
Symbol 
ID6106573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp56954 
End bp58456 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content32% 
IMG OID641614823 
Producthypothetical protein 
Protein accessionYP_001739964 
Protein GI170650830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.000174847 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAAT TTTATCAATT TAGGGATGAA CAGCGAAAGG AGCTTGAACA ACATGATTTT 
TATAGTTTGA TTTCTTCAGA TTGTATAGCG TTGAAAGACA AACTATTATT TGCTCCTGTT
ATGGCTCATT TCATAATGAA CTTCAGAGAC ATGAATAAAT GGGTTATCAG GTTTGATAAC
AATGATAATG AATATAAATC TGTTATAAAC GGTGGAACAA TCGAGGATGA AACACATTCA
AGATTGTTTC TGGAAGACTG GAGGAAACTA TATATAGATG ACAAACTTAA CTGGAAAGCA
AGTGATGTTA TATACTGGTT GTTTATTAGT CGAGAAATGG AGTGTTTCCG AAAATTTGGT
ATTGATTTTA TGAGACTTTG TGTAGATGAT GGAGGAGACC CAATACTTCG ATATTCTCAC
TCCGAGTCAG GAGAAACTTG CGGTAATATA TTCTTTTCAA GAATTAGTCC TATTGCTGAT
CAAGTTGCCA ATCATTTGGG AATATCACTC CGTTATTTTG GAACATTTCA CCTTAATCTT
GAAAATGGAC ATGTATGGAA GTCAGAAGGT GTTTTTGAAA ATATAGAGTT GTCACCAGAT
TCTTATAAGA AAATGGCTAC TCTATCAAAG AGAATGTTTG ATATATTTGA AGGAATTCAT
GACTCTTTTT ATAATTACCT GTCCAGTTAT GTTCTTAATG GAAGTCATCC GTCATTTTTT
GAATCATTAC CTGTAGGGAA AAATGTTGCA CCTATATACC CTGAATTTGT GATAGAAAAC
AAAAGCCATA ACGATGGTAG ACATATTGAA CATATAAACA ATTACCTGGA AAAAATATCG
AGTCATGAGT TCTTTAAATG GCTGATTAAC ACCTCAATAG ACCCTCAATT GAAATTGAAA
AGTTTCATAC CTCTTTGGAT TGTTGATATT ATGGGGTATA GAGATATTAA TAAATATGTT
TTTACATATG AACAGCCTGA ATCAGAAAGT GAAAAGATTA TAAATGATTA TGCATTACAC
TTGTCAGAGC ATAGCCGTTT ATTTTATCAT GACTGGAAGT CACTTCAACT TGATGATATG
TTACGTTGGA GTGCCAGTGA TACTCTTGAG TTTATTTTTC TTAATTCAGA TATGGATATG
CATAGAGAAA ATATAGTTAA GTTTTCTTTG TTCGGATTAA AACACAGAGA TCCTGTTATC
AGGTTCTGGT TTATGATGAT ACTGGAGTTA AGTGGAAAAG AATTTTTCTC TCATGTTGGA
GATATAGCTT TACAGGTGGA AAGTAAATAT AATATTTATC TCCCATATTT ATGTGGACGC
CATGCAACAG AAAATGAGCA TGAAGCATAT AATAATATGT ATGAGCATTT TATGGTAAAG
GAACTTAGCC CTGAACAAAG TGATCTAATA ATACAAATTA CAGACATGGT TATGCGGTCA
TTATTGAATA ATTTGGATAT CTCATATCGA TATGTAGTAA ATAATTTATT GGCAGCTCGT
TAG
 
Protein sequence
MKKFYQFRDE QRKELEQHDF YSLISSDCIA LKDKLLFAPV MAHFIMNFRD MNKWVIRFDN 
NDNEYKSVIN GGTIEDETHS RLFLEDWRKL YIDDKLNWKA SDVIYWLFIS REMECFRKFG
IDFMRLCVDD GGDPILRYSH SESGETCGNI FFSRISPIAD QVANHLGISL RYFGTFHLNL
ENGHVWKSEG VFENIELSPD SYKKMATLSK RMFDIFEGIH DSFYNYLSSY VLNGSHPSFF
ESLPVGKNVA PIYPEFVIEN KSHNDGRHIE HINNYLEKIS SHEFFKWLIN TSIDPQLKLK
SFIPLWIVDI MGYRDINKYV FTYEQPESES EKIINDYALH LSEHSRLFYH DWKSLQLDDM
LRWSASDTLE FIFLNSDMDM HRENIVKFSL FGLKHRDPVI RFWFMMILEL SGKEFFSHVG
DIALQVESKY NIYLPYLCGR HATENEHEAY NNMYEHFMVK ELSPEQSDLI IQITDMVMRS
LLNNLDISYR YVVNNLLAAR