Gene EcSMS35_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2075 
Symbol 
ID6144621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2089438 
End bp2090490 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content49% 
IMG OID641616951 
Producthypothetical protein 
Protein accessionYP_001744127 
Protein GI170680794 
COG category[R] General function prediction only 
COG ID[COG1054] Predicted sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.602347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.562441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGTGT TACACAACCG CATTTCCAAC GACGCGTTAA AAGCCAAAAT GTTGGCTGAG 
AGCGAACCGC GAACCACCAT TTCGTTTTAT AAGTATTTCC ACATCGCCGA TCCTAAGGCG
ACCCGTGACG CTTTATATCA GCTGTTTACC GCGCTGAATG TTTTTGGGCG AGTGTATCTG
GCGCATGAGG GCATTAACGC GCAAATCAGC GTACCTGCGA GCAATGTTGA AACATTTCGC
GCGCAGCTTT ATGCCTTCGA CCCGGCTTTA GAGGGCTTAC GCCTGAATAT CGCGTTGGAA
GATGACGGGA AATCCTTCTG GGTACTGCGC ATGAAGGTCC GCGATCGTAT CGTTGCCGAC
GGTATTGACG ATCCTCACTT TGATGCCAGC AATGTGGGTG AGTATCTGCA AGCGGCGGAA
GTGAACGCCA TGCTTGACGA TCCCGATGCA TTGTTTATCG ACATGCGTAA CCACTATGAG
TATGAAGTGG GGCACTTTGA AAACGCGCTG GAAATTCCGG CAGATACCTT CCGTGAGCAG
CTGCCAAAAG CAGTTGAGAT GATGCAGGCA CATAAAGATA AAAAAATCGT CATGTACTGC
ACCGGCGGCA TTCGTTGTGA AAAAGCCAGT GCCTGGATGA AACATAACGG ATTCAATAAA
GTCTGGCATA TCGAGGGTGG AATTATTGAA TACGCCCGTA AGGCGCGCGA GCAGGGCTTG
CCGGTGCGTT TTATTGGCAA AAATTTTGTT TTTGACGAGC GGATGGGCGA ACGTATATCT
GATGAGATTA TCGCGCATTG CCACCAGTGC GGTGCGCCGT GCGACAGCCA TACCAACTGT
AAAAATGATG GCTGCCATCT GCTTTTTATT CAGTGTCCAG TATGTGCGGA AAAATACAAA
GGTTGTTGTA GTGAGATTTG CTGCGAAGAA AGCGCGTTAC CGCCAGAGGA ACAGCGACGC
CGTCGGGCAG GACGTGAAAA TGGCAATAAG ATCTTTAATA AGTCTCGTGG ACGTCTGAAT
ACAACACTGG GCATTCCTGA TCCAACAGAG TAA
 
Protein sequence
MPVLHNRISN DALKAKMLAE SEPRTTISFY KYFHIADPKA TRDALYQLFT ALNVFGRVYL 
AHEGINAQIS VPASNVETFR AQLYAFDPAL EGLRLNIALE DDGKSFWVLR MKVRDRIVAD
GIDDPHFDAS NVGEYLQAAE VNAMLDDPDA LFIDMRNHYE YEVGHFENAL EIPADTFREQ
LPKAVEMMQA HKDKKIVMYC TGGIRCEKAS AWMKHNGFNK VWHIEGGIIE YARKAREQGL
PVRFIGKNFV FDERMGERIS DEIIAHCHQC GAPCDSHTNC KNDGCHLLFI QCPVCAEKYK
GCCSEICCEE SALPPEEQRR RRAGRENGNK IFNKSRGRLN TTLGIPDPTE