Gene EcSMS35_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2198 
SymbolmukF 
ID6142992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2211616 
End bp2212938 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content53% 
IMG OID641617074 
Productcondesin subunit F 
Protein accessionYP_001744248 
Protein GI170679667 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG3006] Uncharacterized protein involved in chromosome partitioning 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000709701 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0963487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAT TTTCCCAGAC AGTCCCCGAA CTGGTTGCCT GGGCCAGAAA AAATGACTTC 
TCCATCTCGC TGCCGGTAGA CCGACTCTCT TTTCTGCTGG CGGTTGCCAC GCTGAACGGC
GAGCGTCTGG ATGGTGAGAT GAGTGAAGGC GAGCTGGTGG ATGCATTCCG CCATGTGAGT
GATGCGTTTG AGCAAACCAG CGAAACCATC GGCGTGCGTG CCAACAACGC GATCAACGAC
ATGGTGCGTC AACGTCTGCT GAACCGCTTT ACCAGCGAGC AGGCGGAAGG GAACGCAATT
TACCGTCTGA CGCCGCTCGG CATCGGCATT ACTGACTATT ACATCCGTCA GCGCGAGTTT
TCTACGCTGC GCCTTTCTAT GCAGTTGTCG ATTGTGGCGG GTGAGCTCAA ACGCGCGGCA
GATGCCGCCG AAGAGGGCGG TGATGAATTT CACTGGCACC GTAATGTTTA TGCGCCACTG
AAATATTCGG TAGCAGAAAT TTTCGACAGT ATCGACCTGA CGCAACGTCT GATGGACGAA
CAGCAGCAGC AGGTGAAGGA CGATATCGCC CAGTTGCTGA ACAAAGACTG GCGGGCGGCG
ATCTCCAGCT GTGAATTGTT GCTTTCGGAA ACTTCCGGAA CGCTGCGTGA ATTGCAGGAT
ACGCTGGAAG CGGCAGGCGA CAAATTGCAG GCTAATCTGT TGCGCATTCA GGATGCGACG
ATGACCCATG ACGATCTGCA TTTTGTCGAT CGTCTGGTGT TCGATCTGCA GAGCAAACTT
GACCGTATTA TCAGTTGGGG CCAGCAATCC ATCGACTTGT GGATTGGCTA CGACCGCCAC
GTACACAAAT TTATCCGTAC CGCAATCGAT ATGGATAAAA ACCGCGTCTT TGCTCAGCGG
TTACGTCAGT CGGTACAAAC CTATTTTGAT GAGCCGTGGG CGCTAACTTA TGCCAATGCC
GATCGTCTGC TGGATATGCG TGACGAAGAG ATGGCACTGC GCGATGAAGA AGTGACTGGG
GAACTTCCTG AGGATCTGGA GTACGAAGAG TTTAACGAGA TCCGCGAACA GCTGGCGGCG
ATCATTGAAG AACAACTTGC CGTGTACAAA ACCAGACAAG TGCCGCTGGA TCTTGGTCTG
GTGGTACGCG AATATCTGTC ACAGTATCCG CGTGCACGTC ACTTTGACGT TGCGCGCATT
GTTATTGATC AGGCGGTACG TCTTGGCGTA GCGCAAGCAG ATTTCACCGG ACTGCCAGCG
AAATGGCAGC CGATTAATGA TTACGGAGCC AAGGTACAGG CGCATGTCAT CGACAAATAT
TGA
 
Protein sequence
MSEFSQTVPE LVAWARKNDF SISLPVDRLS FLLAVATLNG ERLDGEMSEG ELVDAFRHVS 
DAFEQTSETI GVRANNAIND MVRQRLLNRF TSEQAEGNAI YRLTPLGIGI TDYYIRQREF
STLRLSMQLS IVAGELKRAA DAAEEGGDEF HWHRNVYAPL KYSVAEIFDS IDLTQRLMDE
QQQQVKDDIA QLLNKDWRAA ISSCELLLSE TSGTLRELQD TLEAAGDKLQ ANLLRIQDAT
MTHDDLHFVD RLVFDLQSKL DRIISWGQQS IDLWIGYDRH VHKFIRTAID MDKNRVFAQR
LRQSVQTYFD EPWALTYANA DRLLDMRDEE MALRDEEVTG ELPEDLEYEE FNEIREQLAA
IIEEQLAVYK TRQVPLDLGL VVREYLSQYP RARHFDVARI VIDQAVRLGV AQADFTGLPA
KWQPINDYGA KVQAHVIDKY