Gene EcHS_A1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1029 
SymbolmukF 
ID5595112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1034657 
End bp1035979 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content53% 
IMG OID640920196 
Productcondesin subunit F 
Protein accessionYP_001457761 
Protein GI157160443 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG3006] Uncharacterized protein involved in chromosome partitioning 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.000114842 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAT TTTCCCAGAC AGTCCCCGAA CTGGTTGCCT GGGCCAGAAA AAATGACTTC 
TCCATCTCGC TGCCGGTAGA CCGACTCTCT TTTCTGCTGG CGGTTGCCAC GCTGAACGGC
GAGCGTCTGG ATGGTGAGAT GAGTGAAGGC GAGCTGGTGG ATGCATTCCG CCATGTGAGT
GATGCGTTTG AGCAAACCAG CGAAACCATC GGCGTGCGCG CCAATAACGC GATCAACGAC
ATGGTGCGTC AACGTCTGCT GAACCGCTTT ACCAGCGAGC AGGCGGAAGG GAACGCAATT
TACCGTCTGA CGCCGCTCGG CATCGGCATT ACTGACTACT ACATCCGTCA GCGCGAGTTT
TCTACGCTGC GTCTTTCTAT GCAGTTGTCG ATTGTGGCGG GTGAGCTCAA ACGCGCAGCA
GATGCCGCCG AAGAGGGCGG TGATGAATTT CACTGGCACC GTAATGTCTA TGCGCCACTG
AAATATTCGG TAGCAGAAAT TTTCGACAGT ATCGACCTGA CGCAACGTCT GATGGACGAA
CAGCAGCAGC AGGTGAAGGA CGATATCGCC CAGTTGCTGA ACAAAGACTG GCGGGCGGCG
ATTTCCAGCT GTGAATTGTT GCTTTCGGAA ACTTCCGGAA CGCTGCGTGA ATTGCAGGAT
ACGCTGGAAG CGGCAGGCGA CAAATTGCAG GCTAATCTGT TGCGCATTCA GGATGCGACG
ATGACCCATG ACGATCTGCA TTTCGTCGAT CGTCTGGTGT TCGATCTGCA GAGCAAACTC
GATCGTATTA TCAGTTGGGG CCAGCAATCC ATCGACTTGT GGATTGGCTA CGACCGCCAC
GTACACAAAT TTATTCGTAC CGCGATCGAT ATGGATAAAA ACCGCGTCTT TGCTCAGCGG
TTACGTCAGT CGGTACAAAC CTATTTTGAT GAGCCGTGGG CGCTAACTTA TGCCAATGCC
GATCGTCTGC TGGATATGCG TGACGAAGAG ATGGCACTGC GCGATGAAGA AGTGACTGGG
GAACTTCCTG AGGATCTGGA ATACGAAGAG TTTAACAAGA TCCGCGAACA GCTGGCGGCG
ATCATCGAAG AACAACTTGC CGTGTACAAA ACCAGACAAG TGCCGCTGGA TCTTGGTCTG
GTGGTACGCG AATATCTGTC ACAGTATCCG CGTGCACGTC ACTTTGACGT TGCGCGTATT
GTTATTGATC AGGCGGTACG TCTTGGCGTA GCGCAAGCAG ATTTCACCGG ACTGCCAGCG
AAATGGCAGC CGATTAATGA TTACGGAGCC AAGGTACAGG CGCATGTCAT CGACAAATAT
TGA
 
Protein sequence
MSEFSQTVPE LVAWARKNDF SISLPVDRLS FLLAVATLNG ERLDGEMSEG ELVDAFRHVS 
DAFEQTSETI GVRANNAIND MVRQRLLNRF TSEQAEGNAI YRLTPLGIGI TDYYIRQREF
STLRLSMQLS IVAGELKRAA DAAEEGGDEF HWHRNVYAPL KYSVAEIFDS IDLTQRLMDE
QQQQVKDDIA QLLNKDWRAA ISSCELLLSE TSGTLRELQD TLEAAGDKLQ ANLLRIQDAT
MTHDDLHFVD RLVFDLQSKL DRIISWGQQS IDLWIGYDRH VHKFIRTAID MDKNRVFAQR
LRQSVQTYFD EPWALTYANA DRLLDMRDEE MALRDEEVTG ELPEDLEYEE FNKIREQLAA
IIEEQLAVYK TRQVPLDLGL VVREYLSQYP RARHFDVARI VIDQAVRLGV AQADFTGLPA
KWQPINDYGA KVQAHVIDKY