Gene ECH74115_0863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0863 
SymbolmodF 
ID6968397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp879550 
End bp881022 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content52% 
IMG OID643384888 
Productputative molybdenum transport ATP-binding protein ModF 
Protein accessionYP_002269388 
Protein GI209395942 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1119] ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0496449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGT TGCAAATTTT GCAAGGCACG TTTCGTCTTA GCGACACAAA AACGCTGCAA 
TTGCCTCAGC TAACGTTAAA CGCGGGTGAT AGTTGGGCGT TTGTCGGTTC GAATGGAAGC
GGGAAATCGG CCCTGGCCCG CGCGCTGGCG GGGGAACTTC CGCTTTTGAA AGGTGAACGG
CAAAGCCAGT TTTCCCACAT CACTCGTCTC TCCTTCGAGC AATTGCAAAA GCTCGTCAGC
GACGAATGGC AGCGGAATAA CACCGATATG CTCGGCCCTG GCGAAGATGA CACCGGACGC
ACTACGGCTG AGATTATTCA GGATGAAGTA AAGGATGCAC CGCGCTGCAT GCAACTGGCA
CAGCAGTTCG GTATTACCCC CCTCCTCGAC CGACGCTTTA AATACCTTTC CACTGGCGAG
ACGCGAAAAA CCCTGCTGTG TCAGGCGCTG ATGTCGGAGC CTGACTTGTT GATTCTTGAT
GAGCCGTTCG ATGGCCTGGA TGTTGCCTCA CGTCAGCAGC TGGCTGAGCG ACTCGCCTCG
TTACATCAGT CCGGTATTAC TCTGGTACTG GTGCTCAATC GCTTCGATGA GATCCCGGAG
TTTGTCCAGT TTGCTGGCGT GCTGGCGGAT TGCACGTTAG CGGAAACTGG CGCTAAAGAG
GAACTGCTCC AACAAGCACT CGTCGCGCAA CTGGCGCATA GTGAACAGCT TGAAGGTGTG
CAACTGCCGG AGCCGGATGA ACCTTCAGCA CGTCACGCCT TACCCGCCAA CGAACCGCGC
ATTGTGCTGA ATAATGGCGT GGTTTCTTAT AACGATCGCC CCATTCTTAA TAACCTTAGC
TGGCAGGTGA ATCCAGGTGA ACACTGGCAA ATTGTCGGGC CAAATGGCGC GGGAAAATCG
ACGTTATTAA GCCTGATTAC TGGCGATCAT CCGCAAGGTT ACAGCAACGA TTTGACGCTT
TTCGGACGAC GTCGTGGCAG CGGCGAAACC ATCTGGGATA TCAAAAAGCA TATCGGTTAC
GTCAGCAGTA GTTTGCATCT GGATTACCGG GTCAGTACTA CTGTGCGTAA TGTGATTCTT
TCTGGCTATT TTGATTCGAT TGGCATTTAT CAGGCCGTTT CGGATCGCCA GCAAAAACTG
GTGCAGCAGT GGCTGGATAT TCTCGGCATT GATAAACGCA CGGCTGACGC TCCGTTCCAT
AGTCTTTCCT GGGGACAGCA GCGTCTGGCG CTGATCGTCC GCGCACTGGT GAAACATCCG
ACGTTGCTTA TTCTCGACGA ACCATTGCAG GGGCTTGATC CGCTCAATCG CCAGCTTATT
CGCCGTTTTG TTGATGTGCT GATTAGCGAA GGTGAAACGC AATTGTTGTT TGTTTCGCAC
CATGCTGAAG ATGCGCCGGC TTGTATTACC CATCGTCTGG AGTTCGTGCC GGACGGTGGC
CTCTATCATT ATGCGCTGAC AAAAATAAAC TAA
 
Protein sequence
MSSLQILQGT FRLSDTKTLQ LPQLTLNAGD SWAFVGSNGS GKSALARALA GELPLLKGER 
QSQFSHITRL SFEQLQKLVS DEWQRNNTDM LGPGEDDTGR TTAEIIQDEV KDAPRCMQLA
QQFGITPLLD RRFKYLSTGE TRKTLLCQAL MSEPDLLILD EPFDGLDVAS RQQLAERLAS
LHQSGITLVL VLNRFDEIPE FVQFAGVLAD CTLAETGAKE ELLQQALVAQ LAHSEQLEGV
QLPEPDEPSA RHALPANEPR IVLNNGVVSY NDRPILNNLS WQVNPGEHWQ IVGPNGAGKS
TLLSLITGDH PQGYSNDLTL FGRRRGSGET IWDIKKHIGY VSSSLHLDYR VSTTVRNVIL
SGYFDSIGIY QAVSDRQQKL VQQWLDILGI DKRTADAPFH SLSWGQQRLA LIVRALVKHP
TLLILDEPLQ GLDPLNRQLI RRFVDVLISE GETQLLFVSH HAEDAPACIT HRLEFVPDGG
LYHYALTKIN