Gene EcSMS35_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0783 
SymbolmodF 
ID6142661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp784327 
End bp785799 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content52% 
IMG OID641615671 
Productputative molybdenum transport ATP-binding protein ModF 
Protein accessionYP_001742863 
Protein GI170683734 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1119] ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGT TGCAAATTTT GCAAGGCACG TTTCGTCTTG GCGACACAAA AACGCTGCAA 
TTGCCTCGGC TAACGTTAAA CGCGGGTGAT AGTTGGGCGT TTGTCGGTTC GAATGGAAGC
GGGAAATCGG CGCTGGCCCG CGCGCTGTCG GGGGAACTTC CGCTTTTGAA AGGTGAACGG
CAAAGCCAGT TTTCCCACAT CACTCGTCTC TCCTTCGAGC AATTGCAAAA GCTCGTCAGC
GACGAATGGC AGCGGAATAA CACCGATATG CTTAGTCCTG GCGAAGATGA CACCGGGCGT
ACCACGGCTG AGATCATTCA GGATGAAGTA AAGGATACTC CGCGTTGCAT GCAACTGGCG
CAGCAGTTCG GTATTACCGC CCTCCTCGAC CGACGCTTTA AATACCTTTC CACTGGCGAG
ACGCGAAAAA CCCTGCTGTG TCAGGCGCTG ATGTCGGAGC CTGACTTGTT GATTCTTGAT
GAGCCGTTCG ATGGCCTGGA TGTTGCCTCA CGTCAGCAGC TGGCTGAGCT ACTCGCCTCG
TTACATCAGT CCGGTATTAC TCTGGTACTG GTGCTCAATC GCTTCGATGA GATCCCAGAA
TTTGTCCAGT TTGCTGGCGT GCTGGCGGAT TGTACGTTAG CGGAAACTGG CGCTAAAGAG
GAACTGCTCC AGCAAGCACT CGTCGCGCAA CTGGCACATA GCGAACAGCT TGAAGGTGTG
CAAATGCCGG AGCCGGATGA ACCTTCAGCA CGTCACGCCT TACCCGCCAA CGAACCGCGC
ATTGTGCTGA ACAATGGAGT GGTTTCTTAT AACGAACGCC CCATTCTTAA TAACCTTAGC
TGGCAGGTGA ATCCAGGCGA ACACTGGCAA ATTGTCGGGC CAAATGGCGC GGGAAAATCG
ACGTTATTAA GCCTGATTAC TGGCGATCAT CCGCAAGGTT ACAGCAACGA TTTGACGCTT
TTCGGACGAC GTCGCGGCAG CGGCGAAACC ATCTGGGATA TCAAAAAGCA TATCGGTTAC
GTCAGCAGTA GTTTGCATCT GGATTACCGG GTCAGCACTA CTGTGCGTAA TGTGATCCTT
TCTGGCTATT TTGATTCGAT TGGCATTTAT CAGGCCGTAT CGGATCGCCA GCAAAAACTG
GTGCAGCAGT GGCTGGATAT TCTCGGCATT GATAAACGCA CGGCTGACGC TCCTTTCCAT
AGTCTTTCCT GGGGACAGCA GCGTCTGGCG CTGATCGTCC GTGCGCTAGT GAAACATCCG
ACATTGCTTA TTCTCGATGA ACCACTACAG GGGCTTGATC CGCTCAATCG CCAGCTTATC
CGCCGTTTTG TTGATGTGCT GATTAGCGAA GGTGAAACGC AATTGTTGTT TGTTTCGCAC
CACGCTGAAG ATGCGCCTGC CTGTATTACC CATCGTCTGG AGTTCGTGCC GGACGGTGAC
TTTTATCGCT ATGCGCTGAC AAAAATAAAC TGA
 
Protein sequence
MSSLQILQGT FRLGDTKTLQ LPRLTLNAGD SWAFVGSNGS GKSALARALS GELPLLKGER 
QSQFSHITRL SFEQLQKLVS DEWQRNNTDM LSPGEDDTGR TTAEIIQDEV KDTPRCMQLA
QQFGITALLD RRFKYLSTGE TRKTLLCQAL MSEPDLLILD EPFDGLDVAS RQQLAELLAS
LHQSGITLVL VLNRFDEIPE FVQFAGVLAD CTLAETGAKE ELLQQALVAQ LAHSEQLEGV
QMPEPDEPSA RHALPANEPR IVLNNGVVSY NERPILNNLS WQVNPGEHWQ IVGPNGAGKS
TLLSLITGDH PQGYSNDLTL FGRRRGSGET IWDIKKHIGY VSSSLHLDYR VSTTVRNVIL
SGYFDSIGIY QAVSDRQQKL VQQWLDILGI DKRTADAPFH SLSWGQQRLA LIVRALVKHP
TLLILDEPLQ GLDPLNRQLI RRFVDVLISE GETQLLFVSH HAEDAPACIT HRLEFVPDGD
FYRYALTKIN