Gene EcHS_A0814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0814 
SymbolmodF 
ID5594481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp818919 
End bp820391 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content53% 
IMG OID640919986 
Productputative molybdenum transport ATP-binding protein ModF 
Protein accessionYP_001457553 
Protein GI157160235 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1119] ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCGT TGCAAATTTT GCAAGGCACG TTTCGTCTTA GCGACACAAA AACGCTGCAA 
TTGCCTCAGC TAACGTTAAA CGCGGGTGAT AGTTGGGCGT TTGTCGGTTC GAATGGAAGC
GGGAAATCGG CCCTGGCCCG CGCGCTGGCG GGGGAACTTC CGCTTTTGAA AGGTGAACGG
CAAAGCCAGT TTTCCCACAT CACTCGTCTC TCCTTCGAGC AATTGCAAAA GCTCGTCAGC
GACGAATGGC AACGGAATAA CACCGATATG CTCGGCCCTG GCGAAGATGA CACCGGACGC
ACTACGGCTG AGATTATTCA GGATGAAGTA AAGGATGCAC CGCGTTGCAT GCAACTGGCG
CAGCAGTTCG GTATTACCGC CCTCCTCGAC CGACGCTTTA AATACCTTTC CACTGGCGAG
ACGCGAAAAA CCCTGCTGTG TCAGGCGCTG ATGTCGGAGC CTGACTTGTT GATTCTTGAT
GAGCCGTTCG ATGGCCTGGA TGTTGCTTCA CGTCAGCAGC TGGCTGAGCG ACTCGCCTCG
TTACATCAGT CCGGTATTAC TCTGGTACTG GTGCTCAATC GCTTCGATGA GATCCCGGAA
TTTGTCCAGT TTGCTGGCGT GCTGGCGGAT TGTACGTTAG CGGAAACGGG CGCTAAAGAG
GAACTGCTCC AGCAAGCACT CGTCGCGCAA CTGGCGCATA GCGAACAGCT TGAAGGTGTG
CAACTGCCGG AACCGGATGA ACCTTCAGCA CGTCACGCCT TACCCGCCAA CGAACCGCGC
ATTGTGCTGA ACAATGGCGT GGTTTCTTAT AACGATCGCC CCATTCTTAA TAACCTTAGC
TGGCAGGTGA ATCCAGGCGA ACACTGGCAA ATTGTCGGGC CAAATGGTGC GGGAAAATCG
ACGTTATTAA GCCTGATTAC TGGCGATCAT CCGCAAGGTT ACAGCAACGA TTTGACGCTT
TTCGGCCGAC GTCGTGGCAG CGGCGAAACC ATCTGGGATA TCAAAAAGCA TATCGGTTAC
GTCAGCAGTA GTTTGCATCT GGATTACCGG GTCAGCACTA CCGTGCGTAA TGTGATTCTT
TCTGGCTATT TCGATTCGAT TGGCATTTAT CAAGCCGTTT CGGATCGCCA GCAAAAACTG
GTGCAGCAGT GGCTGGATAT TCTCGGCATT GATAAACGCA CGGCTGACGC TCCGTTCCAT
AGTCTTTCCT GGGGACAGCA GCGTCTGGCG CTGATCGTCC GCGCACTGGT GAAACATCCG
ACGTTGCTTA TTCTCGATGA ACCACTACAG GGGCTTGATC CGCTCAATCG CCAGCTTATC
CGCCGTTTTG TTGATGTGCT GATTAGCGAA GGTGAAACGC AATTGTTGTT TGTTTCGCAC
CACGCTGAAG ATGCGCCTGC CTGTATTACC CATCGCCTTG AGTTCGTGCC GGACGGTGGA
CTCTATCGCT ATGTGCTGAC AAAAATATAT TGA
 
Protein sequence
MSSLQILQGT FRLSDTKTLQ LPQLTLNAGD SWAFVGSNGS GKSALARALA GELPLLKGER 
QSQFSHITRL SFEQLQKLVS DEWQRNNTDM LGPGEDDTGR TTAEIIQDEV KDAPRCMQLA
QQFGITALLD RRFKYLSTGE TRKTLLCQAL MSEPDLLILD EPFDGLDVAS RQQLAERLAS
LHQSGITLVL VLNRFDEIPE FVQFAGVLAD CTLAETGAKE ELLQQALVAQ LAHSEQLEGV
QLPEPDEPSA RHALPANEPR IVLNNGVVSY NDRPILNNLS WQVNPGEHWQ IVGPNGAGKS
TLLSLITGDH PQGYSNDLTL FGRRRGSGET IWDIKKHIGY VSSSLHLDYR VSTTVRNVIL
SGYFDSIGIY QAVSDRQQKL VQQWLDILGI DKRTADAPFH SLSWGQQRLA LIVRALVKHP
TLLILDEPLQ GLDPLNRQLI RRFVDVLISE GETQLLFVSH HAEDAPACIT HRLEFVPDGG
LYRYVLTKIY