Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0523 |
Symbol | folP |
ID | 6068728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 563449 |
End bp | 564297 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641599928 |
Product | dihydropteroate synthase |
Protein accession | YP_001723527 |
Protein GI | 170018573 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00018335 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00147266 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTCT TTGCCCAGGG TACTTCACTG GACCTTAGCC ATCCTCACGT AATGGGGATC CTCAACGTCA CGCCTGATTC CTTTTCGGAT GGTGGCACGC ATAACTCGCT GATAGATGCG GTGAAACATG CGAATCTGAT GATCAACGCT GGCGCGACGA TCATTGACGT TGGTGGCGAG TCCACGCGCC CAGGGGCGGC GGAAGTTAGC GTTGAAGAAG AGTTGCAACG TGTTATTCCT GTGGTTGAGG CAATTGCTCA ACGCTTCGAA GTCTGGATCT CGGTAGATAC ATCCAAACCA GAAGTCATCC GTGAGTCAGC GAAAGTTGGC GCTCACATTA TTAATGATAT CCGCTCCCTT TCCGAACCTG GCGCTCTGGA GGCGGCTGCA GAAACCGGTT TACCGGTTTG TCTGATGCAT ATGCAGGGAA ATCCAAAAAC CATGCAGGAA GCTCCGAAGT ATGACGATGT CTTTGCAGAA GTGAATCGCT ACTTTATTGA GCAAATAGCA CGTTGCGAGC AGGCGGGTAT CGCAAAAGAG AAATTGTTGC TCGACCCCGG ATTCGGTTTC GGTAAAAATC TCTCCCATAA CTATTCATTA CTGGCGCGCC TGGCTGAATT TCACCATTTC AACCTGCCGC TGTTGGTAGG TATGTCACGA AAATCGATGA TTGGGCAGCT GCTGAACGTG GGGCCGTCCG AGCGCCTGAG CGGTAGTCTG GCCTGTGCGG TCATTGCCGC AATGCAAGGC GCGCACATTA TTCGTGTTCA TGACGTCAAA GAAACCGTAG AAGCGATGCG GGTGGTGGAA GCCACTCTGT CTGCAAAGGA AAACAAACGC TATGAGTAA
|
Protein sequence | MKLFAQGTSL DLSHPHVMGI LNVTPDSFSD GGTHNSLIDA VKHANLMINA GATIIDVGGE STRPGAAEVS VEEELQRVIP VVEAIAQRFE VWISVDTSKP EVIRESAKVG AHIINDIRSL SEPGALEAAA ETGLPVCLMH MQGNPKTMQE APKYDDVFAE VNRYFIEQIA RCEQAGIAKE KLLLDPGFGF GKNLSHNYSL LARLAEFHHF NLPLLVGMSR KSMIGQLLNV GPSERLSGSL ACAVIAAMQG AHIIRVHDVK ETVEAMRVVE ATLSAKENKR YE
|
| |