Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1831 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 1982312 |
End bp | 1983673 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | ACX39489 |
Protein GI | 260449067 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.306293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACGT TATCTCCCGC TGTGATTACT TTACCCTGGC GTCAGGACGC CGCTGAATTT TATTTCTCCC GCTTAAGCCA CCTGCCGTGG GCGATGCTTT TACACTCCGG CTATGCCGAT CATCCGTATA GCCGCTTTGA TATTGTGGTC GCCGAGCCGA TTTGCACTTT AACCACTTTC GGTAAAGAAA CCGTTGTTAG TGAAAGCGAA AAACGCACAA CGACCACTGA TGACCCGCTA CAGGTGCTCC AGCAGGTGCT GGATCGCGCA GACATTCGCC CAACGCATAA CGAAGATTTG CCATTTCAGG GCGGCGCACT GGGGTTGTTT GGCTACGATC TGGGCCGCCG TTTTGAGTCA CTGCCAGAAA TTGCGGAACA AGATATCGTT CTGCCGGATA TGGCAGTGGG TATCTACGAT TGGGCGCTCA TTGTCGACCA CCAGCGTCAT ACAGTTTCTT TGCTGAGTCA TAATGATGTC AATGCCCGTC GGGCCTGGCT GGAAAGCCAG CAATTCTCGC CGCAGGAAGA TTTCACGCTC ACTTCCGACT GGCAATCCAA TATGACCCGC GAGCAGTACG GCGAAAAATT TCGCCAGGTA CAGGAATATC TGCACAGCGG TGATTGCTAT CAGGTGAATC TCGCCCAACG TTTTCATGCG ACCTATTCTG GCGATGAATG GCAGGCATTC CTTCAGCTTA ATCAGGCCAA CCGCGCGCCA TTTAGCGCTT TTTTACGTCT TGAACAGGGT GCAATTTTAA GCCTTTCGCC AGAGCGGTTT ATTCTTTGTG ATAATAGTGA AATCCAGACC CGCCCGATTA AAGGCACGCT ACCACGCCTG CCCGATCCTC AGGAAGATAG CAAACAAGCA GTAAAACTGG CGAACTCAGC GAAAGATCGT GCCGAAAATC TGATGATTGT CGATTTAATG CGTAATGATA TCGGTCGTGT TGCCGTAGCA GGTTCGGTAA AAGTACCAGA GCTGTTCGTG GTGGAACCCT TCCCTGCCGT GCATCATCTG GTCAGCACCA TAACGGCGCA ACTACCAGAA CAGTTACACG CCAGCGATCT GCTGCGCGCA GCTTTTCCTG GTGGCTCAAT AACCGGGGCT CCGAAAGTAC GGGCTATGGA AATTATCGAC GAACTGGAAC CGCAGCGACG CAATGCCTGG TGCGGCAGCA TTGGCTATTT GAGCTTTTGC GGCAACATGG ATACCAGTAT TACTATCCGC ACGCTGACTG CCATTAACGG ACAAATTTTC TGCTCTGCGG GCGGTGGAAT TGTCGCCGAT AGCCAGGAAG AAGCGGAATA TCAGGAAACT TTTGATAAAG TTAATCGTAT CCTGAAGCAA CTGGAGAAGT AA
|
Protein sequence | MKTLSPAVIT LPWRQDAAEF YFSRLSHLPW AMLLHSGYAD HPYSRFDIVV AEPICTLTTF GKETVVSESE KRTTTTDDPL QVLQQVLDRA DIRPTHNEDL PFQGGALGLF GYDLGRRFES LPEIAEQDIV LPDMAVGIYD WALIVDHQRH TVSLLSHNDV NARRAWLESQ QFSPQEDFTL TSDWQSNMTR EQYGEKFRQV QEYLHSGDCY QVNLAQRFHA TYSGDEWQAF LQLNQANRAP FSAFLRLEQG AILSLSPERF ILCDNSEIQT RPIKGTLPRL PDPQEDSKQA VKLANSAKDR AENLMIVDLM RNDIGRVAVA GSVKVPELFV VEPFPAVHHL VSTITAQLPE QLHASDLLRA AFPGGSITGA PKVRAMEIID ELEPQRRNAW CGSIGYLSFC GNMDTSITIR TLTAINGQIF CSAGGGIVAD SQEEAEYQET FDKVNRILKQ LEK
|
| |