Gene Information |
Locus tag | EcE24377A_2040 |
Symbol | pabB |
ID | 5588725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2022403 |
End bp | 2023764 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640925711 |
Product | para-aminobenzoate synthase component I |
Protein accession | YP_001463114 |
Protein GI | 157158150 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
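The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(2022403, 2023764)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1362 bp) and Protein Length (453 aa) fields in the record.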
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACGT TATCTCCCGC TGTGATTACT TTACCCTGGC GTCAGGACGC CGCTGAATTT TATTTCTCCC GCTTAAGCCA CCTGCCGTGG GCGATGCTTT TACACTCCGG CTATGCCGAT CATCCGTATA GCCGCTTTGA TATTGTGGTC GCCGATCCGA TTTGCACTTT AACCACTTTC GGTAAAGAAA CCGTTGTTAG TGAAAGCGAA AAACGCACAA CGACCACTGA TGACCCGCTA CAGGTGCTCC AGCAGGTGCT GGATCGCGCA GACATTCGCC CAACGCATAA CGAAGATTTG CCATTTCAGG GCGGCGCACT GGGGTTGTTT GGCTACGATC TGGGCCGCCG TTTTGAGTCA CTGCCAGAAA TTGCGGAACA AGATATCGTT CTGCCGGATA TGGCAGTGGG TATCTACGAT TGGGCGCTCA TTGTCGACCA CCAGCGTCAT ACAGTTTCTT TGCTGAGTCA TAATGATGTC AATGCCCGTC GGGCCTGGCT GGAAAGCCAG CAATTCTCGC CGCAGGAAGA TTTCACGCTC ACTTCCGACT GGCAATCCAA TATGACCCGC GAGCAGTACG GCGAAAAATT TCGCCAGGTA CAGGAATATC TGCACAGCGG TGATTGCTAT CAGGTGAATC TCGCCCAGCG TTTTCATGCG ACCTATTCTG GCGATGAATG GCAGGCATTC CTTCAGCTTA ATCAGGCCAA CCGCGCGCCA TTTAGCGCTT TTTTACGTCT TGAACAGGGT GCAATTTTAA GCCTTTCGCC AGAGCGGTTT ATTCTTTGTG ATAATAGTGA AATCCAGACC CGCCCGATTA AAGGCACGCT ACCACGCCTG CCCGATCCTC AGGAAGATAG CAAACAAGCA GAAAAACTGG CGAACTCAGC GAAAGATCGT GCCGAAAATC TGATGATTGT CGATTTAATG CGTAATGATA TCGGTCGTGT TGCCGTAGCC GGTTCGGTAA AAGTACCAGA GCTCTTCGTG GTGGAACCCT TCCCTGCCGT GCATCATCTG GTCAGCACTA TAACGGCGCG ACTACCAGAA CAGTTACACG CCAGCGATCT GCTGCGCGCA GCTTTTCCTG GTGGCTCAAT AACCGGGGCT CCGAAAGTAC GGGCTATGGA AATTATCGAC GAACTGGAAC CGCAGCGACG TAATGCCTGG TGCGGCAGCA TTGGCTATTT GAGCTTTTGC GGCAACATGG ATACCAGCAT TACTATCCGC ACGCTGACTG CCATTAACGG ACAAATATAC TGCTCTGCGG GCGGTGGAAT TGTCGCCGAT AGCCAGGAAG AAGCGGAATA TCAGGAAACT TTTGATAAAG TTAATAAGAT ATTACGCCAA CTGGAGAAGT AA
|
Protein sequence | MKTLSPAVIT LPWRQDAAEF YFSRLSHLPW AMLLHSGYAD HPYSRFDIVV ADPICTLTTF GKETVVSESE KRTTTTDDPL QVLQQVLDRA DIRPTHNEDL PFQGGALGLF GYDLGRRFES LPEIAEQDIV LPDMAVGIYD WALIVDHQRH TVSLLSHNDV NARRAWLESQ QFSPQEDFTL TSDWQSNMTR EQYGEKFRQV QEYLHSGDCY QVNLAQRFHA TYSGDEWQAF LQLNQANRAP FSAFLRLEQG AILSLSPERF ILCDNSEIQT RPIKGTLPRL PDPQEDSKQA EKLANSAKDR AENLMIVDLM RNDIGRVAVA GSVKVPELFV VEPFPAVHHL VSTITARLPE QLHASDLLRA AFPGGSITGA PKVRAMEIID ELEPQRRNAW CGSIGYLSFC GNMDTSITIR TLTAINGQIY CSAGGGIVAD SQEEAEYQET FDKVNKILRQ LEK
|
| |
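The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. It assumes the sense-codon assignments of translation table 11, which match those of the standard code, and it ignores the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGAAGACGT TATCTCCCGC TGTGATTACT"))
print(translate("CTGGAGAAGT AA"))
```

The two fragments translate to the first ten and last three residues of the listed protein sequence, which is consistent with the 10-character grouping used in both fields.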