Gene Information |
Locus tag | SbBS512_E2228 |
Symbol | pabC |
ID | 6270652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2027606 |
End bp | 2028415 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641726249 |
Product | 4-amino-4-deoxychorismate lyase |
Protein accession | YP_001880734 |
Protein GI | 187733311 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR03461] aminodeoxychorismate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000000140571 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTAA TTAACGGTCA TAAGCAGGAA TCGCTGGCAG TAAGCGATCG GGCAACGCAG TTTGGTGATG GTTGTTTTAC CACCGCCAGA GTTATCGACG GTAAAGTCAG TTTGTTATCG GCGCATATCC AGCGACTACA GGATGCTTGT CAGCGGTTGA TGATTTTCTG TGACTTCTGG CCTCAGCTTG AACAAGAGAT GAAAACGCTG GCAGCAGAAC AGCAAAATGG TGTGCTGAAA GTCGTGATCA GTCGCGGTAG TGGCGGGCGA GGGTACAGCA CATTGAACAG TGGACCGGCA ACGCGGATTC TCTCCATTAC GGTTTATCCT GCACATTACG ACCGTTTGCG TAACGAAGGG ATGACGTTGG CGCTAAGCCC GGTGCGGCTG GGGCGCAATC CCCATCTTGC AGGTATTAAA CATCTTAATC GGCTTGAGCA AGTATTGATC CGCTCTCATC TTGAGCAGAC AAACGCTGAT GAGGCGCTGG TCCTTGACAG CGAAGGGTGG GTTACGGAAT GCTGTGCGGC TAATTTGTTC TGGCGGAAGA GCAATGTAGT TTATACGCCG CGACTGGATC AGGCAGGTGT TAACGGCATT ATGCGACAAT TCTGTATCCG TTTGCTGGCA CAATCCTCTT ATCAGCTTGT CGAAGTGCAA GCCTCTCTGG AAGAGGCGTT GCAGGCAGAT GAGATGGTTA TTTGTAATGC GTTAATGCCA GTGATGCCCG TACGTGCCTG TGGCGATGTC TCCTTTTCGT CAGCAACGTT ATATGAATAT TTAGCCCCAC TTTGTGAGCG CCCGAATTAG
|
Protein sequence | MFLINGHKQE SLAVSDRATQ FGDGCFTTAR VIDGKVSLLS AHIQRLQDAC QRLMIFCDFW PQLEQEMKTL AAEQQNGVLK VVISRGSGGR GYSTLNSGPA TRILSITVYP AHYDRLRNEG MTLALSPVRL GRNPHLAGIK HLNRLEQVLI RSHLEQTNAD EALVLDSEGW VTECCAANLF WRKSNVVYTP RLDQAGVNGI MRQFCIRLLA QSSYQLVEVQ ASLEEALQAD EMVICNALMP VMPVRACGDV SFSSATLYEY LAPLCERPN
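
The fields above can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"), and that the final codon is a stop codon not counted in the protein length. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database API. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard table is used here.

```python
from itertools import product

# Values copied from the record above.
start_bp, end_bp = 2027606, 2028415

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted.
protein_length = gene_length // 3 - 1

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 10 bases of the gene sequence listed above.
first_30 = "ATGTTCTTAA TTAACGGTCA TAAGCAGGAA"
last_9 = "CCGAATTAG"

print(gene_length)          # 810
print(protein_length)       # 269
print(translate(first_30))  # MFLINGHKQE
print(translate(last_9))    # PN*
```

The computed values agree with the Gene Length (810 bp) and Protein Length (269 aa) rows, and the translated fragments match the first ten and last two residues of the listed protein sequence.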