Gene Information |
Locus tag | Cpha266_0499 |
Symbol | |
ID | 4569269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 547701 |
End bp | 549548 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765098 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_910980 |
Protein GI | 119356336 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase; [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
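The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. Both conventions are assumptions here, since the record does not state them. A minimal Python check under those assumptions:

```python
# Values copied from the Gene Information fields above.
start_bp = 547701
end_bp = 549548

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1848 615
```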
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGATC GGGAATCTTG CAGGATTCAG CGGCTTCTTC TCCATGAGGG GACAGTCTGG CTGGACGAAT CGTATGGTGC TCGACATGAG GGCGAAAGCC TGCTTTTTTC CGAACCTCTT GAGATGCTCG CTCTTTTTTC GGCAAGCGGT ATCAGGGAGT TTTTTACTCT TCTCGAACAA AAACTCGACA GCGGGTATTG GCTTGCCGGA TGGATGGGCT ATGAGGCAGG ATACGGTTTT GAGTCGGAAC GCTTTCTTGC GAACTGCGAT CCAGAGAGAG CTGCGCCTCT TGCCTGGTTT GGCGTTTATC GGCCCCCGGA GCGATTTGCC TCTATTGTGA GCGAGAGGAT GTTTTCTCAG GTAGCTGTCG ATGAACCGTT CATGATCAGC GATTTTCGTT TTAATCGCAC TCCTGCCGAG TATTTTCAAG ATGTTGAGAG GGTGAAACGG GAGATTGCCA AAGGAAATGT TTATCAGGTT AACGTTACCG GCCGGTTTCG TTTTTCTTTT CATGGTTCAG CTCAGGCATT GTTCGGTGCG TTGTATCCGC AGCAGCCTTC GGTGTATTCC GCTTTTATCA ATACCGGTCG GCATCAGGTG CTCTCTTTTT CTCCCGAGCT TTTTTTTCGG CGCCGGGCTT GTACGATTGA AGCTATGCCG ATGAAAGGGA CGGCTCCGAG AAGCGGCGTT GTTGAAGAAG ATAATCGACT GAAGAGGCAA TTATCGCTCT GCGAGAAGAA TCGGGCTGAA AACCTGATGA TTGTCGATCT GTTGCGTAAC GATCTTGGCA GAATATGCAG ATCAGGTTCA GTTGAGGTTT CGGGGATGTT TGCCACTGAG ACCTATCCTA CGCTTCACCA GATGGTGTCA ACTATTCGCG GGGAACTGAA GGATGAAAAC GGTCTTTTCG AGACGTTTCG CGCTCTGTAT CCCTGCGGTT CCATTACGGG AGCGCCAAAA ATCAGCGCAA TGCAGTTGAT CCGTGAACTT GAGCCGGGCT TGAGGGGGTG CTACACCGGT ACCATCGGGT ATATCAATCC TCGGAGAGAT ATGGTTTTCA GTGTTGCGAT TCGTACGCTT GAACTGTCCG GCAATGAGGG CGTCTATGGT TCGGGCGGCG GCATCGTCTG GGATTCCGAT CCGGAGGAGG AGTACCAGGA GTGCATGCTG AAGGCAAAAA TTCTTGATGA TGTTACCGCA GAGAGCGTTG AGCTGTTTGA GAGCATGCTT TTTTCCGGCA GATTTCTCTG GATGCAGGAG CATCTTGGAC GGCTTGCGGC TTCGGCAAGA GTTCTCGGAT TTGCTTTCGA TGCTGCAGAG GCTGCAGCCC GACTTGCGGC CCTTGCTGAT ATTCTTTTCA AGGCAGGCGG ACGTTTCAAG GTCAGACTCG CTCTTGAAAA AAACGGGCGG ATGGCCATCA GCTATGAATC GCTGCTGTCT GATACAGGTC GCGTTCCCCT GAAGCTCTCT CTTGCGGATG GATTTATCGA CTCATCAAAT TCGCTTCGAT TTCATAAAAC CAGTTCGCGA AAACGGTACG AACGGTATTA CCGGAAAGCT CTTGAGAGCG GTTATGACGA GGTTGTGTTT CTCAATGAAC GCCAGGAGGT GACGGAAGGG GCGATCAGCA ACATTATTAT TCTCAAAAGC CGACGCTATG TTACTCCCTC TCTTGGTTCA GGATTGCTTG ACGGCATTTA TCGACGTTAT TTTCTTTCTC TTCATCCGAA TGCTTCAGAA CAAGTGCTCA CGCTGAAGGA TCTTTTCGAG GCTGATCAGG TTTTTATCTG CAATTCGGTT 
CGAGGATTGC GTCCCGTCGT GTTCGATGGG TTTGTTATCT CCATGTAA
|
Protein sequence | MRDRESCRIQ RLLLHEGTVW LDESYGARHE GESLLFSEPL EMLALFSASG IREFFTLLEQ KLDSGYWLAG WMGYEAGYGF ESERFLANCD PERAAPLAWF GVYRPPERFA SIVSERMFSQ VAVDEPFMIS DFRFNRTPAE YFQDVERVKR EIAKGNVYQV NVTGRFRFSF HGSAQALFGA LYPQQPSVYS AFINTGRHQV LSFSPELFFR RRACTIEAMP MKGTAPRSGV VEEDNRLKRQ LSLCEKNRAE NLMIVDLLRN DLGRICRSGS VEVSGMFATE TYPTLHQMVS TIRGELKDEN GLFETFRALY PCGSITGAPK ISAMQLIREL EPGLRGCYTG TIGYINPRRD MVFSVAIRTL ELSGNEGVYG SGGGIVWDSD PEEEYQECML KAKILDDVTA ESVELFESML FSGRFLWMQE HLGRLAASAR VLGFAFDAAE AAARLAALAD ILFKAGGRFK VRLALEKNGR MAISYESLLS DTGRVPLKLS LADGFIDSSN SLRFHKTSSR KRYERYYRKA LESGYDEVVF LNERQEVTEG AISNIIILKS RRYVTPSLGS GLLDGIYRRY FLSLHPNASE QVLTLKDLFE ADQVFICNSV RGLRPVVFDG FVISM
|