Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0490 |
Symbol | |
ID | 6374154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 511503 |
End bp | 513326 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642683007 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001958934 |
Protein GI | 189499464 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.104803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGATA GCTTCGCAGC CAATACGATG TTGGATGAAC CCTTCACGGT TCTGTTTGGA GGTTCCTTCA GGAAAGATGA TCCCGGGGGG TATCTTCTGT TCTCAGACCC GGTGGATACT ATCGCGCTCA CCTCTCCTGA TGACCTTCGC TCATTTTTCG GAAAGCTTGA AGCATTTCTT GCTGAAGGGT TCAGTCTGGC AGGGTATGTC GGTTATGAGG CCGGCTACGG TTTTGAGCCT GAATCTTTTT CTTCCGAAAG CGCAGCGAAG GCGGTTGTTC CGCTTGCCTG GTTCGGGGCC TATCGCTCTG CTGAGAGATT GTCGGGGAAC GGGGCCGAAG GTCTATTTTC CGGGCAGTGC AGGCCGGGAG CGCTTCAGTT TGACATGACG CAGGGGGAGT ATGCGGAAAA GATTGACGAG ATAAAGAAAC ATATTGCGGC CGGGGATGTC TATCAGGTTA ACTTTACGGG AAGGTATCGT TTTGATTTTG GAGGAAGTGC CTCCTCGTTG TTCCGGTATC TCTCTTCCAG GCAGCCTGGA GTCTACTCAG CGTGGATGAA CCTTGGTGAG CATCAGCTGG CGTCGTTTTC TCCTGAACTG TTTTTCAGGA TGGAGGGGAA TGGTATTGAA ACCAGACCGA TGAAAGGAAC CGCACCGAGA GGTGGCAGTG AGCATGAAGA CAGGTTGTTC AGGGAGTGGT TGGGATCCAA TGAGAAAAAC AGGGCCGAAA ATCTCATGAT TGTGGATCTT CTTCGCAATG ATCTCGGGCG AATATGCAAA CCGGGTTCAG TTAACGTTCC TGAACTCTTT TCCGTCGAGA CCTACCCGAC ACTGCATCAG ATGGTATCCT CCGTTCGCGG AGAGGTACAG GACGACATTT CACTCTATGA ACTTTTCCGT GCGGTATTTC CCTGCGGTTC CGTGACAGGG GCTCCGAAAA TAAGAGCAAT GCAGCTGATT CAGGAGCTTG AGCGTTCACC AAGAGGGGTC TACACCGGTG CAGCAGGCTA TATGCTTCCT GACAGGTCAA TGTGTTTCAA TGTGGCAATC AGGACAGCGA TGTTGTGTGG TCATACCGGA GAATACGGGG CCGGAGGAGG TATTGTATGG GATTCGAATA CCGGGGAGGA GTACAACGAG TGCAGATTGA AAGCGAAAAT CCTCAAACCA GGCAAGGCTG AAAATTTCGG CATTTTTGAA ACCATATTGT ATAACGGATC TTTTGTCTGG CTTGACGAGC ATCTTTTCCG TCTCAGCGAG TCGGCAAGGT GTCTGGGCTT TTCGTGTGAC CTGGAGAGGA TCAGGCGCGA ACTGGAACGT CTGACTGATG AAGAGCTCAG GGGGAGGGGT AGATATAAGG TTCGTCTTGA ACTTCATCCT GAAGGTACTT TTCAGATAAC TGTTGATGAC CTTTCTGAGA GCCCCTCATC TGATCCGGTT TCTGTTTGCA GAGCAGGAGT ATCCCTGCCC TCAGACGGTC ATCTCAGAAT GCATAAAACA ACAAGGAGGG AGCTCTACGA TAAGTTGTTG CGAAAAGCAA AGAAAAGGGG TTACGATGAA CTGCTGTTCT GCAATGACAG AGGGGAGGTT GCCGAAGGAG CGATAAGCAA CATCATTATC TGTTCTGACG GGCACTATGT TACACCGGGC CTTTCTTCCG GACTGCTTAA CGGCATCTAT CGGCAATATT TTCTTTCAAC CCGCATGAAT GTTCAGGAAG CGATACTCAC TATGCATGAT ATAGAACAGG CCGATCTCCT GTTTGTCTGT AATTCATTGA GAGGGTTGAG AAGAGCGGTT CTTTTCGATG AGGTGGTGTC ATGA
|
Protein sequence | MRDSFAANTM LDEPFTVLFG GSFRKDDPGG YLLFSDPVDT IALTSPDDLR SFFGKLEAFL AEGFSLAGYV GYEAGYGFEP ESFSSESAAK AVVPLAWFGA YRSAERLSGN GAEGLFSGQC RPGALQFDMT QGEYAEKIDE IKKHIAAGDV YQVNFTGRYR FDFGGSASSL FRYLSSRQPG VYSAWMNLGE HQLASFSPEL FFRMEGNGIE TRPMKGTAPR GGSEHEDRLF REWLGSNEKN RAENLMIVDL LRNDLGRICK PGSVNVPELF SVETYPTLHQ MVSSVRGEVQ DDISLYELFR AVFPCGSVTG APKIRAMQLI QELERSPRGV YTGAAGYMLP DRSMCFNVAI RTAMLCGHTG EYGAGGGIVW DSNTGEEYNE CRLKAKILKP GKAENFGIFE TILYNGSFVW LDEHLFRLSE SARCLGFSCD LERIRRELER LTDEELRGRG RYKVRLELHP EGTFQITVDD LSESPSSDPV SVCRAGVSLP SDGHLRMHKT TRRELYDKLL RKAKKRGYDE LLFCNDRGEV AEGAISNIII CSDGHYVTPG LSSGLLNGIY RQYFLSTRMN VQEAILTMHD IEQADLLFVC NSLRGLRRAV LFDEVVS
|
| |