Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0422 |
Symbol | |
ID | 3747688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 495126 |
End bp | 496898 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637772952 |
Product | Para-aminobenzoate synthase, component I |
Protein accession | YP_378738 |
Protein GI | 78188400 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.744844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTTTC ATAACCCACA TGAAGTGTTG ATGCTGAATG CGTTCGATGG AGTAGAAGAT TTTTTTAAGA AGATAGAAGA GAGGGTTGCG GCAGGATTTT TTGTGGCGGG ATGGTTAAGC TATGAAGCTG CCTACGGCAT GGATAGCGCA CTTGCCGAAA TGGCAACGGC GCAAACGTGG CAAGCGCCAC TTGCGTGGTT TGGGGTATAT AAAGCACCAC AACGCTTTAC GGCTGATGAG GTGGCACAAC TCTTTCCACC ATCCTTAACG ACTGCCATTA CAGCACCGCA TTGTTCTACC ACTGAGATTG ACCATGCCGA GCAAGTTGCC GCTATTCGAG AAGAGATTGC GGCTGGCAAG GTGTATCAAG TCAACTTGAC TGCTCGCTAT CACTTTAGCA TGGCGGGAGA GGCACCTGCG CTTTTTGCAG CGCTGCGCCA ACAGCAACCC GCCTCCTACA CGGCATTTCT TAATTGCGGA GAACGCACCA TCCTCTCCTT TTCACCTGAA CTTTTTTTCC GAACTGATGG CTGCGCCATT GAAACGCGCC CCATGAAAGG CACTGCACCT CGTGGCAGTT CAGCGGAAGA AGATGCCCAT TTGCGCTTGC AGCTTCAGCA ATGCGAAAAA AATTGTGCCG AAAACTTGAT GATTGTGGAC TTGCTCCGCA ACGATTTAGG GAGAATTTGT ACCCCCGCCA CCATTAAAGC TACGAAGCTT TTTGCTACCG AAAGCTGGCC TACGCTTCAC CAAATGATCT CCACAATTTC GGGTGAACTG CGCAATAACG TCAGTTTATA CGAACTTTTT CAAGCGCTCT ACCCCTGCGG CTCCATTACA GGAGCACCAA AAATAAGCGC CATGCAGTTA ATTCAGCAGC TTGAACAATC GCCACGCGGC ATTTATACAG GCGCTATTGG CTACATAACG CCGCCATCAG CTCAAGTATC TGCACAAACC ATGCGCTTTA GCGTAGCAAT CCGCACCCTT GAGCTGCAAG GGCAGCACGG CATCTATGGC TCTGGTGGAG GTATTGTGTG GGATTCCGTT GCGGCTGATG AGTATTGCGA ATGCCAACTC AAAACTAAAA TTCTTGAGAG CATTGCCGCC CCACCATTTG AACTGTTTGA AACCATGCTG TGGCATGATG GATGCTACCT CTGGCTTAAT GAACACCTCA ATCGCCTTGC GAACTCAGCC AAAGCACTTG GCTTTGCATT TGAACGTCAA GCAACATTGC AGCAACTTCT TGCCTTTGAA GTGGAACTGC AACAGTCCCC AAAAAAACGC TGTAAAGTAA AACTCACCCT TTTTCGCAAT GGTGAAGTAC AGCTTGATGC CGAAGCCGTT TCGCCTGACT TATCAGGGCG CTTGATGCTT GTAACGCTTG CAGAAAAGCC TGTTTCGAGC AATGAAGAGG CGTGGCTTCA GCACAAAACA ACCTTGCGCC ACTCGTATGA CAGCGCATTT GCCGCTGCCC GTGCGGCTGG CTACGACGAG GTTATTTTTT GCAACCAACG TGGCGAAATT ACGGAAGGCG CAATCAGCTC AATTATGGTG CGGCACGGCT CCCAACTTCT TACGCCATCA CTTGCTTGTG GGCTGCTCAA CAGCATTAGC CGTCGCTACC TGCTTGCCAC CCGCCCAAAT TTGCGTGAAG CCACTCTGTA CCCCAATGAC CTTGTTACTG CCGACATGCT CTATATTGCA AACTCCGTGC GCGGCATTCG CCCAGCAGTA ATGGAGCAAG AAATGAAACG TATAGAAAAA TAA
|
Protein sequence | MLFHNPHEVL MLNAFDGVED FFKKIEERVA AGFFVAGWLS YEAAYGMDSA LAEMATAQTW QAPLAWFGVY KAPQRFTADE VAQLFPPSLT TAITAPHCST TEIDHAEQVA AIREEIAAGK VYQVNLTARY HFSMAGEAPA LFAALRQQQP ASYTAFLNCG ERTILSFSPE LFFRTDGCAI ETRPMKGTAP RGSSAEEDAH LRLQLQQCEK NCAENLMIVD LLRNDLGRIC TPATIKATKL FATESWPTLH QMISTISGEL RNNVSLYELF QALYPCGSIT GAPKISAMQL IQQLEQSPRG IYTGAIGYIT PPSAQVSAQT MRFSVAIRTL ELQGQHGIYG SGGGIVWDSV AADEYCECQL KTKILESIAA PPFELFETML WHDGCYLWLN EHLNRLANSA KALGFAFERQ ATLQQLLAFE VELQQSPKKR CKVKLTLFRN GEVQLDAEAV SPDLSGRLML VTLAEKPVSS NEEAWLQHKT TLRHSYDSAF AAARAAGYDE VIFCNQRGEI TEGAISSIMV RHGSQLLTPS LACGLLNSIS RRYLLATRPN LREATLYPND LVTADMLYIA NSVRGIRPAV MEQEMKRIEK
|
| |