Gene Information |
Locus tag | Cpha266_1856 |
Symbol | |
ID | 4571198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2150471 |
End bp | 2152009 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639766438 |
Product | anthranilate synthase, component I |
Protein accession | YP_912296 |
Protein GI | 119357652 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
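The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2150471, 2152009)
print(length)                     # 1539
print(protein_length_aa(length))  # 512
```

Both results agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields. The strand does not affect the calculation, since the length depends only on the span of the coordinates.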
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.415025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGGGA AAAAATTAAT GACAGGTGTC CTACAGCATT ACTGTACTAT GGCTTTTAAT ACACGGCATA CTCGCTACGT TCTCAAACCG CTTGTCAAGG AAGTTTATGC CGATACCGAA ACGCCTGTTT CCGTATACCT CAAACTTCAG CGTGACTACT CCTGCCTGCT TGAATCGGTC GAAGGCGAAG AGATGCTCGC CCGGTTTTCC TATATCGCGT TTGATCCGGT CGCACTTCTC CGTGGTTCGG TCAATGGAGA GATCTCGCTT GCTATTCTTG ATCAGAAGTT CAATTCCCTG TCAGCCATAA CGGAGCAGGA AACCAATTTG CGAACGATCA TCGATCACTG CCTTGAGGCG TTTGATACCG AAGAGATTCC GAGGAAAAAA AACGGGTCAC CGCAGATGAT TACCTCCGGG GTGTTCGGTT ATTTCGGTTA TGATGCCATG CATCTTGTTG AACGGATTCC CGAACCCGAA TCTCCTGATC CGGCCGGCAT GCCCGATGTT TTTTTGCTTT TTTGCGATAC CCTTGTGGTT TTTGACAATA TCATGCGCAA GGCCTTTATT ATTGTCAACT ACCTTGATGA CGATGATAAG CCTGCTGCTT CAGAAAAGAT AGAACGCATC GCCGAACAGA TGTTCCGTCC GCTCTCGGCG GAGGAGATTT CGCTGCAGAC GGAAAAACCG GAACCGGTAG TTTCCAATAC CATGAAAGAG GACTATTTAC AGAAGGTACT TCAAGCCAAG GAGTATATTC TTGACGGTGA TATATTTCAG GTACAGGTAT CCCAGCGTCT CAGAAGGAGG CTCAATACTC GCCCTTTTGA TGTGTATCGA ATGCTTCGAA CCATAAACCC TTCGCCTTAT CTCTATTATT TCGATTTGAA GGAGTTCAGG ATTATCGGTT CTTCCCCGGA ACTTCTCGTC AAGGTTGAGC GAGACAGTAG CGGTCGCAGG ATGGTTGATA CCAGGCCGAT AGCCGGAACG CGGCATCGGG GATTGAGCTT TGAAGAGGAT GAAGCAATAG CTCACGAGCT GCTGGCCGAT GAGAAAGAGT GTGCGGAACA TCTCATGCTT ATTGATCTGA GCCGGAACGA CATAGGAAGA ATAGCCAAAA TCGGGACAGT CGAGACCAAC GAGATGATGG TTATTGAAAA ATATTCACAT GTCATGCATA TTGTCAGCAA TGTGCGCGGG GAGTTGAGAG ACGATCTTGG AACCATGGAC GCCTTCTGGT CATGCTTCCC TGCCGGCACA CTTACCGGAG CCCCGAAAGT TCGTGCCATG GAGATAATAT ATGAGCTCGA AAAAGAAAAG CGGGGTCTGT ATGGCGGCGC TGTAGGTTTT CTTGACTTCA AAGGCAATCT GACGACTGCA ATTGCAATTC GAACGATGGT TGTTGAAGGG GGAACCATCT ATTTTCAGGC TGCGGGGGGT ATTGTAGCCG ACTCGAAACC GGTTGCCGAA TATGACGAAA CGATGAATAA GATGAGAGCC GGATTGACGG CGCTTGAACG TATGGAAACT TTGCAGTAA
|
Protein sequence | MSGKKLMTGV LQHYCTMAFN TRHTRYVLKP LVKEVYADTE TPVSVYLKLQ RDYSCLLESV EGEEMLARFS YIAFDPVALL RGSVNGEISL AILDQKFNSL SAITEQETNL RTIIDHCLEA FDTEEIPRKK NGSPQMITSG VFGYFGYDAM HLVERIPEPE SPDPAGMPDV FLLFCDTLVV FDNIMRKAFI IVNYLDDDDK PAASEKIERI AEQMFRPLSA EEISLQTEKP EPVVSNTMKE DYLQKVLQAK EYILDGDIFQ VQVSQRLRRR LNTRPFDVYR MLRTINPSPY LYYFDLKEFR IIGSSPELLV KVERDSSGRR MVDTRPIAGT RHRGLSFEED EAIAHELLAD EKECAEHLML IDLSRNDIGR IAKIGTVETN EMMVIEKYSH VMHIVSNVRG ELRDDLGTMD AFWSCFPAGT LTGAPKVRAM EIIYELEKEK RGLYGGAVGF LDFKGNLTTA IAIRTMVVEG GTIYFQAAGG IVADSKPVAE YDETMNKMRA GLTALERMET LQ
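The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene is on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI "TCAG" order. Translation table 11 shares these
# amino acid assignments with the standard code; the two differ only in
# which codons may act as starts. This gene starts with ATG, so no
# special handling of alternative start codons is included here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGTCGGGGA AAAAATTAAT GACAGGTGTC"))  # MSGKKLMTGV
```

Passing the full Gene sequence field to `translate` and removing the spaces from the Protein sequence field allows a direct string comparison, and `gc_percent` on the same input can be compared with the GC content field.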