Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2277 |
Symbol | |
ID | 4568693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2610038 |
End bp | 2611342 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639766839 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_912693 |
Protein GI | 119358049 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.252741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTAT TCAAGGGTGA AGTAACCGCG TTGCCGCCTG ACAAGTCGAT CTCTCATCGT GCGGCACTGA TCGGAGCGTT GTCGGATGGA ACGACGGAAA TTGTAAATTT TTCCGGAGGG TTCGATAACC AGTCGACACT CGCTGTCCTG CAGGCTTCGG GTATTGCTCT TATTCAGGAA GAGTGTGCCG GCAGTTACGG CAGGAGAATA AGACGGGTTG TTATCGAATC GAGGGGGTTA TGGAGTTTTC TTGCCCCGCA GGCACCGTTG ATGTGCAATA ATTCCGGCAG TACCATGCGA ATGTTTGCCG GTATTCTTGC AGCTCAGCCT TTTGAGAGCG TTCTTGAAGG CGACAGTTCA CTCATGAAGC GGCCGATGAA TCGTGTTGCC GATCCCTTGC GGCAGATGGG CGCCCAGGTA GAGCTCTCTT TTTCGGGGAC CGCGCCGATT CGGATACAGG GCACAAAAGA TCTTCACTCG CTTGAATACC GTCTTCCTGT ATCTTCAGCA CAGGTTAAGT CGCTTGTTGC CTTTGCTGCG CTTCATGCCG ACGGCCAAAC CCGCATTATC GAACCGATTC GATCGCGCGA CCATACCGAG CTGATGCTTG GTCTTGAAAC CATCGATCAG CCGAATGGCG AGCGGGTGAT TATTGTTCCC GGCCGTAAAC GTATCGAATC AAAGCCGTTT TATATCCCCG CTGATCCTTC CGCAGCCTGC TTTATCGTAG CGCTTGCGCT GCTTGCAAAA GGTTCGGACA TTATCATCAG GGATCTCTGT CTCAATCCTA CCAGAACCGG ATACCTTGCT ATTCTTGCAG GCGCGGGCGC AGGTATATCG GTAGAAAACA GTCGTGTTAT CGGTGGTGAA GCAATCGGTG ACGTTCTTGT GCATAGCGAG GGCGAACTCA ACTCGTTAGT TATCAGCGAT CCTCATGAGG TTGCCAATGT TATTGACGAA ATTCCGATGC TTGCCGTTTT GTCGGCCTTT TCGTCCGGTC GCTTTGAGTT GCATCATGCT GCGGAACTCA GAACCAAGGA GAGCGACAGG ATTGATGCTC TTGTCGTTAA TCTTGAGCGT CTCGGTTTTC AGTGCGAGCA GTACCCTGAT GGATTCAAGG TCAATGGCCG CATCGCGATG CCGAAGGGAG TTGTTTCCAT TGAGAGTTTT GATGATCACA GGATTGCCAT GAGTTTTGCT ATTGCAGGAA AAGCAACGGG CGTTGACCTT GCTATTTCAG ATATCGGCGT GGTGGGGGTG TCGTTTCCAA ACTTTTTCGA GATTATCGAG AGCCTCGAAG TCTGA
|
Protein sequence | MSVFKGEVTA LPPDKSISHR AALIGALSDG TTEIVNFSGG FDNQSTLAVL QASGIALIQE ECAGSYGRRI RRVVIESRGL WSFLAPQAPL MCNNSGSTMR MFAGILAAQP FESVLEGDSS LMKRPMNRVA DPLRQMGAQV ELSFSGTAPI RIQGTKDLHS LEYRLPVSSA QVKSLVAFAA LHADGQTRII EPIRSRDHTE LMLGLETIDQ PNGERVIIVP GRKRIESKPF YIPADPSAAC FIVALALLAK GSDIIIRDLC LNPTRTGYLA ILAGAGAGIS VENSRVIGGE AIGDVLVHSE GELNSLVISD PHEVANVIDE IPMLAVLSAF SSGRFELHHA AELRTKESDR IDALVVNLER LGFQCEQYPD GFKVNGRIAM PKGVVSIESF DDHRIAMSFA IAGKATGVDL AISDIGVVGV SFPNFFEIIE SLEV
|
| |