Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1918 |
Symbol | |
ID | 4569862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2225309 |
End bp | 2226466 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639766500 |
Product | homocitrate synthase |
Protein accession | YP_912358 |
Protein GI | 119357714 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACTTG AAAAGAATAC GCCTTCGGGC AATGGCATAA GGCCCTGGAT TATCGACACA ACCCTGAGAG ATGGCGAACA GGCTCCGGGA GTTGTATTTA CTGCTGGCGA AAAATACAGA ATCGCTCAAC TGCTTGCAGA AATTGGCGTC AACGAGCTTG AAATCGGGTA TCCGGCAATC AGCAGCGAGG AACGAGAGAA CATCAGAACC ATTGCCGCGC TTCATCTGCC GGTGCGCCTG ACCAGTTGGG CAAGAGCGTC ATGGGACGAT ATCGAACATG CCAGGAGCTG CGAAACCGAG GCGGTGCACA TCAGTTTTCC GGTCTCGCCA CTCTACCTGC AACTGATGCA GAAGGATTAC CTGTGGGTGC AGCGCCAGCT CCAGGAACTG GTGCCGAAAG CAAAAAAATA TTTCAACATT GTCAGTGTCG GGGCTCAGGA TGCAACAAGA ACGCCGTATG AACTTCTGAA AACCTTTGTT CTCGATGCTG AAGCGTGCGG AGCCGACAGG ATTCGCATAG CCGATACCGT TGGCATAGCG ACTCCCATAT CGGTACTCGA TCTGGTGGGA CGTCTTCAGT CCGTAAGCCC GACAGCCCTC GAATTTCATG CGCACAACGA TCTCGGCATG GCAACGGCCA ATGCGTTCAC CGCTCTTGAG GCCGGCTGCT CCGCTGTGAG CGTTTCGGTA ACGGGACTTG GCGAACGGGC GGGTAACGCC GCTCTTGAAG AACTTGCCGT CGCTCTTTTG CTCAACAATC AATTCCAATG CAAGATCGAC ACCACAAAGC TTGCTATGCT CTGTAAAACC GTCAGCAAAG CATCCGGAAG ACCAATCCAG GATCAGAAAC CGGTTATCGG CAAATCGGTA TTCCAGCACG AATCAGGCAT TCATTGCGCA GCATTGTTAA AAAATCCGCT CTCCTACCAA CCATTTCTTC CATCTGAAGT CGGAAGAAAA CCTCATGAGC TGGTAATCGG CAAGCATTCC GGCAGTGCGG CGCTCAAACA TTTTTACCAT ACAAGAGGAA TCAGCCTGAC AAGAGATGAA GCCAGCCGGA TCCTGAGTCT GGTCCGCAGA AGCGCTGATG AAAAGAAAAG AGCACTGACA GCCCATGAAC TTGATGAGAT CTACGCAATA CGCAGCCAAA AAGGATAA
|
Protein sequence | MILEKNTPSG NGIRPWIIDT TLRDGEQAPG VVFTAGEKYR IAQLLAEIGV NELEIGYPAI SSEERENIRT IAALHLPVRL TSWARASWDD IEHARSCETE AVHISFPVSP LYLQLMQKDY LWVQRQLQEL VPKAKKYFNI VSVGAQDATR TPYELLKTFV LDAEACGADR IRIADTVGIA TPISVLDLVG RLQSVSPTAL EFHAHNDLGM ATANAFTALE AGCSAVSVSV TGLGERAGNA ALEELAVALL LNNQFQCKID TTKLAMLCKT VSKASGRPIQ DQKPVIGKSV FQHESGIHCA ALLKNPLSYQ PFLPSEVGRK PHELVIGKHS GSAALKHFYH TRGISLTRDE ASRILSLVRR SADEKKRALT AHELDEIYAI RSQKG
|
| |