Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0846 |
Symbol | |
ID | 4570383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 968010 |
End bp | 969620 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639765444 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_911321 |
Protein GI | 119356677 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.127256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACC CTCTTTCGAA AACGATAGAA CTCTATGACA CCACCCTGCG TGACGGCACG CAGGGCGAGC ACATCAATCT TTCAGTTCAG GACAAACTGC TCATTGCGGA ACGTCTTGAC GAGTTCGGCA TGGACTATAT TGAAGGCGGC TGGCCGAGCA GCAACCCCAA GGACGAAGAA TTCTTCCTGA AAGCACGACA ATTGAAGCTC AACCACGCAA AGCTCTGCGC TTTCGGCTCC ACCGCGCGCT CCTCAGCAAC GGTCAAGAGT GACCAGAACC TGCTCGGACT GCTCCAATCC GAAACTCCGG TTATCACCAT TTTCGGTAAA ACATGGAAAG CCCACTCCTC AAAAGGGCTC GGAATTTCTG ATGAGGAAAA TGCTGAACTG ATCCATCGTT CCGTCCAGTT CCTTAAAGAA GCAGGTCGTG AGGTCTTTTT TGACGCAGAA CATTTCTTTG ACGGCTTCAA AGACAATCCC GAATTCGCGC TCACCATGAT CCTGGCCGCC GTAGAGGCCG GAGCGTCAAG AGTCGTACTG TGCGATACCA ACGGCGGCTC AATGCCGCAT GAAGTCGATG CCATCGTAAA AAAAGTAGTC GCCACGGCGG GCGTACCGGT GGGAATCCAC TGCCACAATG ACAGCGACAT TGCCGTTGCA AACTCCATTA TTGCCGTTCA GGCCGGAGCG ACGCATGTTC AGGGAACCAT CAACGGCATC GGTGAACGGT GCGGCAATGC CAATCTCATC AGCATCATAC CAAACATCAT GCTCAAACTG CATGGGAGTT TTACCCATCT GCAGCAGCTC AGCCAGTTGA CATCGCTCTC AAAGTTCGTC TTCGAGATTC TCAACCTCCC CTCCGACACA AAGGCGCCCT TTACAGGCAA ATCGGCCTTT GCCCATAAAG GGGGCATTCA TGTCAGCGCT GTCATGAAAG AGAGCTCCCT GTACGAACAT ATCGACCCGA AACTTGTCGG AAACAGACAG CGCGTGCTCG TCTCAGAGCT TGCCGGCCAG AGCAACATCC GGTACAAGGC TGATGAACTT GGAATCAAGC TGCCCGAAAA GGGAGAACAG ATCAGAAACC TCGTTCACCA TATCAAGGAA CTTGAACACA AAGGGTACCA GTTCGACGGC GCCGAAGCAT CATTCGAACT GATCCTCCGA CGCGAACTCG GTGACTTCAG CCCCTATTTC AACGTGCTTG AAACCAAGGT GCATATCGAG TCAGGGGTCG ACTCGAAAAA CGTCGATCAG GCAATCCTGA AAGTCCAGGT CGGCAACGAA ATCGAGCACA TTGCCGCTGA CGGAGACGGC CCGGTCAATG CACTCGACAA AGCGCTGCGA AAAGCACTGA TCCATTTTTA TCCTGCCATA AAAACAATCA GGCTGGTTGA CTATAAAGTC CGGGTCCTTG AAGAAAAACG CAGCACCAGC GCAAAAGTCC GGGTGCTGAT TCAAACCAGT AACGGACAGG AAACGTGGGG AACGGTCGGA GTATCAACGA ACATTATCGA AGCAAGTCTT CTTGCACTCC AGGACAGCAT GAACTATCAC CTCTTCAACG TCAGGACAGC CATTCAAAAA AAAGCAGCAG CCGAGGCATA G
|
Protein sequence | MTNPLSKTIE LYDTTLRDGT QGEHINLSVQ DKLLIAERLD EFGMDYIEGG WPSSNPKDEE FFLKARQLKL NHAKLCAFGS TARSSATVKS DQNLLGLLQS ETPVITIFGK TWKAHSSKGL GISDEENAEL IHRSVQFLKE AGREVFFDAE HFFDGFKDNP EFALTMILAA VEAGASRVVL CDTNGGSMPH EVDAIVKKVV ATAGVPVGIH CHNDSDIAVA NSIIAVQAGA THVQGTINGI GERCGNANLI SIIPNIMLKL HGSFTHLQQL SQLTSLSKFV FEILNLPSDT KAPFTGKSAF AHKGGIHVSA VMKESSLYEH IDPKLVGNRQ RVLVSELAGQ SNIRYKADEL GIKLPEKGEQ IRNLVHHIKE LEHKGYQFDG AEASFELILR RELGDFSPYF NVLETKVHIE SGVDSKNVDQ AILKVQVGNE IEHIAADGDG PVNALDKALR KALIHFYPAI KTIRLVDYKV RVLEEKRSTS AKVRVLIQTS NGQETWGTVG VSTNIIEASL LALQDSMNYH LFNVRTAIQK KAAAEA
|
| |