Gene Cpha266_0846 details

Gene Information

Locus tag: Cpha266_0846
Symbol:
ID: 4570383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 968010
End bp: 969620
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 52%
IMG OID: 639765444
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_911321
Protein GI: 119356677
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein
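
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends with a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(968010, 969620)
print(length_bp)                  # 1611
print(protein_length(length_bp))  # 536
```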


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.127256
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAACC CTCTTTCGAA AACGATAGAA CTCTATGACA CCACCCTGCG TGACGGCACG 
CAGGGCGAGC ACATCAATCT TTCAGTTCAG GACAAACTGC TCATTGCGGA ACGTCTTGAC
GAGTTCGGCA TGGACTATAT TGAAGGCGGC TGGCCGAGCA GCAACCCCAA GGACGAAGAA
TTCTTCCTGA AAGCACGACA ATTGAAGCTC AACCACGCAA AGCTCTGCGC TTTCGGCTCC
ACCGCGCGCT CCTCAGCAAC GGTCAAGAGT GACCAGAACC TGCTCGGACT GCTCCAATCC
GAAACTCCGG TTATCACCAT TTTCGGTAAA ACATGGAAAG CCCACTCCTC AAAAGGGCTC
GGAATTTCTG ATGAGGAAAA TGCTGAACTG ATCCATCGTT CCGTCCAGTT CCTTAAAGAA
GCAGGTCGTG AGGTCTTTTT TGACGCAGAA CATTTCTTTG ACGGCTTCAA AGACAATCCC
GAATTCGCGC TCACCATGAT CCTGGCCGCC GTAGAGGCCG GAGCGTCAAG AGTCGTACTG
TGCGATACCA ACGGCGGCTC AATGCCGCAT GAAGTCGATG CCATCGTAAA AAAAGTAGTC
GCCACGGCGG GCGTACCGGT GGGAATCCAC TGCCACAATG ACAGCGACAT TGCCGTTGCA
AACTCCATTA TTGCCGTTCA GGCCGGAGCG ACGCATGTTC AGGGAACCAT CAACGGCATC
GGTGAACGGT GCGGCAATGC CAATCTCATC AGCATCATAC CAAACATCAT GCTCAAACTG
CATGGGAGTT TTACCCATCT GCAGCAGCTC AGCCAGTTGA CATCGCTCTC AAAGTTCGTC
TTCGAGATTC TCAACCTCCC CTCCGACACA AAGGCGCCCT TTACAGGCAA ATCGGCCTTT
GCCCATAAAG GGGGCATTCA TGTCAGCGCT GTCATGAAAG AGAGCTCCCT GTACGAACAT
ATCGACCCGA AACTTGTCGG AAACAGACAG CGCGTGCTCG TCTCAGAGCT TGCCGGCCAG
AGCAACATCC GGTACAAGGC TGATGAACTT GGAATCAAGC TGCCCGAAAA GGGAGAACAG
ATCAGAAACC TCGTTCACCA TATCAAGGAA CTTGAACACA AAGGGTACCA GTTCGACGGC
GCCGAAGCAT CATTCGAACT GATCCTCCGA CGCGAACTCG GTGACTTCAG CCCCTATTTC
AACGTGCTTG AAACCAAGGT GCATATCGAG TCAGGGGTCG ACTCGAAAAA CGTCGATCAG
GCAATCCTGA AAGTCCAGGT CGGCAACGAA ATCGAGCACA TTGCCGCTGA CGGAGACGGC
CCGGTCAATG CACTCGACAA AGCGCTGCGA AAAGCACTGA TCCATTTTTA TCCTGCCATA
AAAACAATCA GGCTGGTTGA CTATAAAGTC CGGGTCCTTG AAGAAAAACG CAGCACCAGC
GCAAAAGTCC GGGTGCTGAT TCAAACCAGT AACGGACAGG AAACGTGGGG AACGGTCGGA
GTATCAACGA ACATTATCGA AGCAAGTCTT CTTGCACTCC AGGACAGCAT GAACTATCAC
CTCTTCAACG TCAGGACAGC CATTCAAAAA AAAGCAGCAG CCGAGGCATA G
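
The gene sequence is displayed in space-separated groups of ten bases, so it has to be joined before its length or GC content can be compared with the values in the Gene Information table. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block: drop all whitespace and uppercase."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six groups of ten bases).
first_line = "ATGACAAACC CTCTTTCGAA AACGATAGAA CTCTATGACA CCACCCTGCG TGACGGCACG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions over the full sequence block gives the figures to compare against the listed gene length and GC content.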
 
Protein sequence
MTNPLSKTIE LYDTTLRDGT QGEHINLSVQ DKLLIAERLD EFGMDYIEGG WPSSNPKDEE 
FFLKARQLKL NHAKLCAFGS TARSSATVKS DQNLLGLLQS ETPVITIFGK TWKAHSSKGL
GISDEENAEL IHRSVQFLKE AGREVFFDAE HFFDGFKDNP EFALTMILAA VEAGASRVVL
CDTNGGSMPH EVDAIVKKVV ATAGVPVGIH CHNDSDIAVA NSIIAVQAGA THVQGTINGI
GERCGNANLI SIIPNIMLKL HGSFTHLQQL SQLTSLSKFV FEILNLPSDT KAPFTGKSAF
AHKGGIHVSA VMKESSLYEH IDPKLVGNRQ RVLVSELAGQ SNIRYKADEL GIKLPEKGEQ
IRNLVHHIKE LEHKGYQFDG AEASFELILR RELGDFSPYF NVLETKVHIE SGVDSKNVDQ
AILKVQVGNE IEHIAADGDG PVNALDKALR KALIHFYPAI KTIRLVDYKV RVLEEKRSTS
AKVRVLIQTS NGQETWGTVG VSTNIIEASL LALQDSMNYH LFNVRTAIQK KAAAEA
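
The protein sequence can be reproduced from the gene sequence with translation table 11, which uses the same codon assignments as the standard genetic code and differs from it in the set of permitted start codons. The sketch below builds the codon table from the usual compact T, C, A, G ordering and translates up to the first stop codon. It does not handle alternative start codons (this gene starts with ATG), and `translate` is an illustrative helper rather than a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order, third base varying fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence shown above.
print(translate("ATGACAAACC CTCTTTCGAA A"))  # MTNPLSK
```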