Gene Cpha266_1918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1918 
Symbol 
ID4569862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2225309 
End bp2226466 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content52% 
IMG OID639766500 
Producthomocitrate synthase 
Protein accessionYP_912358 
Protein GI119357714 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR02660] homocitrate synthase NifV 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACTTG AAAAGAATAC GCCTTCGGGC AATGGCATAA GGCCCTGGAT TATCGACACA 
ACCCTGAGAG ATGGCGAACA GGCTCCGGGA GTTGTATTTA CTGCTGGCGA AAAATACAGA
ATCGCTCAAC TGCTTGCAGA AATTGGCGTC AACGAGCTTG AAATCGGGTA TCCGGCAATC
AGCAGCGAGG AACGAGAGAA CATCAGAACC ATTGCCGCGC TTCATCTGCC GGTGCGCCTG
ACCAGTTGGG CAAGAGCGTC ATGGGACGAT ATCGAACATG CCAGGAGCTG CGAAACCGAG
GCGGTGCACA TCAGTTTTCC GGTCTCGCCA CTCTACCTGC AACTGATGCA GAAGGATTAC
CTGTGGGTGC AGCGCCAGCT CCAGGAACTG GTGCCGAAAG CAAAAAAATA TTTCAACATT
GTCAGTGTCG GGGCTCAGGA TGCAACAAGA ACGCCGTATG AACTTCTGAA AACCTTTGTT
CTCGATGCTG AAGCGTGCGG AGCCGACAGG ATTCGCATAG CCGATACCGT TGGCATAGCG
ACTCCCATAT CGGTACTCGA TCTGGTGGGA CGTCTTCAGT CCGTAAGCCC GACAGCCCTC
GAATTTCATG CGCACAACGA TCTCGGCATG GCAACGGCCA ATGCGTTCAC CGCTCTTGAG
GCCGGCTGCT CCGCTGTGAG CGTTTCGGTA ACGGGACTTG GCGAACGGGC GGGTAACGCC
GCTCTTGAAG AACTTGCCGT CGCTCTTTTG CTCAACAATC AATTCCAATG CAAGATCGAC
ACCACAAAGC TTGCTATGCT CTGTAAAACC GTCAGCAAAG CATCCGGAAG ACCAATCCAG
GATCAGAAAC CGGTTATCGG CAAATCGGTA TTCCAGCACG AATCAGGCAT TCATTGCGCA
GCATTGTTAA AAAATCCGCT CTCCTACCAA CCATTTCTTC CATCTGAAGT CGGAAGAAAA
CCTCATGAGC TGGTAATCGG CAAGCATTCC GGCAGTGCGG CGCTCAAACA TTTTTACCAT
ACAAGAGGAA TCAGCCTGAC AAGAGATGAA GCCAGCCGGA TCCTGAGTCT GGTCCGCAGA
AGCGCTGATG AAAAGAAAAG AGCACTGACA GCCCATGAAC TTGATGAGAT CTACGCAATA
CGCAGCCAAA AAGGATAA
 
Protein sequence
MILEKNTPSG NGIRPWIIDT TLRDGEQAPG VVFTAGEKYR IAQLLAEIGV NELEIGYPAI 
SSEERENIRT IAALHLPVRL TSWARASWDD IEHARSCETE AVHISFPVSP LYLQLMQKDY
LWVQRQLQEL VPKAKKYFNI VSVGAQDATR TPYELLKTFV LDAEACGADR IRIADTVGIA
TPISVLDLVG RLQSVSPTAL EFHAHNDLGM ATANAFTALE AGCSAVSVSV TGLGERAGNA
ALEELAVALL LNNQFQCKID TTKLAMLCKT VSKASGRPIQ DQKPVIGKSV FQHESGIHCA
ALLKNPLSYQ PFLPSEVGRK PHELVIGKHS GSAALKHFYH TRGISLTRDE ASRILSLVRR
SADEKKRALT AHELDEIYAI RSQKG