Gene Cag_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1898 
Symbol 
ID3747643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2416439 
End bp2418037 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content45% 
IMG OID637774435 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_380191 
Protein GI78189853 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAT GCACAACCCC TATTGAGTTG TACGATACCA CCCTGCGTGA TGGCACGCAG 
GGTGAGCATA TCAACCTTTC TGTTCAAGAT AAACTGCTCA TAGCCCAAAA GCTTGATGAG
TTCGGCATGG ACTTTATTGA AGGTGGCTGG CCCAGCAGTA ATCCCAAAGA TGAAGAGTTC
TTTCTGAAAG CGCGCCACCT TACCTTTAAA CATGCGCGTC TCACCTCTTT TGGCTCAACC
GCTCGCTCGG TTAAAGCGGT AGAAAGCGAT CCCAATTTGC TTGGCTTGCT TCGCTCCGAA
ACGCCCATTA TTACCATTTT TGGTAAAACA TGGCAGGCAC ACTCCCTTAA AAGTCTTGGT
ATTTCCGATG AAGAAAATGC TGAATTGATT TATCGTTCAG TAGCTTTTTT GAAGGAAAGT
GGACGCGAAG TCTTTTTTGA TGCTGAGCAC TTTTTTGATG GCTATAAAGA TAACCGTGAC
TTTGCCGTTG CCATGCTTCG TGCAGCCGTT GAAGGCGGCG CAACCCGTCT TGTGCTATGC
GACACCAATG GTGGCACCCT GCCCGACGAA GTAAGCGCCA TTGTTCGCGA TATTGGCTCA
ACCTTCCCAA CCGTTGCTCT TGGCATTCAT AGCCATAACG ATGGCGATGT AGCTGTTGCA
AACTCACTTG CTGCCGTGGT TGCTGGAGCA ACGCACATAC AAGGCACTAT TAACGGCATT
GGTGAACGGT GCGGCAATGC CAACCTTATC AGCATTATTC CTAATGTGCT TTTAAAGTTG
CAACGCACCT GTAACTACGT AGAGCACTTA ACCCAACTTA CGCCGCTCTC AAAATTTGTC
TATGAAATTT TGAACTTACC GGCGAACACA CGCGCTCCGT TTGTTGGTAA GTCGGCATTT
GCCCACAAAG GTGGCATTCA TGTAAGTGCG GTTATGAAGG AGAGTTCGCT CTATGAGCAT
ATTGACCCAA AAGTAGTAGG CAACCATCAG CGCGTGCTTG TGTCGGAACT TGCAGGGCAA
AGCAACATTC GCTACAAAGC GCAAGAGCTT GGCATTGAGC TTCCAGAAAA AAGCGATCTC
TTTAAAAATA TTGTTCATCG CATTAAAGAG TTAGAACATG CAGGCTTTCA ATTCGATGGA
GCCGAAGCCT CTTTTGAGTT GCTCCTACAC CATGAGTTAG GTAACTTCAC TCCATTTTTT
GAAGTGCTTG AAACGAAAGT TCAAATAGAG TCCAGCAAAG AGAATAAAGT GGTAAACCAA
GCCACGTTAA AAGTACAAGT TGGCGATGAC GTTGAACATG TTGTTGCTGA TGGCGATGGT
CCTGTTAACG CGCTTGATAA AGCATTACGA AAAGCGCTGT TACGCTTTTT CCCTGAGATA
AAACAGATCA AGCTTGTTGA TTACAAAGTG CGCGTATTAG AAGAAAAACG AGGTACAAGC
GCTAAAGTGC GCGTGCTGAT TGAATCAAGT AATGGAGAGA CAACATGGGG CACAGTTGGC
GTTTCAACCA ATATTATTGA GGCAAGCTTA CAAGCGTTGC AGGATAGCAT GAATTACCAT
CTCTTTAGTT TGCAGCAAAA AACGCAAGAA CAGAGCTGA
 
Protein sequence
MSLCTTPIEL YDTTLRDGTQ GEHINLSVQD KLLIAQKLDE FGMDFIEGGW PSSNPKDEEF 
FLKARHLTFK HARLTSFGST ARSVKAVESD PNLLGLLRSE TPIITIFGKT WQAHSLKSLG
ISDEENAELI YRSVAFLKES GREVFFDAEH FFDGYKDNRD FAVAMLRAAV EGGATRLVLC
DTNGGTLPDE VSAIVRDIGS TFPTVALGIH SHNDGDVAVA NSLAAVVAGA THIQGTINGI
GERCGNANLI SIIPNVLLKL QRTCNYVEHL TQLTPLSKFV YEILNLPANT RAPFVGKSAF
AHKGGIHVSA VMKESSLYEH IDPKVVGNHQ RVLVSELAGQ SNIRYKAQEL GIELPEKSDL
FKNIVHRIKE LEHAGFQFDG AEASFELLLH HELGNFTPFF EVLETKVQIE SSKENKVVNQ
ATLKVQVGDD VEHVVADGDG PVNALDKALR KALLRFFPEI KQIKLVDYKV RVLEEKRGTS
AKVRVLIESS NGETTWGTVG VSTNIIEASL QALQDSMNYH LFSLQQKTQE QS