Gene Cag_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1901 
Symbol 
ID3747646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2420036 
End bp2421094 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content46% 
IMG OID637774438 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_380194 
Protein GI78189856 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTATA AAATTGTCTC TATTCCGGGT GATGGTATAG GTACCGAAGT TGTTGCTGGC 
GCTGTTGCTG TATTACGTCA ACTTGAAAAA AAATATGGCT TTACCGTTGA GATTGAAGAG
CATCTTTTTG GTGGCGCTTC TTACGATGTG CATGGTGAAA TGTTAACCGA TGCTACGCTT
GAAGCCTGCA AAAATTGCGA TGCCGTGCTG CTTGGAGCTG TTGGTGGTCC AAAATGGGAA
AACCTTCCCC ACGAGCACAA GCCTGAAGCT GCGTTGCTTA AAATCCGCAA AGAGCTTGGC
TTGTTTGCCA ACCTTCGCCC AGCAAAAGTG TATGATGCCT TAGTTGATGC TTCATCACTA
AAAGCGGATG TTGTGCGTGG CACCGATTTT GTGGTTTTTC GTGAGCTAAC GGGTGGTATT
TACTTCGGTC AACCTCGTGG CTACGATGAG AACAAAGGCT GGAACACCAT GGTTTATGAA
AAGTATGAGG TTGAGCGTAT TGCTCGCCTT GCTTTTGAAG CGGCTCGCCA ACGCCAAGGG
CGCGTTATGT CTATTGATAA GGCAAACGTC CTTGAAGTAT CACAATTGTG GCGGAACGTT
GTTCACGCTG TACACGCCGA TTACCAAGAT GTTGAATTGA GTGATATGTA TGTGGATAAT
GCTGCAATGC AAATTGTACG TAATCCAAAA CAGTTTGACG TTATTGTTAC TGGCAACCTT
TTTGGTGATA TTCTGAGCGA TATTTCAGGC ATGATTACTG GTAGCCTTGG CATGTTGCCT
TCGGCAAGCA TTGGTTCTAA GCACGCACTA TATGAGCCAA TTCACGGCAG TGCCCCCGAT
ATTGCAGGAC AAAACAAAGC AAACCCCATT GCAACCATTG CTTCGGTAGC AATGATGTTT
GAACACAGCT TTAAGCGTAC CGATATTGCT CGTGATATTG AACAAGCCAT TGAAGCTGCC
CTTGCTACCG GTGTAAGAAC GGCAGACATT GCAGCAGCCG GCGATACAGC AGTTTCAACC
ACAGCAATGA CTGAAGCCAT TATCAGCCAA CTGAAGTAA
 
Protein sequence
MNYKIVSIPG DGIGTEVVAG AVAVLRQLEK KYGFTVEIEE HLFGGASYDV HGEMLTDATL 
EACKNCDAVL LGAVGGPKWE NLPHEHKPEA ALLKIRKELG LFANLRPAKV YDALVDASSL
KADVVRGTDF VVFRELTGGI YFGQPRGYDE NKGWNTMVYE KYEVERIARL AFEAARQRQG
RVMSIDKANV LEVSQLWRNV VHAVHADYQD VELSDMYVDN AAMQIVRNPK QFDVIVTGNL
FGDILSDISG MITGSLGMLP SASIGSKHAL YEPIHGSAPD IAGQNKANPI ATIASVAMMF
EHSFKRTDIA RDIEQAIEAA LATGVRTADI AAAGDTAVST TAMTEAIISQ LK