Gene Cag_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1900 
Symbol 
ID3747645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2418637 
End bp2419932 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content47% 
IMG OID637774437 
Product3-isopropylmalate dehydratase 
Protein accessionYP_380193 
Protein GI78189855 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAAA CAATAACCCA GAAAATTTTT GCAAAATCAG CAAAACGCCC TTTTGTTGAT 
CCCGGCGAAA GTGTTTGGCT TAATGTGGAT GTTCTCCTCA CACATGACGT GTGCGGACCG
CCCACGATTG ATATTTTTAA AGAGAAATTT GGCTCCAACG CTAAAGTGTG GGATCCCGAA
AAAGTTATTA TTCTGCCTGA TCACTACATC TTTACTGCCA ACGAACATGC GCACCGCAAC
ATTGATTTGC TGCGCCAATT TGCTAAAGAG CAAGGCTTAC CTCATTACTA CGATGTTGGC
ACCGATCGTT ATAAAGGTGT GTGCCATGTA GCACTTGCTG AAGAGGGCTT TAACCTTCCC
GGTACCGTGC TTTTTGGTAC CGACTCACAC ACCTGCACCT CTGGCGCTTT TGGCATGTTT
GGTACCGGCA TTGGCAATAC CGATGCGGCA TTTATTCTTG GTACCGGCAA ATTGTGGGAA
AAAGTACCCG AATCAATGAA GTTCACCTTT GAGGGTGAAA TGCCAGCCTA TTTGACAGCT
AAAGATCTGA TTTTGCAGAT TCTTGGCGAC ATCACCACCG ATGGTGCAAC CTATCGCGCT
ATGGAGTTTG ATGGCGAAGC TATTTTCTCT CTGCCAATGG AAGAGCGCAT GACGCTTACC
AACATGGCAA TTGAAGCGGG TGGCATGAAT GGCATTATTG CAGCCGATAA CATTGCAGAA
GAGTATGTAA AGGCACGTAC CAAAAAGCCT TACGAGATTT TCCAAAGCGA TCCTGACGCA
AAGTACCATA GCACCTATCG CTATAACGTG CGTGATTTGG AGCCTGTAGT AGCTCAACCG
CATAGCCCCG ATAACCGTGC AACCGTGCGT AGCGTAGCTG GCACAAAAAT CACCAAATCG
TACATTGGCT CATGCACGGG TGGCAAGCTA AGCGACTTTA TGATGGCAGC TAAAATCCTA
AAAGGGCAGA AAGTTACCGT TACCACAACT ATTGTTCCTG CAACTACTTT AGTAGCTCGT
AGCCTTGAAA CGGAGCAATA CGATGGCAAA AGCTTAAAGC AAATTTTTGA AGAAGCTGGC
TGCAACGTTG CTTTACCATC GTGCGCTGCC TGTCTTGGCG GTCCAGCTGA CACCGTTGGT
CGTTCGGTGG ATAATGACCT TGTGGTTTCA ACAACGAACC GCAACTTCCC TGGACGCATG
GGTAGCAAAC ATGCAGGCGT TTATCTTGCT TCACCACTTA CGGCAGCGGC ATCAGCAATT
ACCGGCAAAC TTACCGATCC GAGAGATTTT CTCTGA
 
Protein sequence
MAQTITQKIF AKSAKRPFVD PGESVWLNVD VLLTHDVCGP PTIDIFKEKF GSNAKVWDPE 
KVIILPDHYI FTANEHAHRN IDLLRQFAKE QGLPHYYDVG TDRYKGVCHV ALAEEGFNLP
GTVLFGTDSH TCTSGAFGMF GTGIGNTDAA FILGTGKLWE KVPESMKFTF EGEMPAYLTA
KDLILQILGD ITTDGATYRA MEFDGEAIFS LPMEERMTLT NMAIEAGGMN GIIAADNIAE
EYVKARTKKP YEIFQSDPDA KYHSTYRYNV RDLEPVVAQP HSPDNRATVR SVAGTKITKS
YIGSCTGGKL SDFMMAAKIL KGQKVTVTTT IVPATTLVAR SLETEQYDGK SLKQIFEEAG
CNVALPSCAA CLGGPADTVG RSVDNDLVVS TTNRNFPGRM GSKHAGVYLA SPLTAAASAI
TGKLTDPRDF L