Gene Clim_0737 details


Gene Information

Locus tag: Clim_0737
Symbol:
ID: 6356018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 807097
End bp: 808392
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 54%
IMG OID: 642668362
Product: homoaconitate hydratase family protein
Protein accession: YP_001942797
Protein GI: 189346268
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
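
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the reported protein length excludes the stop codon; the function names `cds_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record: Start bp 807097, End bp 808392.
    length = cds_length(807097, 808392)
    print(length)                           # 1296, matching Gene Length
    print(expected_protein_length(length))  # 431, matching Protein Length
```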


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACAAA CAATAACCCA GAAAATTCTC TCGAGAGCCG CTAACCGGAA ATTTGTCGAT 
GCCGGTGAAA ACGTCTGGCT CAATGTCGAC ATCCTGCTCA CTCATGACGT GTGCGGACCG
CCGACCTTCG ATATTTTCAA GCAGGAGTTC GGCCCGGATG CAAAAGTATG GGACCCCGAA
AAGGTCGTGG TCCTGCCAGA CCACTATATT TTTACAGCAA ATGAGCATGC ACACCGCAAT
ATCGACCTGT TGAGACAGTT TGCATCGGAA CAGAGTCTCC CCAACTACTA CGATGTCGGC
ACCGACCGTT ACAAAGGGGT CTGCCATGTA GCTCTTGCTG AAGAGGGATT CAATATTCCG
GGTACGGTTC TGTTCGGCAC GGACTCGCAT ACCTGTACCT CGGGAGCATT CGGCATGTTC
GGCTCCGGAA TCGGAAACAC TGACGCAGCC TTCATTCTCG GCACAGGCAA GCTCTGGGAA
AAGGTGCCTG AGTCCATGAA ATTCATCTTC GAAGGCGACA TGCCGGAATA CCTCTGCGCA
AAGGATCTCA TTCTGCAGAT TCTCGGCGAC ATAGGCACCG ACGGAGCAAC TTACCGGGCA
ATGGAATTCG ACGGCGAAGC GGTCTACTCT CTTCCGGTCG ATGAGCGCAT GACCCTGTGC
AATATGGCTA TCGAAGCAGG AGGCATGAAC GGCATCATCG CGGCCGACGC CGTTACCGAA
GCTTATGTAA AGGCACACAG CAGCAAACCC TACGAAATCT TCCAAAGCGA TCCCGACGCC
GACTATCACA GCGTTTACCG ATATAACGCA AGGGAACTGG AACCGGTTGT GGCAAAACCG
CACAGTCCGG ACAACAGGGC TACCGTCAGA AGCATGCAGG GCACGAAAAT CACCAAGTCC
TATATAGGCT CCTGCACCGG AGGCAAACTG ACCGATTTCA TGCTTGCAGC GAAAATTCTT
AAAGGCAAAC AGGTTACCGT ACCGACCAAC ATCGTTCCGG CAACCGTGCT TGTAGCCCGC
GCCCTGGAAT GTGAAACATG GGAAGGCGTT ACACTGAAAA AGATTTTTGA AGATGCCGGA
TGCAGCATAG CCCTGCCCTC ATGCGCAGCA TGTCTCGGCG GGCCTGCAGA TACCGTGGGA
CGCTCGGCCG ATCAGGATGT TGTGGTCTCC ACGACGAACC GCAACTTCCC GGGACGCATG
GGAAGCAAGA AAGCCGATGT CTATCTTGCC TCTCCGCTTA CCGCTGCAGC ATCTGCAGTT
ACCGGAAAAC TAACCGATCC AAGGGAGTTC CTCTGA
 
Protein sequence
MAQTITQKIL SRAANRKFVD AGENVWLNVD ILLTHDVCGP PTFDIFKQEF GPDAKVWDPE 
KVVVLPDHYI FTANEHAHRN IDLLRQFASE QSLPNYYDVG TDRYKGVCHV ALAEEGFNIP
GTVLFGTDSH TCTSGAFGMF GSGIGNTDAA FILGTGKLWE KVPESMKFIF EGDMPEYLCA
KDLILQILGD IGTDGATYRA MEFDGEAVYS LPVDERMTLC NMAIEAGGMN GIIAADAVTE
AYVKAHSSKP YEIFQSDPDA DYHSVYRYNA RELEPVVAKP HSPDNRATVR SMQGTKITKS
YIGSCTGGKL TDFMLAAKIL KGKQVTVPTN IVPATVLVAR ALECETWEGV TLKKIFEDAG
CSIALPSCAA CLGGPADTVG RSADQDVVVS TTNRNFPGRM GSKKADVYLA SPLTAAASAV
TGKLTDPREF L
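
The relationship between the gene sequence and the protein sequence listed above can be reproduced with a minimal sketch. It builds the codon table from the NCBI amino-acid string for translation table 11 (the amino-acid assignments are the same as in the standard code; table 11 differs in which start codons it permits), strips the spacing used in the 10-base blocks, and translates up to the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch assumes the sequence is given 5' to 3' on the coding strand.

```python
from itertools import product

# Codon order used by NCBI genetic-code tables: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First and last blocks of the gene sequence, copied from the record.
    print(translate("ATGGCACAAA CAATAACCCA GAAAATTCTC"))         # MAQTITQKIL
    print(translate("ACCGGAAAAC TAACCGATCC AAGGGAGTTC CTCTGA"))  # TGKLTDPREFL
```

Passing the full gene sequence to `translate` should yield the 431-residue protein sequence, and `gc_percent` on the same input can be compared with the GC content field.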