Gene Clim_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1020 
Symbol 
ID6355469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1118152 
End bp1120113 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content61% 
IMG OID642668643 
Productcobaltochelatase subunit 
Protein accessionYP_001943074 
Protein GI189346545 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1239] Mg-chelatase subunit ChlI 
TIGRFAM ID[TIGR02442] cobaltochelatase subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.816011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AATACTTTCC ATTTTCGGCG ATTGTCGGGC AGGAGGATCT CAAAAAGGCC 
CTGCTGCTCA ATGCGGTCAA TCCCCGTACA GGCGGAGTGC TCGTCCGCGG GGAAAAAGGA
ACCGCCAAAT CGAGCGCTGT CAGGGCGCTC GGCCAACTGC TTCCCGAACG TCAGCAGAGG
GCCGGGCAGG GAGGGGTGTC GGTCGTTACG CTTCCGCTCA ACGCAACCGA AGAGATGGTT
GCCGGCGGTA TCGATTTTCA GGAAACCATG AAAGAGGGCC GCAGGATTTT CCAGCCGGGG
CTCCTTGCCA AAGCCCATGA GGGAATTCTC TATGTCGATG AAGTGAACCT GCTTGACGAC
CATCTTGTCG ATATCGTGCT CGATGCGGCC TCTTCCGGAG AGAATCGTGT GGAGCGCGAA
GGCATGACCC TTGTGCATCC CTCGCTGTTC GTGCTTGCCG GCACCATGAA TCCCGAAGAG
GGCGAACTTC GTCCGCAGCT GCTCGACCGG TTCGGCCTCT GTGTGGAGGT CCATGGCGAA
ACCGATCCGG ATCTGCGGGT GGAACTGATG CTTCGCAGGG AGGCTTTCGA CAGCGATCCC
GACGCATTTG CCGGACACAG CGGCGATGAA GAGCGGCGGA TCGCCGAAAA GATTGCGGCC
GCGCGTCTCC TTCTGCCTGC GGTACGGATG CCCTCACACC TGCGGGGGTT CATTTCGGAG
CTGTGCCGCA ACAGTAACGT CGCCGGCCAT CGGGCTGATC TTGTCATCGA GCAGGCCGCA
CGGGCGAGTG CCGCACTGCG AGGTTCACGG GAAGTGTCGG TTGGCGATAT CACCGGAGTG
GCTCCGCTCG CTCTCGTCCA TCGGCGCAGG GACCCCGTGC CTCCGCCCGA AGAGCGGCCC
CGGGAGCCGG AGCGAGGCGA AGGGCAGGAG AACGACAACC CGCAGCAGCC GGAAGATGAT
GGCCGGAAGC CCGGGGAGCC GAAGGAGAAC TCTGGCGAGA AGAGCGCTGA AGGTCCCGAA
GGAGACAGGT CTCGAGAGAG CGACGGAGAT CGGCAGGAGT CGGCGGAACC GCAGGAGCCG
GAAGATCGGG GTGAAGGGGA GGAGCGTTCC GGAACCGATG AGCTGTTCGG CGTAGCCCCC
TCCTTCAGGG TCCGCAGCAT CGTGACGCCG AAGGACCGCA AACTGCGTCG CGGTTCGGGT
AAACGCTCCC GATCGCGGGT TTCGCAGAAA CAGGGGCGCT ACACCAGGAG CACCATGCCC
CGTGGTACCG ACGATATCGC GCTTGACGCC ACGCTGAGGG CTGCAGCGCC CTTCCAGCGG
TATCGCCTGA ACCCGAACGG CATGGCGGTG GTTCTGCAGA ACGAGGATAT ACGCGAAAAG
ATCAGGGAGA AGCGCCTTGG CAATCTGCTC ATTTTCGTCG TGGACGCGAG CGGTTCGATG
GGCGCAAGAG GCAGGATGGC GGCCTCGAAA GGGGCGATCA TGTCGCTTCT GCTCGATGCA
TACCAGAAGC GCGACAAGCT TGCCATGGTC TCCTTCCGCA AGGAGGGCGC GGTGGTGAAT
CTGCCGGTCA CCTCTTCCAT CGAACTTGCC GCGAGACTGC TCAGGGATAT GCCGGTAGGC
GGGCGCACTC CCTTTTCTGC CGGCCTCGTA AAAGGGTACG AAATAGCGAT GAACTATCTG
CGCAAGGAAC CGCAGGGGCG TCCGCTCGTC ATTCTTGTCA CCGACGGCAA GGCGAACCGG
TCCATCGGTT CGTCCAGGCC TCTCGACGAG GCGTTCCGGA TCGCCCGGCG GGTTGCCGGT
GAAGAGCGCA TCCGCTATCT CGTTGTCGAT ACCGAAGAGC CCGGTCTCGT CAATTTCGGG
CTCGCAAAAA AACTCGCCGG TCTGCTCGAT GCATGGTATT TCCGCATCGA CGACCTGCGT
GCAGATACCC TCGTTTCCAT CGTAAAAAAC ATGACACCAT GA
 
Protein sequence
MKRKYFPFSA IVGQEDLKKA LLLNAVNPRT GGVLVRGEKG TAKSSAVRAL GQLLPERQQR 
AGQGGVSVVT LPLNATEEMV AGGIDFQETM KEGRRIFQPG LLAKAHEGIL YVDEVNLLDD
HLVDIVLDAA SSGENRVERE GMTLVHPSLF VLAGTMNPEE GELRPQLLDR FGLCVEVHGE
TDPDLRVELM LRREAFDSDP DAFAGHSGDE ERRIAEKIAA ARLLLPAVRM PSHLRGFISE
LCRNSNVAGH RADLVIEQAA RASAALRGSR EVSVGDITGV APLALVHRRR DPVPPPEERP
REPERGEGQE NDNPQQPEDD GRKPGEPKEN SGEKSAEGPE GDRSRESDGD RQESAEPQEP
EDRGEGEERS GTDELFGVAP SFRVRSIVTP KDRKLRRGSG KRSRSRVSQK QGRYTRSTMP
RGTDDIALDA TLRAAAPFQR YRLNPNGMAV VLQNEDIREK IREKRLGNLL IFVVDASGSM
GARGRMAASK GAIMSLLLDA YQKRDKLAMV SFRKEGAVVN LPVTSSIELA ARLLRDMPVG
GRTPFSAGLV KGYEIAMNYL RKEPQGRPLV ILVTDGKANR SIGSSRPLDE AFRIARRVAG
EERIRYLVVD TEEPGLVNFG LAKKLAGLLD AWYFRIDDLR ADTLVSIVKN MTP