Gene Clim_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2052 
Symbol 
ID6355030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2261961 
End bp2263409 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID642669648 
ProductO-succinylbenzoate-CoA ligase 
Protein accessionYP_001944060 
Protein GI189347531 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01923] O-succinylbenzoate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCTTG TAAACCGGGC ATCCCTGCTT TTCGACTCTT CACCCGCACT CATCTCCCCG 
GCAGCGACAC TCTCTTTCAG GCAGTGTGCC TCCATAACCT CCCGGATTGC CGGAAGGCTC
TACGAAAAAG GACTCCGTTC AGGCGACGCT GTCGCCATAC TTTCACCGAA TAGTCCCGAA
TCGGCACTGC TGATGATGTC GCTGCTGGGA AACGGCCTGA TCGCGGCTCC CCTGAACCAC
CGCTTTCCTC CCGAACAGCT GCTGAAAACC CTGCAGGCTC TGCACCCAGA GATGGTGGTA
ACGGCCGATC CTGAAATCAT AAAGCCGGGA GAAAGCCCGT TCAAGGCGGA AAATATGCAG
GATATCGCGT TTGCAGCGTC GGAGCCTGAA AGCCCTGACA GGTCAGCTCC GAGGATGAAA
ATGGAGCGCC CCGTCACCAT CATCCACACC TCGGCAAGTT CGGGATTGCC GAAAGCCGCC
CAGCACAGCT TCGGCAACCA CTGGTACAGC GCACTCGGAG CGGCAAGGAA CATGCCGCTC
GGAAACGGTG ACTGCTGGCT GCTTTCGCTT CCCTTCTTCC ACATCGGAGG CTATGCCGTG
CTCTTCAGGG CTCTCGTATC CGGATCGGCC GTTGCTCTGC CGGACCCGCA TGATGCAATT
GAACGGAGCC TTGAGCGCTT TCCTGCAACG CACCTTTCAC TGGTACCTAC GCAGCTCTAC
CGGCTTCTCC GGAAACCGGA AACCCTGCCG ATCCTGAGAA AGCTCAAGGC CGTGCTGCTG
GGAGGAAGCG CCGTTCCGGC TCCGCTGCTT GCAGAATGCA TCCGGGAAGG CATTCCCGTC
TTTGTCAGTT ACGGCTCGAC GGAAATGAGC TCGCAGATTG CGACAACGCC AGCACCCGAC
GGATCGTTTC GGAAAAACTG CGGCAAACCG CTCCCCTGGA GGGAACTCGC AATTGCAGGT
GACGGAGAAA TTCTTGTCAG GGGCGCCTGC CTTTTTCAGG GATACCTCAA GAACAGCGCT
TCAGGCCGTC AGCCGCATCC GGAGCTGGAC AGCGAAGGAT GGTTTCACAC CGGCGATACC
GGAAGCCTCG ACGACAACGG CAATCTCTCG GTTTCCGGAC GCAAGGACAA CATGTTCATA
TCGGGCGGTG AGAACCTCCA CTGCGAAGAG ATCGAAGAAG CATTAAGCAC CGTCGAGGGA
ATCGAACAGG CTCTTGTGGT GCCGCTGGCA GACCGGGAAT ATGGCCAGAG AGCGGCAGCG
TTCATAAAAA CCGCACAACC GGGCACTCCT ACCGACGACG CCATTACCGA AACCATGCTG
AAAACCGCAG GAAGGCTGAA AACACCGGTA CTCTATATCA GAATTTGCCA ATGGGTAACG
TTGCCGGGAT CGCAGAAAAT CGACAGGAAA TGGTACAACC GGCAGGTAAG GGAAGGAAAA
ATCCATTAA
 
Protein sequence
MDLVNRASLL FDSSPALISP AATLSFRQCA SITSRIAGRL YEKGLRSGDA VAILSPNSPE 
SALLMMSLLG NGLIAAPLNH RFPPEQLLKT LQALHPEMVV TADPEIIKPG ESPFKAENMQ
DIAFAASEPE SPDRSAPRMK MERPVTIIHT SASSGLPKAA QHSFGNHWYS ALGAARNMPL
GNGDCWLLSL PFFHIGGYAV LFRALVSGSA VALPDPHDAI ERSLERFPAT HLSLVPTQLY
RLLRKPETLP ILRKLKAVLL GGSAVPAPLL AECIREGIPV FVSYGSTEMS SQIATTPAPD
GSFRKNCGKP LPWRELAIAG DGEILVRGAC LFQGYLKNSA SGRQPHPELD SEGWFHTGDT
GSLDDNGNLS VSGRKDNMFI SGGENLHCEE IEEALSTVEG IEQALVVPLA DREYGQRAAA
FIKTAQPGTP TDDAITETML KTAGRLKTPV LYIRICQWVT LPGSQKIDRK WYNRQVREGK
IH