Gene Cagg_1159 details


Gene Information

Locus tag: Cagg_1159
Symbol:
ID: 7267908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1428603
End bp: 1430063
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 58%
IMG OID: 643566003
Product: O-succinylbenzoate-CoA ligase
Protein accession: YP_002462505
Protein GI: 219848072
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01923] O-succinylbenzoate-CoA ligase
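
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of this length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1428603, 1430063)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```

Both results match the Gene Length (1461 bp) and Protein Length (486 aa) fields of this record.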


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.498459
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000208098
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTATCTTC CCGACTGGCT AGCCCGGCAA TCACTACTCC GGCCACACCA TCCGGCGTTG 
ATCGGTGCTG AAGCGACATA TACGTTTGCC GAGCTTGATC GCTGGGTCAG TGTGGTGGCG
GATCGGTTGC GCCAAAGGGT ACCGGTCGGG TCGCGGGTTG CTTTGCTGGC CCGTAACCGG
TTGGCGTATG CCGCAGTAGT GCATGCTGCA CCACGGGCAG GTGTGACGCT TGTTTTGCTC
AATACTCGTC TGACACCTGC TGAGCTTGCC TTTCAAGTGC GTGATAGCGC ACCATCTTTG
CTGATCGCCG AAGCTGAACT TTCGACCAAA ATACACGAAG CAGCGTATGG CGTGCCGATT
GTTACGCTTG AGGAGCTGAC TGCACCGACA ACAACAATTT CACCTTCACC GGCTCCGCCA
ATCGACCTCG CGGCTCCGCA TACGATCATC TACACTTCGG GAACGACCGG GCAGCCGAAG
GGTGCGATTC TGACCGCCGG CAACCATTGG TGGAATGCAG TCGGCTCGAT GCTCAACCTT
GGCCTGCACG ACGATGACCG TTGGCTGGCG GTGCTACCGC TTTTTCACGT TGGTGGGTTG
AGTATATTGC TGCGTGGTGT TATTTATGGC ATACCGGTCG TGTTGCATGA GCGGTTCGAT
CCGGCGTTGG TTAAGCGCGA TCTTGCCGAG CAGCGTATCA CGATTGTCTC ACTCGTTGCC
GTGATGCTCC AACGATTGCT CATAGTTGAT TCCACACCGT TTCCTGCTCA CTTGCGCTGC
GTCTTGCTTG GTGGTGGTCC GGTGCCGCAA ACGTTGCTCG AACAGTGTGC CGCGCGCGGA
ATTCCGGTCA CGCAGACGTA TGGTATGACT GAAGCGGCTT CGCAGGCGGC AACGCTCGCG
CCCGCCGAGG CATTGCAACG ACTCGGGTCG GCGGGTAAGC CGTTGTTGCC GGTTGAACTG
CGGATCGTTC GGTCGGATGG GAGCGAGGCT GCTGCCGGAG AAGTGGGGGA AATTTGCTTG
CGTGGACCAA CGCTGTCGCC AGGGTATCTG GGGATGCCGT CGCGTCGGCC TGATGAGTGG
TTCCGTACCG GCGATATGGG CTATCTCGAT GCTGATGGTT ATCTCTATGT GGTTGACCGA
CGCAGTGACT TGATCATTGC CGGTGGTGAG AATATCTATC CTGCCGAAGT TGAAGCTGCG
CTCCTCAGTC ATCCGGCTGT GGTCGAAGTG GGAGTGGTTG GGTTGTCCGA CCCGGAGTGG
GGGCAGCGTC CGGTAGCCGC AGTCGTTGTG CGCTTCCCGG TCACGGCTGA GAGTTTAATA
GCCCACTGTC GTGAACGTCT CGCCGGTTAT AAAGTGCCGC GCACCATTGT GTTTGTCGAT
GAGCTACCTC GTACTGCGGC AGGCAAACTG CGTCGTCACC AACTCCGGGA ATGGATGCTG
GAGCGCGGTG TGACCGCATG A
 
Protein sequence
MYLPDWLARQ SLLRPHHPAL IGAEATYTFA ELDRWVSVVA DRLRQRVPVG SRVALLARNR 
LAYAAVVHAA PRAGVTLVLL NTRLTPAELA FQVRDSAPSL LIAEAELSTK IHEAAYGVPI
VTLEELTAPT TTISPSPAPP IDLAAPHTII YTSGTTGQPK GAILTAGNHW WNAVGSMLNL
GLHDDDRWLA VLPLFHVGGL SILLRGVIYG IPVVLHERFD PALVKRDLAE QRITIVSLVA
VMLQRLLIVD STPFPAHLRC VLLGGGPVPQ TLLEQCAARG IPVTQTYGMT EAASQAATLA
PAEALQRLGS AGKPLLPVEL RIVRSDGSEA AAGEVGEICL RGPTLSPGYL GMPSRRPDEW
FRTGDMGYLD ADGYLYVVDR RSDLIIAGGE NIYPAEVEAA LLSHPAVVEV GVVGLSDPEW
GQRPVAAVVV RFPVTAESLI AHCRERLAGY KVPRTIVFVD ELPRTAAGKL RRHQLREWML
ERGVTA
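
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a basic sanity check on the coding sequence. The function names are hypothetical, and the set of start codons is an assumption covering only the most common bacterial starts (ATG, GTG, TTG); translation table 11 permits a few others.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Assumed subset of start codons; the stop codons are those of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Check length, start codon, and stop codon of a candidate CDS."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input for illustration only, not the gene sequence of this record.
toy = clean_sequence("ATG AAA\nTGA ")
print(toy)                         # ATGAAATGA
print(round(gc_fraction(toy), 3))  # 0.222
print(looks_like_cds(toy))         # True
```

Pasting the full gene sequence from this record into `clean_sequence` should give a 1461-character string that starts with ATG and ends with TGA, and its GC fraction can then be compared with the GC content field.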