Gene Cagg_3776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3776 
Symbol 
ID7267850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4606345 
End bp4607853 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content57% 
IMG OID643568584 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_002465048 
Protein GI219850615 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.597037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.34641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCAC TGCCAATCGT TGCTACCCTC CTCCATTACG CAACTACCGA ACCGCTCCGA 
CCATGTGTTG TGGTTGAGAA TCACGTAATA ACGTATCGTG ATCTTGCAGC CGCATCTGCC
GGCTGGGCCA CCCGCTATCG CGACCTCGGC ATTGCACGTG GCGATCGTGT CGCGTTGGCT
CTGCCCAACT CACCGGCCTT TCTCGCTGCC TACTTCGGTG CTCAATTGGC CGGGGCAGCC
GTCGTTCTGG TGAACCCACA GTATCGCCAC GCCGAACTGA GCCATCTACT TGCCGATGCT
GAACCACTCA TTGTAGTTGC AACTGACGAG AACGAAGCGA TCTTGCGTGA GGCAATGACG
GCTCCACACC CACACCTGAT CAAACCCGAC GCATCGCTGT GTGGTGCATC ACCGGTGGAC
CCGACTGCAT TTTCACCGCC GGCTGCCGAC GATATGGCAC TGATCGCGTA CACATCGGGC
ACCACCGGTC GAGCCAAAGG AGCAATCCAC ACGCACGCCA GCCTTGCCGC AAATTGCGAT
GCGGTTATCC GCGCTTGGCG CTGGACCGAG GCCGATCGTT TACTCTTGAT GTTGCCACTG
TTTCACGTCC ATGGGCTAGG TGTAGGTGTC CATGGCACGA TCCGAAGCGG CGCGAGTCTT
GAATTGCACG CACGATTTGA TGCCGAACTG GCCTTGCAAC GCATGGCCGA CCCAGCTATT
ACCCTCTTTT TCGGCGTACC AACGATGTAT GTGCGGTTGA TCGAAGCAGC ACGCCAGCAC
GGCGTTCCTC GTCATCGCAT GCGACTGTTT GTTTCCGGTT CGGCCCCACT CAGCCCGCAG
ACTTTTGCCG ATTTCGCTGA CCTCTTCGGA CAACCCATCC TCGAACGCTA TGGCATGACC
GAAACGGGGA TGAATTTGAC CAATCCCTAC GAAGGCGAGC GCCGTCCCGG CAGTGTTGGT
ATGCCATTTC CCGGTCAAGA GGCCCGAATT GTCGATCGGA CAACACGCCA ACCGCTACCG
GCAGGCGAGG TTGGCGAGAT CCAAGTACGC GGACCACACC TCTTCCGTGG CTACTGGCGT
AATCCGAGTG CTACTGCGGC TGCGTTTACC GAAGATGGCT GGTTTAACAC CGGAGATGTC
GGGTTCGTTG ATACCGATGG CTATGTTCAC ATTACCGGTC GTAGTCGTGA ACTCATCATC
AGTGGCGGTT ACAACATCTA CCCTCGTGAA GTCGAGGAGG TTCTCGCCCA ACATCCGGCA
GTCGCTGAAT GCGCCGTTTA CGGCCAACCC GATCCCGATC TCGGCGAAGT ACCGGTGGCC
GATGTAGTGA TACGATCAGG CATCCACACC ACAGCACAAG AACTGATCGA TCATTGTCGT
CAGCAACTGG CTGCGTACAA ACGGCCACGC CAGATCCGCT TCGTCACGGC ATTACCACGC
AATGCCATGG GAAAGGTACA ACGTCATCTA CTCGGAATCG ATCCACCCGT AGCCGAGGAA
CAGCGATGA
 
Protein sequence
MSPLPIVATL LHYATTEPLR PCVVVENHVI TYRDLAAASA GWATRYRDLG IARGDRVALA 
LPNSPAFLAA YFGAQLAGAA VVLVNPQYRH AELSHLLADA EPLIVVATDE NEAILREAMT
APHPHLIKPD ASLCGASPVD PTAFSPPAAD DMALIAYTSG TTGRAKGAIH THASLAANCD
AVIRAWRWTE ADRLLLMLPL FHVHGLGVGV HGTIRSGASL ELHARFDAEL ALQRMADPAI
TLFFGVPTMY VRLIEAARQH GVPRHRMRLF VSGSAPLSPQ TFADFADLFG QPILERYGMT
ETGMNLTNPY EGERRPGSVG MPFPGQEARI VDRTTRQPLP AGEVGEIQVR GPHLFRGYWR
NPSATAAAFT EDGWFNTGDV GFVDTDGYVH ITGRSRELII SGGYNIYPRE VEEVLAQHPA
VAECAVYGQP DPDLGEVPVA DVVIRSGIHT TAQELIDHCR QQLAAYKRPR QIRFVTALPR
NAMGKVQRHL LGIDPPVAEE QR