Gene Cagg_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2017 
Symbol 
ID7269175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2475441 
End bp2477051 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content56% 
IMG OID643566851 
ProductNLP/P60 protein 
Protein accessionYP_002463341 
Protein GI219848908 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.474847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAGC TGCAACACGA TCAATCTGGT GATCTACTAC CCCTTGAATT ACAACGCTTA 
CGTCAACGCG CTGTTCACCA TCGGAGTAGA CACCGACAAC GCTCGCGACC GTTTGCTACA
ATCGTCGCCT TTGTCCGTAC CTCCTTAGAT CTCCAACAAC GACTGAACCG ACTTCCTTCC
CGCTATCTTT TCCATGCGGT AATTGCCCTC TTTCTACCGG TCGCTCTCCT GTTAGGGCAA
TTGCCCCTCC GCCCGGTGAA GCCTATCCCC GTCCAACCCA CCGCTGCACC TGTGGCCCCC
GACGGAGACG TTCCGCTCAT CATGCTCGGC CCGATTAACC TCGACCAAAC GCGCACCGGT
GAGGTAGTCG GCGATCCACC CCTTTCTGCC GATGAGTCGT TACCGGTACC ACTCACAATC
CTCTCGCGCA GTGAAATCAA CGCTCCGCTC ATTGTGCCGG CCACCGTTAC CGCCGACGTA
GCCAAAGTGC GTAATGGTCC CGGTCTTGCC TACGACGACA TCGCCCGACT GAATGGTGGT
ACCACCATTG AAGTTGTTGG TCGGCATAAC GAGTGGCTCC AGTTCCGTAC TACCGACGAT
CCTACTTTGC GCTGGATCGC CGCTGAGCTG GTCGATTTGC CGGAAGCGAT CTTCTACAAC
CTCAAACCGG TAGACGAATC CACTATTCCC CCTCCACCCC CACCCAAGGT GGCAATCGTG
CGTGAAGACG GCCTTAATCT GCGTGATGGC CCCGGCACGA ATTATGTCAG TATGAAACGG
TTGACTGCCG GTGAAGAGTT GAACCTGGTC GAACAGTATA ACGGCTGGTT CCTTATCGAG
ACCGGTGGTA TCTACGGATG GGTAACCTCT GAGTTTCTCA ACATCGCTCC CGGTGTGATC
GAACGAGTGC CGGTCGCGTC ATCGATCCCC GATCCGAACC CACCGCTTGT CGGGTCTGTG
CTCGAAAACT CGGTTAATCT CCGCAAAGGT CCCGGTTCTG CATACGAACG GATTGGCTCG
ATCAATGCCG GCGCTGACGT GAAGCTGCTG GCTCGCCACA AAGATTGGTA CCGCGTCGAG
CTAAGCAATG GGACGCGAGC ATGGATCTAT TCGGAGTTGT TGGGTGTGAC ACCGATGGCC
GCTCGCCGCG TGCCCTATAC GAATGATATC CCACCCCTTC CCAACCGTGC TCGCCTGGCT
AATAGCGGTC CGGTGAATAT TCCGGCCAGC GGTGATGTCG CCAGCTATGC CGTGCAGTTT
GTCGGGTATC GCTATGTCTG GGGCGGCGCT AGTCCACGTA CCGGTTTCGA TTGTAGTGGC
CTCACGTGGT ACGTTTATCG CCAGTTTGGC GTTAATCTAC CGCGTACCGC TGCCTCCCAG
TTCAACTCTC GATACGGTGC TGTGATTGGT AATCTCAATA ATCTGGCTCC CGGTGACCTG
ATGTTCTTCG CCAATACCGG CGGTGGCCGA GGGATTACCC ACGTCGCGAT CTATATCGGC
GGTGGTCAAA TGGTGCATGC GATGACGCCT GCCTACGGTG TCCAAATCTC AAGTATTTGG
GGTGCGTATT GGACCAGCCG TTTTGTGGGT GCGATTCGAC CGTATCGCTA A
 
Protein sequence
MIELQHDQSG DLLPLELQRL RQRAVHHRSR HRQRSRPFAT IVAFVRTSLD LQQRLNRLPS 
RYLFHAVIAL FLPVALLLGQ LPLRPVKPIP VQPTAAPVAP DGDVPLIMLG PINLDQTRTG
EVVGDPPLSA DESLPVPLTI LSRSEINAPL IVPATVTADV AKVRNGPGLA YDDIARLNGG
TTIEVVGRHN EWLQFRTTDD PTLRWIAAEL VDLPEAIFYN LKPVDESTIP PPPPPKVAIV
REDGLNLRDG PGTNYVSMKR LTAGEELNLV EQYNGWFLIE TGGIYGWVTS EFLNIAPGVI
ERVPVASSIP DPNPPLVGSV LENSVNLRKG PGSAYERIGS INAGADVKLL ARHKDWYRVE
LSNGTRAWIY SELLGVTPMA ARRVPYTNDI PPLPNRARLA NSGPVNIPAS GDVASYAVQF
VGYRYVWGGA SPRTGFDCSG LTWYVYRQFG VNLPRTAASQ FNSRYGAVIG NLNNLAPGDL
MFFANTGGGR GITHVAIYIG GGQMVHAMTP AYGVQISSIW GAYWTSRFVG AIRPYR