Gene Cagg_1394 details


Gene Information

Locus tag: Cagg_1394
Symbol:
ID: 7267246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1719538
End bp: 1721049
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 56%
IMG OID: 643566237
Product: Carboxypeptidase Taq
Protein accession: YP_002462737
Protein GI: 219848304
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
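
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1719538, 1721049)
print(length, protein_length(length))  # prints: 1512 503
```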


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.272507
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCCA GGTTACAAGA ACTTCGTGCC CGTCTCCTCG AGATCGACGA TATTAACAGT 
GCTGCCGCAG TGTTGGGGTG GGATCAGAGT ACGTATATGC CGCCCGGTGG TGCTGCGGCC
CGCGCACGCC AACTGGCGAC CCTCTCGCGG TTGGCTCATG TGCGGAGCAC CGATCCGGCG
TTGGGTGCTC TGTTGGCGGA ATTGATGCCG TATGCTGAAC AATTGCCCTA CGACCATCCC
GATGCCGCAC TGATTAGGGT AGCCCATCGC AACTACGAGC GCATGACCCG AATTCCGGTG
GAGCTGGCAA GTGAGATCGC TTCGCATACT GCTGCGAGCT ATCAGGCATG GACGCAGGCG
CGACCGGCGA ATGATTTTGC TACGATGTTG CCCTATCTCG AGCGAACGCT TGAGCTGAGT
CGCCGGGTGG CCGATTGTTT CCCCGGCTAT GATCATCCCG CCGATCCGCT GATCGACTTT
AGCGACTACG GGATGCGGGC ATCTGAGATT CGCGATCTCT TTGCCCGACT GCGTGCCGGG
CTGACCCCGA TCATTCGCGC GATTGTGGCC CAGCCGCCAA TTGACGATTC GTGTCTGCGC
AAGTATTACC CGCGCAACGA TCAATTGGCT TTTGGTGAGC AGATCATTCG TCGTTTTGGG
TACGATTTCG AGCGCGGTCG GCAGGATTTA ACCCATCATC CGTTTGCTAC GAAGTTTTCG
ATTGGCGATG TGCGGATTAC CACCCGTATC AACGAGCACG ATCTCGGAGA TGGTTTGTTT
AGCACTCTGC ACGAGTCGGG TCATGCGATG TATGAGCAGG GGATCGACCC GGCGTTTGAA
GGGACACCGC TCTGTAACGG TGTTTCGGCG GGTGTTCACG AGAGTCAATC GCGTTTGTGG
GAAAATCTGA TCGGTCGTTC TCGGCCATTC TGGGAACATT TTTACCCTGA ATTACAACAG
ACCTTCCCGC AGCAGTTAGG GAATGTTTCG CTCGACGAGT TCTATCGAGC GATTAACCGT
GTGCAACCGT CGCTCATCCG TACCGATGCC GATGAGGTGA CGTACAACCT CCACGTGATG
ATCCGGTTTG ATCTCGAGTT AGCGTTGCTC GAAGGCAGTC TGAAGATCAC CGATCTGCCT
GAAGCGTGGA ATGCTCGCTA TGCAGAGGAT TTGGGGGTTG TCGTCCCCGA TTACCGTGAT
GGCGTGTTGC AGGATGTGCA TTGGTTTGGT GGATTGATCG GTGGTGCGTT TCAGGGCTAT
ACCATCGGTA ATATCCTGAG CGCGCAATTT CTGGCCGCGG CGCGGTCTGC TCACCCCGAA
ATCGATGCTG AGATCGGACA GGGTGAGTTT GCGACGTTGC ATGGATGGTT GCGGGAGCAT
ATCTACCGTC ACGGTAGTGT CTTTACGCCT GCGGAGTTGA TCGAGCGGGC AACCGGTCGG
TCAATGCAGA TCGAGCCGTA CCTCCAATAC CTACGGCAGA AGTATTCGGC AATTTACGGG
ATCGAGTTAT AG
 
Protein sequence
MESRLQELRA RLLEIDDINS AAAVLGWDQS TYMPPGGAAA RARQLATLSR LAHVRSTDPA 
LGALLAELMP YAEQLPYDHP DAALIRVAHR NYERMTRIPV ELASEIASHT AASYQAWTQA
RPANDFATML PYLERTLELS RRVADCFPGY DHPADPLIDF SDYGMRASEI RDLFARLRAG
LTPIIRAIVA QPPIDDSCLR KYYPRNDQLA FGEQIIRRFG YDFERGRQDL THHPFATKFS
IGDVRITTRI NEHDLGDGLF STLHESGHAM YEQGIDPAFE GTPLCNGVSA GVHESQSRLW
ENLIGRSRPF WEHFYPELQQ TFPQQLGNVS LDEFYRAINR VQPSLIRTDA DEVTYNLHVM
IRFDLELALL EGSLKITDLP EAWNARYAED LGVVVPDYRD GVLQDVHWFG GLIGGAFQGY
TIGNILSAQF LAAARSAHPE IDAEIGQGEF ATLHGWLREH IYRHGSVFTP AELIERATGR
SMQIEPYLQY LRQKYSAIYG IEL
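
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step follows, using only the Python standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), so the codon table here is built from the standard assignments. The `translate` helper is hypothetical and ignores alternative start codons, which is sufficient here because the gene begins with ATG.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is removed first.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGAGTCCA GGTTACAAGA ACTTCGTGCC"))  # prints: MESRLQELRA
```

Passing the full gene sequence to the same function should reproduce the 503-residue protein listed above.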