Gene Cagg_1321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1321 
Symbol 
ID7268612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1631230 
End bp1632828 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content59% 
IMG OID643566163 
ProductAmidohydrolase 3 
Protein accessionYP_002462664 
Protein GI219848231 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000608926 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGATCA TTGTGCTGCG TAATGGCACT ATTTACACCC TTAATCCATC CCAACCAGTG 
GCACAAGCGC TGGCGATCCG TGGCGAGCGG ATTATAGCGG TGGGTGATGA AGCGACGGTA
CGGGCCGCTG CCGGCCCGCA AAGTGAGGTG ATCGACCTGC ACGGGCGGGC AGTAGTTCCC
GGACTGACCG ATGCGCACGT CCATATTGTG CTGCACGGCC TTGCCCGTCA GCAGGTACGC
CTGACTGGCT GCGCTGATTT CACGGCGGCG CTCGATCAGA TCGCGGTTGC GGCGCAGCGT
TTGCCGCCGG GTGCCTGGCT ACGAGGGAAT GGTTGGGATC ATACGTTGTG GGGAGGGTGC
TGGCCCACTC GTGCCGATCT CGACCGGGTG TGCCCGGATC GGCCAGCAAT GCTGGATCGC
AAAGACGGCC ATTCGCTGTG GGTCAATAGC CGCGTACTCG AGTTGGCCGG GATTACCGCT
GCTACTCCCG ATCCGGATGG CGGCCAGATT CAACGTGACG AGCACGGCGA ACCAACCGGC
ATCTTGCTCG AGACGGCCAT GGAGTTGGTG CGCGCGATTA TGCCGCCTCC CACTCGGGCC
GAGCGGTTGG CAGCACTGCG GTTGGCAATC AATGAGGCGT TGAGTTACGG TTTGACCAGT
CTTCATGTAC CACCGGCAAC GAATCCGGCC GATGGCCCTG ATACGCTTAT CGATTTGCAA
GCGTTACGCG CTGCCGGTGA TCTTACCATC CGTGTACTCG TTCACATTGC CGGTGCTCAT
CTTGATCATG CTATCGGATT GGGATTACGC AGCGGGTTAG GCGACGATTG GCTACGGATT
GGCGGCCTGA AACTGTTTGC CGATGGTTCA CTCGGCTCCG AGAGCGCCCA CATGCTAGCT
CCGTATGAAG GGCGTGATCA TACCGGCATC GCGGTCATTC CGCCTGCCGA GATGAAAGAG
ATTGTGACTC GCGCCAATGC TCACGGGATC AGTGTGGTAG TGCATGCCAT CGGCGACGCC
GCGAATCGCA GTGTGCTCGA TGCGATTGCC GCAGCACGTC CGACTGCCGC CCATCTTGCC
CTGCCCAACC GGATCGAGCA TGCCCAGATT CTTGCGCCGA CCGACATTCC GCGCTTTGCC
GAGCTTGGGG TTATTGCCTC AATGCAGCCG ATCCATTGCA CCGCCGATAT GGCGATGGCC
GAGCGGTTGT GGGGAACGCG CTGTACAACG AGCTATGCCT GGCGTAGTCT GCTGAATGCC
GGTGCTACGC TGGCGTTTGG TTCCGATGCG CCGGTTGAGA CACTCGACCC TTGGGCCGGC
ATTCACGCTG CTACAACCCG TCAAACGACC GATGGCACAC CGGTTAACGG TTGGTATCCT
GAACAACGAC TCACCGTTGC TGAAGCATTA GCAGCCTATT GTATTGGCCC GGCAATTACT
GAAGCCGCTG CTGAGCGTAA GGGGCGCTTA ATGCCCGGTA TGCTGGCCGA TTTGGCGGTG
TTGAACAATG ATCCCTTCCA GATACCGGTG TCTCATCTGT ATACGGTGCA TGCTGAGTTG
ACGATTGTGG GTGGAACAAT CGTATTTGAG AGGAATTGA
 
Protein sequence
MKIIVLRNGT IYTLNPSQPV AQALAIRGER IIAVGDEATV RAAAGPQSEV IDLHGRAVVP 
GLTDAHVHIV LHGLARQQVR LTGCADFTAA LDQIAVAAQR LPPGAWLRGN GWDHTLWGGC
WPTRADLDRV CPDRPAMLDR KDGHSLWVNS RVLELAGITA ATPDPDGGQI QRDEHGEPTG
ILLETAMELV RAIMPPPTRA ERLAALRLAI NEALSYGLTS LHVPPATNPA DGPDTLIDLQ
ALRAAGDLTI RVLVHIAGAH LDHAIGLGLR SGLGDDWLRI GGLKLFADGS LGSESAHMLA
PYEGRDHTGI AVIPPAEMKE IVTRANAHGI SVVVHAIGDA ANRSVLDAIA AARPTAAHLA
LPNRIEHAQI LAPTDIPRFA ELGVIASMQP IHCTADMAMA ERLWGTRCTT SYAWRSLLNA
GATLAFGSDA PVETLDPWAG IHAATTRQTT DGTPVNGWYP EQRLTVAEAL AAYCIGPAIT
EAAAERKGRL MPGMLADLAV LNNDPFQIPV SHLYTVHAEL TIVGGTIVFE RN