Gene Cagg_2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2224 
Symbol 
ID7266797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2722041 
End bp2723231 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content58% 
IMG OID643567055 
Productamidohydrolase 
Protein accessionYP_002463543 
Protein GI219849110 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.355102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.809475 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTG AACGCGCCCA GGCACTCGCC GATGAGCTTA TCCGCATCCG TCGTGATATT 
CACGCCCATC CCGAACTTGG GTTTCAGGAA CACCGTACCG CTGCCCTTGT TGCTGAAACG
TTGCAAGAGA TCGGAGGGAT CAAGATTACC ACCGGCGTAG CCAAGACTGG TGTGATCGGT
GAACTCGGTG ATGGCGATGG GCCGGTTATC GCGATCCGGG CCGATATGGA TGCGCTGCCC
ATTCTGGAAG AGAACAACGT GGAGTATGCC TCAACGAACC CCGGGGTGAT GCATGCTTGT
GGTCACGACG CGCATACGGC GATGCTGCTC GGCGCTGCCC ATCTCTTACG CGAACGTTTT
GCCGCCGAAC ATTTGCGCGG GCGTGTGCGT TTTCTCTTTC AACCTTCTGA AGAAGGGTGG
GACGATGAGG CGAAGAGCGG TGCCCTCCGT ATGGTTGAAG AAGGCGCATT GCAAGGGGTC
GATGCCGTCA TTGCGCTGCA CGTCGATTCA ACCCTGCCGG TTGGGCAAGT CACGATTCGC
GGTGGTTGGT CGTCGGCAGC CGTTGATGAT TTTAAGGGGT ATATTCGCGG GACAGGTGGT
CACGGGGCGT ACCCACATCT CGGCACCGAT CCGGTCTTTA TGCTGTCGCA TGTGCTGAAC
GCTCTGTTTG GCATTCGCTC ACGCCTGATC AACCCGATGG AGCCGGCGAT CCTCAGTGTG
GGGACGGTGC GTGGTGGTCA TGCTTCAAAT GTGATTCCTA GTGAGATTTT TGTGCAGGGA
ACACTGCGTA GTTTTAGCGA AGAGGTACGG GCGAAACTTG CCAAAGAGGT TGAGCGTGCG
TTTGCCGTGG CCGAAGCGTT CGGTGGTAGC GCCGAGGTGA AGATCACCCG TGGCTATCCC
GCTGGCTGGA ACGACGAACG GGTGGCTGAG TGGATGAGTC AGGTCGCCGG TGAATTCCTT
GGAGCTAACG CGATTGATCG CTCGCGCACC GGTATGGGCG CGGAAGATTT TGCCTATATG
ACCCAGCAAG CGCCCGGCGC GATGTTGATG CTCGGTGCTG CGATTGACGA CGGTAAAGTA
CGTGCTCACC ATACACCCAT CTTCGATATC GACGAGCGAG CACTCCCGAT CGGTACTGCT
ATCTTGGCCG AAACGGCATT GCGTTTCTTG CGCGGTGAGG TGTCGTTGTA G
 
Protein sequence
MLLERAQALA DELIRIRRDI HAHPELGFQE HRTAALVAET LQEIGGIKIT TGVAKTGVIG 
ELGDGDGPVI AIRADMDALP ILEENNVEYA STNPGVMHAC GHDAHTAMLL GAAHLLRERF
AAEHLRGRVR FLFQPSEEGW DDEAKSGALR MVEEGALQGV DAVIALHVDS TLPVGQVTIR
GGWSSAAVDD FKGYIRGTGG HGAYPHLGTD PVFMLSHVLN ALFGIRSRLI NPMEPAILSV
GTVRGGHASN VIPSEIFVQG TLRSFSEEVR AKLAKEVERA FAVAEAFGGS AEVKITRGYP
AGWNDERVAE WMSQVAGEFL GANAIDRSRT GMGAEDFAYM TQQAPGAMLM LGAAIDDGKV
RAHHTPIFDI DERALPIGTA ILAETALRFL RGEVSL