Gene Cagg_1611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1611 
Symbol 
ID7268177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1964658 
End bp1965995 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content58% 
IMG OID643566452 
Productamidohydrolase 
Protein accessionYP_002462948 
Protein GI219848515 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0170905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00285374 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTACGATC TTCTCATTCA GCACGTTGAT GTTCTCCAGA TCGCCAATGG CGCTCCTACC 
ATTCTGCCTC GCCACGACCT TGCCATCACC GATCGACGCA TTAGCGCAAT CGCTCCGGCG
ATTAGTCCCG GCCTCGCCCG TGAGGTGATT GACGGTGAGG GGCATCTAGC CATCCCCGGT
CTGATCAATA GCCATGCCCA TACCGCAATG AGCCTCTTTC GGGGGGTAGC TGAAGATGTA
CCGATTGAAG AGTGGTTTAA CCGCTTTATC TGGCCACTCG AAACGAATCT GACCCCGGAA
GATGTGTATT GGGGTACGTT ACTCGGTCTG GCCGAGATGA TCGAAGCCGG GGTGACATGC
GTCGCCGATC ACTATTTTGC GACGGATGCT ATCGCTCAGG CGGTGCAGGA ATCGGGAATG
CGTGCATTGT TGGCGTGGAC GCTCTTTTCC GGCGCCGATG AGGATACCCA GCTTAACAGC
GCACGCCGAT TTACCGAGCA GTGGCATGGT ACTGCCGGTG ATCGCATTCG GGTTTGGATG
GGACCACACT CGCCTTATAC CTGTACTCCT TCGTTCTTGA GCCGTATCGC GCGAACCGCG
CGTGAACTGG GAGTAGGAAT TCACATTCAT TTAGCCGAGA CGGCCGGTCA AGTGTCACAG
AGTATCGCGA CCTATGGTCG TTCGCCGGTG ATGGTAGCGT ATGATGCGGG ATTGTTTGCC
GGGCCGGCCC TGGCTGCCCA CGTTGCTCAT GTCTCACCAG AAGATATTGC CGTCCTTGCG
ACGCATGGGG TGGCGGTTGC GGTCACGCCG AAGACCGAGA TGAAGCTGGG GATCGGTGTT
GCACCGGTGA CAACCATGCG GGCAGCAGGG GTAACGGTTG CCTTGGGGAG TGATGGGGCG
GCGAGTAACA ATACCTACGA TGTGCTCGAA TCGGCGCGGT TACTCGCACT GCTCGAAAAA
CTGCGCACCG GCGATGCCCG AGTTATGCCG ATTGGAACGG TGCTCGAGTT GGCGACTGTT
GCCGGTGCGC AGGCTTTGCA CTGGGAAGGG ATTGGTGTTT TACAACCCGG TGCGCGTGCC
GATCTAGCTT TGATACAGTA TGCTACCGCG CATACCCAGC CGGTACACGA TCCGGCGGCA
GCGCTCCTCT ACAGTAGTCA GCCCGCCGAT GTGCGTACCG TGATTGTGGA TGGTCGCGTC
TTGATGCGTG ATCGCGTTTT GCTCACCATC GATAAGCCGC GAGTGCTGCG TGAGGTGGTT
GCACGGATAG AGCGCCTCAC GCAGTATCAG CTCGATAAGC GGATAGCAGT GTATCCTGAA
GCCAGAACCG ATGCGTAG
 
Protein sequence
MYDLLIQHVD VLQIANGAPT ILPRHDLAIT DRRISAIAPA ISPGLAREVI DGEGHLAIPG 
LINSHAHTAM SLFRGVAEDV PIEEWFNRFI WPLETNLTPE DVYWGTLLGL AEMIEAGVTC
VADHYFATDA IAQAVQESGM RALLAWTLFS GADEDTQLNS ARRFTEQWHG TAGDRIRVWM
GPHSPYTCTP SFLSRIARTA RELGVGIHIH LAETAGQVSQ SIATYGRSPV MVAYDAGLFA
GPALAAHVAH VSPEDIAVLA THGVAVAVTP KTEMKLGIGV APVTTMRAAG VTVALGSDGA
ASNNTYDVLE SARLLALLEK LRTGDARVMP IGTVLELATV AGAQALHWEG IGVLQPGARA
DLALIQYATA HTQPVHDPAA ALLYSSQPAD VRTVIVDGRV LMRDRVLLTI DKPRVLREVV
ARIERLTQYQ LDKRIAVYPE ARTDA