Gene Cagg_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1779 
Symbol 
ID7267691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2182346 
End bp2183926 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content60% 
IMG OID643566620 
ProductAmidase 
Protein accessionYP_002463115 
Protein GI219848682 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.455319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.676292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACC CAGATCCAAC CAGCGCGATA GGCGCTCTCG CCTATGAAGC GACCATTGCC 
CAGTTGCAAG CGGCTATGGA TAGCGGCGCG ATCAGCGCTG AAGCCTTGAC GATGGCCTGT
CTCGAACGTA TTGAAGCGCT TAATCGGGCG GGACCGTGTC TCAATGCGGT GATCGAGATC
AGCCCATCGG CGCTCAAAAC GGCGATTGCC CTCGATACCG AGCGTAATGC GCACGGGCCA
CGTAGTCCAT TGCACGGCAT TCCAATCCTC CTGAAAGACA ACATTGACAC GCTCGATGAT
ACCGCCACGA CGGCCGGTTC GTTGGCCCTA CTCGGTTCGC GACCGGCAGC CGAGGCGACG
GTTACGTCTC GGTTACGGGC TGCCGGGGCG GTGATTTTGG GCAAAGCGAA CATGAGCGAA
TGGGCCAACT TCCGCAGCAC CGCATCCTCA AGTGGGTGGA GTGCGCGCGG CGGGCAGGCA
CGTAATCCGT ATGTCCTCTC GCGCAGTCCC TGTGGTTCAA GTTCGGGATC GGCAATTGCC
GTCGCTGCCA GTATGTGTGT GGTCGCTATC GGTACCGAGA CCGATGGCTC AATCTCGTGC
CCGTCGGCGT TGTGCGGAGT GGTAGGGATT AAGCCGACGG TAGGGCTGAC CAGTCGTGCC
GGGGTCGTTC CGATCAGTTT TACGCAGGAT ACCGTCGGGC CACATGCCCG TTGTGTCGCC
GATGCGGCAA CGGTGCTGGG TATCATTGCC GGGCCTGATC CACGTGATCC GGCGACGGCG
GCAGCAGCCG GCCATGCTCG CCCCGACTAC CGCACCTGTT TGCAAGCCGA TGCGTTGCGC
GGCGCACGGA TTGGCGTGCT GCGCAGCGAT CGCTTTGCCG GCTTTGGTCG TCACGTTGAA
CAGGCCTTTG CGAACGCATT GACGGCGATG ATTGACGCCG GCGCGCATAT CGTTGATCCG
GTAACGCTAC CTGATGACCT GCTCGCTTTC GGCGAGGCCG AATTGACGGT ACTCATCTAC
GAGTTCAAAG ATACGCTCAA CCGGTATCTT GCCTCTCGTG TACCCGATCC GCAGGCTACC
GATCCGCCAC CACATTCGCT GGCCGAACTG ATTGTCTTCA ATGAGCGGCA TGCCGAGCAC
GAATTGCGGT TCTTTGGCCA AGAGCTGTTG TTGCAAGCAG CAGCGGTCGG TGATCTCAAT
GATCCGGCGT ATCAGCAGGC GTTAGTCGCT AGTCGTGATA CTGTGCGGCA GGCGCTCGAT
ACCGTGCTTT ACGAAAAGCA GCTTGATGCG TTGGTCGCGC CGGCGACAGG TCTTGCCTGG
CCGATTGATC TCATTGCCGG TGATCGCTAT CCCGGCGGCA GCAGCTCGCT CGCGGCACGA
GCCGGCTACC CGATGGTCAC GGTGCCGGCA GGGATGGCAT TTGGTTTACC TATCGCTATC
AACTTCATCG GTGGAGCGTG GTCGGAACCA ATGCTGATCA GATTAGCGTA TGCCTTTGAG
CAGGCAACTC GTTGGCGTCG CCCACCAACG TACCGGCTGT GGCTCGAAGG AGATAATGAG
CTGCCTACGG TGACCGCATA G
 
Protein sequence
MSNPDPTSAI GALAYEATIA QLQAAMDSGA ISAEALTMAC LERIEALNRA GPCLNAVIEI 
SPSALKTAIA LDTERNAHGP RSPLHGIPIL LKDNIDTLDD TATTAGSLAL LGSRPAAEAT
VTSRLRAAGA VILGKANMSE WANFRSTASS SGWSARGGQA RNPYVLSRSP CGSSSGSAIA
VAASMCVVAI GTETDGSISC PSALCGVVGI KPTVGLTSRA GVVPISFTQD TVGPHARCVA
DAATVLGIIA GPDPRDPATA AAAGHARPDY RTCLQADALR GARIGVLRSD RFAGFGRHVE
QAFANALTAM IDAGAHIVDP VTLPDDLLAF GEAELTVLIY EFKDTLNRYL ASRVPDPQAT
DPPPHSLAEL IVFNERHAEH ELRFFGQELL LQAAAVGDLN DPAYQQALVA SRDTVRQALD
TVLYEKQLDA LVAPATGLAW PIDLIAGDRY PGGSSSLAAR AGYPMVTVPA GMAFGLPIAI
NFIGGAWSEP MLIRLAYAFE QATRWRRPPT YRLWLEGDNE LPTVTA