Gene Cagg_1955 details

Gene Information

Locus tag: Cagg_1955
Symbol:
ID: 7268871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2388516
End bp: 2389502
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 62%
IMG OID: 643566793
Product: urea amidolyase related protein
Protein accession: YP_002463286
Protein GI: 219848853
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
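
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein; the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2388516
end_bp = 2389502

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 987 328
```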


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00247054
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCTACGT TTCTCGACAT CATCGCGGCC GGGTCGCTAT TGACAATCCA AGACGGTGGA 
CGAACGACCG CGCGACGCTA TGGGGTGCCG GTAGGAGGAG CGATGGATCG GTTTGCGTTA
GCTGTAGCTA ACCGCTTAGC CGGTAATCAG CCTTACGTAC CGGCGTTCGA GATCACTGCT
GGTGGAACCC AGATCCGTTG CAGCACAACT ATCACCATCG GTCTGGCCGG CGCCGATTTG
CAAGCGCGGC TCAACGATAC ACCGCTTGTC CCGTGGCATA GCGCCGTCGC TCCGGCCGGC
AGTACCATCA CCTTTGGCGG ACGACGTGGC GGCTGGGGCG GCCGTGCGTA TCTGGCGGTC
GCCGGTGAAC CGGTCGTCGA GTGGGCGATC GGTGGTGCGG GTACCTGTTT GGCCGGTGGC
TTTGGTGGTT ATCAGGGACG AGCATTACGA GCCGGTGATC GGATCGCCGT TCAAGCTCGA
CCGGCGATGG CAGTTGATGG AATGCGTTGG TGGCCGGTGG ACCGGCGTCC ACCCTACGGC
CCTCAACCAC GCTTGCGCGT CATTCCCGGT CCTCATGCGG ATCAATTACC GATGGCGTGG
ACCGGGCTAC TATCCGCGAC ATGGCAGATC GACCAGGCGG CCAGCCGGCA AGGCTACCGA
CTGACCGGTG CCATCCTACC TTCGTTTACC CACTCGCTAA CCTCATTCGG AATCGTACCG
GGCGCGATCC AGTTACCACC CGATGGCCGA CCGATCCTGT TAATGGCCGA TGCCCAAACG
ACCGGCGGCT ATCCGGTGAT TGCCGTCGTC ATCGGTGCCG ATCTCCCGCT AGCCGCACAA
CTCTTACCCG GCGACCGGCT CACATTTGTC GCAAGCGATC TGGCTACTGC CAAAGAATCA
TTAGCCCAGC AGTCGGTATG GCTGACTGCC GGGCCTGAAG ATGATGAGAA CGGATGGTTG
CTGGCCCAAG CGGGCGCAAT ACGGTGA
 
Protein sequence
MATFLDIIAA GSLLTIQDGG RTTARRYGVP VGGAMDRFAL AVANRLAGNQ PYVPAFEITA 
GGTQIRCSTT ITIGLAGADL QARLNDTPLV PWHSAVAPAG STITFGGRRG GWGGRAYLAV
AGEPVVEWAI GGAGTCLAGG FGGYQGRALR AGDRIAVQAR PAMAVDGMRW WPVDRRPPYG
PQPRLRVIPG PHADQLPMAW TGLLSATWQI DQAASRQGYR LTGAILPSFT HSLTSFGIVP
GAIQLPPDGR PILLMADAQT TGGYPVIAVV IGADLPLAAQ LLPGDRLTFV ASDLATAKES
LAQQSVWLTA GPEDDENGWL LAQAGAIR
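
The protein sequence begins with M although the gene sequence begins with GTG, because GTG can serve as an initiator codon under translation table 11. The following is a minimal Python sketch of that translation step, not the tool used to produce this record. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical, and `START_CODONS` holds only three common bacterial initiators rather than the full table 11 list.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# For ordinary codons, table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiator codon at position 0 as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGGCTACGT TTCTCGACAT CATCGCGGCC GGGTCGCTAT TGACAATCCA AGACGGTGGA"
print(translate(first_line))  # prints: MATFLDIIAAGSLLTIQDGG
```

The output matches the first 20 residues of the protein sequence listed above.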