Gene Caci_0100 details

Gene Information

Locus tag: Caci_0100
Symbol:
ID: 8331425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 103647
End bp: 105341
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 71%
IMG OID: 644953267
Product: Allophanate hydrolase subunit 2
Protein accession: YP_003110896
Protein GI: 256389332
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00370] conserved hypothetical protein TIGR00370
            [TIGR00724] biotin-dependent carboxylase uncharacterized domain
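
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(103647, 105341)
print(length, protein_length_aa(length))  # prints: 1695 564
```

Both values agree with the reported Gene Length (1695 bp) and Protein Length (564 aa).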


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.368046
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0250714
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTTACCCG CGGGCCCATT CGGGGTACTG CTCGAGTTCG ATTCTTTGGA CAAAGTTCAG 
CGCTGCTTCG CATGGGTCGA GCGCTGGCGC ACGGCGCGGC CGGGAGTGCT GGTCGATGTG
GTGCCCGCCG AGCAGACGCT GATGGCCATC GCGACGTTCG ACGACGCGGG GCAGCGCGGG
TTGCGGGAGT TCGTCGCGGC GGTCGGGGCT GTGGACTGGG CTGATGCCGG TACGGACGAT
GACGCGGCAC CTGCCGCAGA CGCCGAGACT GTCGAACTCC CGGTGACGTA CGACGGCCCA
GATCTGGACG ATGTGACCCG CCTGACCGGG CTTGCCGCAA ACGAGATCGT TTCGTTGCAC
ACGGAAGCCG AATTCACCGT CGCGTTCACA GGTTTCGCGC CGGGCTTCGC GTACCTGACC
GGCTTGCCGG ACGCGCTACG CGTGCCCCGC CGCGATGCGC CGCGCACCCG CGTGCCGACC
GGAGCGGTGG CATTGGGTGG GCCGTACAGC GGCGTCTATC CGCGCGAATC CCCGGGCGGC
TGGCAGCTGA TCGGGCAACT GGCGCCATGG GCCCCGACAC TGTGGGACGA GACGCGGGAA
TCGCCGGCCC TGCTGCGGCC AGGCACACGG GTCCGGTTCG TCGACGCAGC CGGGCAAAGC
GCTCCGGACA GCAGCGCTCC CGCCAAGCTG ACGCAGGCCA CCGCGGCAGC CACGGAACCC
GGCCTGCGGG TGGTCCGCGC CGGACCACTG ACGACCGTGC AGGACCTCGG CCGACCGGGC
TTCGCGCACC TCGGCGTGCC ACGCTCAGGA GCGGTCGACC GGACATCGCT GAAGCTCGCC
AACCGCTTGG TGGGCAACGA AGAGGGCGCC GCAGCCCTCG AGCTGACACT TGGCGGCGGA
GCGGTGCAGC TCGAAGTCGG ACGCTGGGTC GCGGTTACCG GGGCACGGTG CGAGATCACG
ATGACGGTGC CGGCACCCGA GTCGGCGCTT GAGCCTGCCG CCGCTCGCGA GGTCGGCCCT
GATCGAAAGT CCGCACCCGC AGCCGGGCCC TCCCACCGGC CGATTGTTAT GCGAACTGTG
CCAGGCACTG CTTTTTATGC GCCTGAGGGC GCCGTGGTTG AGGTGGGGCC GGCGGTGGCG
GGGGTGCGCG CTTACCTTGC CATGGCCGGT GGGGTGGGGG GTTTTGAGGT GTTGGGGAGT
CGGTCCGTTG ATCTCCTTTC GGGGGTTGGG GGGCGGGCTC TGGGGGGCGG GGCGCGGGTC
GGTGTGGGCG TGCCGGTGGG GTTGCCGCCG GATATCCCGG GGTTGGGGAT GGCACCGGTG
CGGGATGTGG GGGATCCGGT GGTGGTGCGG TTGGTTTTGG GGCCCCGGAG TGAGTGGTTC
ACGGATGACG CGGTGCGGGA TCTGGTTGCG GCGCGGTGGA CGGTGGGAGT GGAGTCGAAT
CGGACCGCGG TGCGGTTGGA CGGGCCTTCG TTGGCGCGGG CGCGTGATGG GGAGCCCGCC
AGCGAGCCTT TGGTCGTGGG GGCTGTGCAG GTGCCTCGGG ATGGGCGGCC GCTGCTGTTC
TTGGCCGACC ATCCGGTGAC CGGGGGGTAT CCGGTGATCG CGTGTGCCCA TCCGGCGGAT
GTGGATGCCG CGGGGCAGGC GCGACCGGGG ACGGGGATCC GGTTCCGGAT GGTCGGTGGG
CTGGCTTCGC GGTGA
 
Protein sequence
MLPAGPFGVL LEFDSLDKVQ RCFAWVERWR TARPGVLVDV VPAEQTLMAI ATFDDAGQRG 
LREFVAAVGA VDWADAGTDD DAAPAADAET VELPVTYDGP DLDDVTRLTG LAANEIVSLH
TEAEFTVAFT GFAPGFAYLT GLPDALRVPR RDAPRTRVPT GAVALGGPYS GVYPRESPGG
WQLIGQLAPW APTLWDETRE SPALLRPGTR VRFVDAAGQS APDSSAPAKL TQATAAATEP
GLRVVRAGPL TTVQDLGRPG FAHLGVPRSG AVDRTSLKLA NRLVGNEEGA AALELTLGGG
AVQLEVGRWV AVTGARCEIT MTVPAPESAL EPAAAREVGP DRKSAPAAGP SHRPIVMRTV
PGTAFYAPEG AVVEVGPAVA GVRAYLAMAG GVGGFEVLGS RSVDLLSGVG GRALGGGARV
GVGVPVGLPP DIPGLGMAPV RDVGDPVVVR LVLGPRSEWF TDDAVRDLVA ARWTVGVESN
RTAVRLDGPS LARARDGEPA SEPLVVGAVQ VPRDGRPLLF LADHPVTGGY PVIACAHPAD
VDAAGQARPG TGIRFRMVGG LASR
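
The gene and protein sequences above can be related to each other, and to the reported GC content, with a short script. The following is a rough sketch rather than a reference implementation: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares), and the set of start codons is a simplified assumption covering only ATG, GTG and TTG. A start codon in the first position is read as methionine, which is why the leading GTG corresponds to the initial M of the protein.

```python
# Standard codon assignments, listed in the usual T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a simplified subset of the start codons used with table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


first_blocks = "GTGTTACCCG CGGGCCCATT CGGGGTACTG"
print(translate(first_blocks))  # prints: MLPAGPFGVL
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` is one way to check the listed protein sequence and the reported GC content.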