Gene Caci_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0251 
Symbol 
ID8331578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp281646 
End bp283079 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content70% 
IMG OID644953418 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_003111045 
Protein GI256389481 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.421015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTCCT CCGAGAACCC GCAGACCGCG ACCACGATCC CGGCCACTAC TCCCTCCCCG 
AACGGCGGCG CCGCCAGCGC CGGCGCCGAC CGGCGCGCGT TCTGGTGCGA ACTCGCCTGG
ATCAACGGCG AGATCAAGAG CAAGGTCCTC ATCGAGGTCG CCGGCGGGCA GATCTCCGCG
GTCACCTGCG GCGTGAAACC CCGCCCCCAG AACGCCGAGA AACTGACCGG CCTGACCATC
CCGGGCCTGG CGAACGTCCA CTCCCACGCC TTCCACCGAG CCCTGCGCGG CCGCACCCAG
ATCGAGTCCG GCACCTTCTG GACCTGGCGC GAACGCATGT ACGCCGCCGC CGCGCACCTG
GACCCGGACT CCTACCGCGA ACTGGCCACC GCGGTCTTCG CGGAGATGGC ACTGGCCGGC
GTCACAGCCG TCGGGGAGTT CCACTACGTA CACCACTCGC CCAAGGGCGG CCTGTATCAG
GACCCGAACG CCATGGGCCA CGCCCTGACC GAGGCCGCCG AGGCCGCGGG CATCCGCATC
ACCCTGCTGG ACACGTGCTA CCTGTCCGGC GGCTTCGACA ACGAACTGAA CGACGTCCAG
CGGCGCTTCT CCGACGGCGA CGCCGGCCGC TGGGCCGAAC GCGTCGAGGC ACTCCGCAAG
GCTTATGCGG GCTCTGACAC GGTACGCATC GGCGCGGCGG TGCACAGCGT CCGCGCCGTC
CCCGTCGATC AGCTCTCCCC GGTGGTGGCC TTCGCGGCGG AGAACGAAAT GCCCCTGCAC
GTCCACCTGT CCGAGCAGCG CGCGGAGAAC GACGCCTGCC TGGCCCGCCA CCACAAGACC
CCCACCGAAC TCCTGCACGC CCACGGCGCC CTCGGCCCGC GCACCACCGC CGTCCACGCC
ACACACCTGT CCCAAATGGA CATCGACCTC CTCGGCACCT CCGCCACGGC GGTCTGCATG
TGCCCCACCA CCGAACGCGA CCTGGCCGAC GGCATCGGCC CAGCCCACGC AGTCCACCTC
GCAGGCTCCC CCGTCAACCT GGGCACCGAC TCCCACGCCA TGATCGACCT CTTCGAAGAA
GCCCGAGCCG TAGAACTCGA CGAACGCCTC CGCACCGAAC GCCGAGGCCA CTGGCTAGCC
TCCGAACTCC TCCAAGCCGC CACCACCGAC GGCCACGCCT CCCTAGGCTG GCCCACCACA
GGCCGCCTGC AACCCGGCAC CCCCGCCGAC TTCACCACCA TCGCCCTCGA CACCGTCCGC
CTAGCCGGCG TCCAACCCGC CCACGCCGCC GAATCAGTGA TCTTCGCCGC CACCGCCGCC
GACGTCCGCC ACGTCGTGGT CGCCGGTAAG TTCACGGTCC GCGACCATCA GCACATGCTG
GTCGACGACG TGCCGGGACG CCTGGCCGCG ACGATCGGGG CGATCTTCAA GTAG
 
Protein sequence
MTSSENPQTA TTIPATTPSP NGGAASAGAD RRAFWCELAW INGEIKSKVL IEVAGGQISA 
VTCGVKPRPQ NAEKLTGLTI PGLANVHSHA FHRALRGRTQ IESGTFWTWR ERMYAAAAHL
DPDSYRELAT AVFAEMALAG VTAVGEFHYV HHSPKGGLYQ DPNAMGHALT EAAEAAGIRI
TLLDTCYLSG GFDNELNDVQ RRFSDGDAGR WAERVEALRK AYAGSDTVRI GAAVHSVRAV
PVDQLSPVVA FAAENEMPLH VHLSEQRAEN DACLARHHKT PTELLHAHGA LGPRTTAVHA
THLSQMDIDL LGTSATAVCM CPTTERDLAD GIGPAHAVHL AGSPVNLGTD SHAMIDLFEE
ARAVELDERL RTERRGHWLA SELLQAATTD GHASLGWPTT GRLQPGTPAD FTTIALDTVR
LAGVQPAHAA ESVIFAATAA DVRHVVVAGK FTVRDHQHML VDDVPGRLAA TIGAIFK