Gene Caci_4725 details

Gene Information

Locus tag: Caci_4725
Symbol:
ID: 8336079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5389661
End bp: 5390701
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 71%
IMG OID: 644957825
Product: allantoicase
Protein accession: YP_003115427
Protein GI: 256393863
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG4266] Allantoicase
TIGRFAM ID: [TIGR02961] allantoicase
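
The start, end, gene length and protein length listed above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5389661, 5390701)
print(length)                  # 1041
print(protein_length(length))  # 346
```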


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0172758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA CCCCGTCCTT CACCCAGCTG ACCGACCTCG CCGCCCGGGA CGTCGGAGGC 
GCCGTGGTCT GGGCCAACGA CGAGTTCTTC GGCGAGAAGG AATCGCTGAT CCGCCCCGAA
CCGCCGACTT TCTCACCCGC CACCTTCGGC CACAAAGGCC AGGTCGTCGA CGGCTGGGAG
ACCCGCCGCA GACGCCCCGG CGGACCCGGA GTCGAGGGCA CGGAGCACGA CTCGGCGATC
GTCCGGCTCG GCCTTCCTGG CGTCATCCGC GGCGTCACCA TCGACACCGC CTTCTTCCTC
GGCAACTACC CGCCCCACGC CCGCGTCGAA GCAGCGAGCG TCCCCGGCTT CCCGACTCCG
GCGGACCTGC TGGCCGCCGA GTGGACCGAG ATCGTCCCGA CCAGCCCGCT GTCCGGCGGC
TCCGAGCAGC ACTTCGAGGC GGAGATCACC GGCCGCCGCT TCACCCACGT CCGCCTCGCC
ATGATCCCCG ACGGCGGCAT AGCCCGCTTC CGCGTCTACG GCGAAGCCGT CCCCGACCCC
GTCTTCCTCG CCGGTGTCCC CGTCGATCTC GCCGCCCTGA CCAACGGCGC GCGCATCGTG
GCCGCCTCCA ACATGTTCTT CTCCGCGCCC GAGAACCTGA TCAAACCGGC CGAGTCCCGC
GTCATGGGCG AAGGCTGGGA GACCGCGCGC CGCCGCGACG ACGCCGGCGA CTGGATAGAG
GTACGCCTGG TCGCGCAAGG CGTCCCGGCC GTCATCGAGA TCGATACCGC CAACTACAAG
GGCAACGCCC CAGATCACAT CGTCCTGCTC GGTGCGGATC GCCCAGGCCA GGAAGCGGGC
TCGAACTGGT TCGAGGTCAT CGCGCAGACC CGCATGCTCC CCGACTACAA GCACCGCTTC
CGGCTCGAAG GCGCGCGCCC CGTCACGCAC CTGCGCCTGG AAGTACGTCC CGACGGGGGA
GTGGCACGCC TGCGCGCCTT CGGCAGCCTC ACCGACGCCG GCCTGACCGC CGTCCGCACT
CGCTGGGCGG AGCACGCATA G
 
Protein sequence
MSDTPSFTQL TDLAARDVGG AVVWANDEFF GEKESLIRPE PPTFSPATFG HKGQVVDGWE 
TRRRRPGGPG VEGTEHDSAI VRLGLPGVIR GVTIDTAFFL GNYPPHARVE AASVPGFPTP
ADLLAAEWTE IVPTSPLSGG SEQHFEAEIT GRRFTHVRLA MIPDGGIARF RVYGEAVPDP
VFLAGVPVDL AALTNGARIV AASNMFFSAP ENLIKPAESR VMGEGWETAR RRDDAGDWIE
VRLVAQGVPA VIEIDTANYK GNAPDHIVLL GADRPGQEAG SNWFEVIAQT RMLPDYKHRF
RLEGARPVTH LRLEVRPDGG VARLRAFGSL TDAGLTAVRT RWAEHA
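
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch in plain Python, assuming the sequence is given on the coding strand and in frame. It relies on translation table 11 using the same amino acid assignments as the standard genetic code (the two tables differ in permitted start codons, which this sketch ignores). The helper names are illustrative.

```python
from itertools import product

# Amino acids in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spacing used in the listing and normalise case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence listed above.
first = "ATGAGCGACA CCCCGTCCTT CACCCAGCTG ACCGACCTCG CCGCCCGGGA CGTCGGAGGC"
last = "CGCTGGGCGG AGCACGCATA G"
print(translate(first))  # MSDTPSFTQLTDLAARDVGG
print(translate(last))   # RWAEHA
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content reported in the record.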