Gene Caul_4697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4697 
Symbol 
ID5902159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5080099 
End bp5081109 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content70% 
IMG OID641565216 
Productagmatine deiminase 
Protein accessionYP_001686315 
Protein GI167648652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.399563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGA CCGTCCCCGC CGAGTGGGCT CCGCACCGCG CCATGTGGGT GGGCTTCCCC 
AGCCACCCCG AGCTTTGGCG GGAAGACCTC GACCAGGCGC AGCAGGAAGT CGCCGACCTG
GCCCGAGCCC TGGCCGGTCC GGGCGGCGAG CGCGTGCGGC TGATGGTGGT CGGCGACGAG
GCCGAGGCCG CCGCCCGAGC CCTGCTGGGC GACACCACGG TCGAGATCGT CCGCGGTCAG
TTCGGCGACA TCTGGCTGCG CGACACCGGC CCGATCTTCA CCGAGACGGC GAATGGCGCG
GTCGCCGCCG GCTTCCTGTT CAACGGCTGG GGCGGCAAGT ATCAGATGGA GGGCGACGAG
ATCGTCGCCG AGCAGATCGC CGCCGCCAGC GGCGCGCCCT TGGCCCGCAA CGGCTTTGTC
CTCGAGGGCG GATCCTTGGA CCATGACGGC TCGGGCACGA TCCTGACCAC GCGCCAGTGC
CTGCTGAACG CCAACCGCAA CACGAACTGG GACGAAGCCA CGGCGAGCGC CGCCCTCGCC
GACGCCCTGG GGGCCAAGAA GCTGCTGTGG CTGGGCGACG GCCTGCTCAA CGATCACACC
GACGGCCATG TCGACAATCT GGCCCGCTTC GTCGCGCCGG GCGTGGTGGC CTGTCCGATG
GCCTTCGGGA CCGACGACCC GAACGCCGAG GTGTATGCGG CGACCGCCCG CGCCCTCGCG
GCCATGACCG ACAGCCGGGG CTCGCCCCTG CAGGTGGTGC GTATTCCCTC GCCGGGCCGC
ATCCTGGACG AGGACGGCGA GATCGTTCCG GCCTCGCACA TGAACTTCCT GATCGCCAAC
GAGGCGGTGA TCGTGCCGAT CTATGCCGAG GAGTCGGGCG CCTTCGCCGT CGAGGTGATC
AGCGGCCTGT TCCCCGAGCG CGAGGTGATC GGCCTGCCCT CGACCGCCAT CCTGACCGGC
GGCGGTTCAT TCCACTGCAT CTCGCAACAA GAGCCGGAGG TCCGAGCATG A
 
Protein sequence
MTPTVPAEWA PHRAMWVGFP SHPELWREDL DQAQQEVADL ARALAGPGGE RVRLMVVGDE 
AEAAARALLG DTTVEIVRGQ FGDIWLRDTG PIFTETANGA VAAGFLFNGW GGKYQMEGDE
IVAEQIAAAS GAPLARNGFV LEGGSLDHDG SGTILTTRQC LLNANRNTNW DEATASAALA
DALGAKKLLW LGDGLLNDHT DGHVDNLARF VAPGVVACPM AFGTDDPNAE VYAATARALA
AMTDSRGSPL QVVRIPSPGR ILDEDGEIVP ASHMNFLIAN EAVIVPIYAE ESGAFAVEVI
SGLFPEREVI GLPSTAILTG GGSFHCISQQ EPEVRA