Gene Csal_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1189 
Symbol 
ID4027000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1364048 
End bp1365169 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content69% 
IMG OID637966366 
Productpeptidyl-arginine deiminase 
Protein accessionYP_573244 
Protein GI92113316 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID[TIGR03380] agmatine deiminase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.636661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGCT TGCTGACCTC CCTGCCCAAG GCCGACGGTT TCCGGCTGCC CGGCGAATTC 
GAACCCAAGG CTGGCTGCTG GCTGGGCTGG CCGGAGCGCC CGGACGTGTG GCGCAACGGC
GGCAAGCCGG CACAGCGGGT ATGGGTCGAG ATTGCGTCCG CCATCGCCGA GAGCGAGCCG
GTCACCGTCT GCGCTTCCGC CGCCCAGTTC GCCAACGCCC GGCGCCTGCT GCCGCCCTCG
GTGCGGGTGG TGGAGATGAC CTGCAACGAC AGCTGGTTCC GTGACAGCGG GCCGGCCTTC
GTGGTCAACG ACGCGACCGG CGAGGTGCGC GGCGTGGATC TCGAGTTCAA TGCCTACGGC
GGCCTCGACG GCGGCCTGTA CTTCCCCTGG GATCAGGACG ATCTGATCGC CCGGAAGGTA
CTCGAGATCG AGGGGCTGGA CCGCTACCGG GCGCCGTTCA TCGCCGAGAT GGGCGGTATC
CAGTCCGACG GCCAGGGCAC CCTGCTGACC ACCGAGCAGT GCCTGCTCAA CCGCAACCGC
AATGGCCACC TGGGCAAGCA AGCGGTCACC CGGCATCTCG AGGACTACCT CGGCGCCGAG
CGGGTCATCT GGCTGCCGCG GGGCTGCAAG TTCGACGAGA CCGACGGCCA TATCGACGAT
CTGGCCTACT TCGTGCGCCC CGGCGAGGTG CTGCTGCAGT GGACCGACGA CCGCGACGAC
CCGCAGTGGG AGATCTGCCA GGAGGCCTAT GACGTGCTGA AAGGCACCCG CGACGCCCGA
GGGCGCGAGC TCACCGTGCA CAAGATGCCC CAGCCGCGGG CCTTGGAGTG GACGGCCGAG
GAGGCCGCAG GACTCGATCA GGTCGACGGC ACCCATCTGC GTCAGGCCGG CACCCGTATC
TGTGCCTCCT ACATCAACTA TTACGCCGGC AACTCGGTGA TCGTGGTACC GCTGTTCGGC
GATCCCAGCG ATCGCGTCGC CCAGGCCACC CTGGCCGAAC TGTTCCCCCG TCATCGTATC
GTCGGCATCG AGAGCTCCCG CGAGATCCTG CTCGGCGGGG GCAACGTTGC CTGTATCACC
ATGCCCCAGT ACGCCGCCCC GACGCCGCCG ACCGGCGGCT GA
 
Protein sequence
MARLLTSLPK ADGFRLPGEF EPKAGCWLGW PERPDVWRNG GKPAQRVWVE IASAIAESEP 
VTVCASAAQF ANARRLLPPS VRVVEMTCND SWFRDSGPAF VVNDATGEVR GVDLEFNAYG
GLDGGLYFPW DQDDLIARKV LEIEGLDRYR APFIAEMGGI QSDGQGTLLT TEQCLLNRNR
NGHLGKQAVT RHLEDYLGAE RVIWLPRGCK FDETDGHIDD LAYFVRPGEV LLQWTDDRDD
PQWEICQEAY DVLKGTRDAR GRELTVHKMP QPRALEWTAE EAAGLDQVDG THLRQAGTRI
CASYINYYAG NSVIVVPLFG DPSDRVAQAT LAELFPRHRI VGIESSREIL LGGGNVACIT
MPQYAAPTPP TGG