Gene Csal_1581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1581 
Symbol 
ID4027600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1799897 
End bp1800964 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content66% 
IMG OID637966770 
Productpeptidyl-arginine deiminase 
Protein accessionYP_573633 
Protein GI92113705 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCCAC GCCTGTTCCC CGAATGGCAT CCCCAGGACG CCATTCAGCT CACCTGGCCC 
ACCACGGAAA GCGACTGGGA GCCTCTGCTC GAACGCATCG AAGCCACCAT GGAAGCCATC
GTGGTCGCCA TCACCCGCTT CCAGCCGGTG CTGATCGTCG TCGCCGATGC ACCCACGCGG
CAACGTCTCG ACACGCGTTT CATGCAACTG GGCATTCACC CGAAGCAGTG GCGACTCATC
GTCGCCCCCG CCGACGACAC CTGGACCCGC GATCACGGCC CCATCGCCGT GGAGCGGCAA
TCGGAGGTGG TGCTGCTGGA TTACCGCTTC ACCGGCTGGG GCGGCAAGTT TCCCGCCCAG
CGTGATGACG CCCTGACCGC GGCCCTGGCG GACATCGGCA TTTATGCCGC GCCCTGCGAA
CAACGCGACC TGGTGCTGGA AGGCGGTGCC ATCGACAGCG ATGGCGAAGG GACTCTGCTG
GTCACCGAGG CGTGTCTGCT CAATCCCAAC CGCAACCCGG ACTTGACCCG CGAGGACATC
GAAGCGCGCT TGCGCGACGA CCTGGGTGTC GAACGCTTCC TGTGGCTCAC GCAGGGCCAC
CTCGAGGGCG ATGACACCGA CAGCCACATC GATACGCTGG CGCGTTTCTG CGACGCACAC
ACCATCGCCT ATGTCCGCTG CGAGGATCCG GACGATCCGC ACTACCCGGC CCTCGCCCAG
ATGGAAAGCG AGCTCAAGGC CATGCGTCGC GCCGACGGCA GCGCCTATCG CCTGATTCCG
CTGCCCCTGC CGCAGCCGTG TCACGACCCG GACGATGGCC ACCGTCTGCC GGCAACGTAT
GCCAACTTCC TGATCATCAA CGGCGCGGTG CTGGTGCCCA CCTACGCCGA CGCCGCCGAC
GGCGTGGCCC TGACGGCACT GGCCAGTGCC TTTCCGGGAC GCAGCATCAT CCCCATCGAC
TGCCGCACCG TCATTCGCCA ACATGGCAGC CTGCACTGTC TGACCATGCA GCTGCCGCGC
GGCGCACTCT TCACGCCGTC GAACGGCGAC GTCACTTCGG AGGCCTGA
 
Protein sequence
MLPRLFPEWH PQDAIQLTWP TTESDWEPLL ERIEATMEAI VVAITRFQPV LIVVADAPTR 
QRLDTRFMQL GIHPKQWRLI VAPADDTWTR DHGPIAVERQ SEVVLLDYRF TGWGGKFPAQ
RDDALTAALA DIGIYAAPCE QRDLVLEGGA IDSDGEGTLL VTEACLLNPN RNPDLTREDI
EARLRDDLGV ERFLWLTQGH LEGDDTDSHI DTLARFCDAH TIAYVRCEDP DDPHYPALAQ
MESELKAMRR ADGSAYRLIP LPLPQPCHDP DDGHRLPATY ANFLIINGAV LVPTYADAAD
GVALTALASA FPGRSIIPID CRTVIRQHGS LHCLTMQLPR GALFTPSNGD VTSEA