Gene Csal_2804 details

Gene Information

Locus tag: Csal_2804
Symbol:
ID: 4028489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 3132885
End bp: 3134228
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 68%
IMG OID: 637968011
Product: succinylarginine dihydrolase
Protein accession: YP_574849
Protein GI: 92114921
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3724] Succinylarginine dihydrolase
TIGRFAM ID: [TIGR03241] succinylarginine dihydrolase
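
The reported gene and protein lengths are consistent with the start and end coordinates. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(3132885, 3134228)
print(length)                  # 1344
print(protein_length(length))  # 447
```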


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGG ACGTGCGCGA GGTCAATTTC GATGGCCTGG TGGGGCCCAC CCACAACTAT 
GCCGGGCTGG CGCACGGCAA TGTCGCCTCG ATGCGCCATG GCGGGCTGAC CGCCAATCCG
CGCGAGGCGG CCCTGCAAGG GCTGGCCAAG ATGAAGTCGT TGATGGAGGC AGGGTTCGCT
CAGGGCGTGC TGCCGCCCCA GCAGCGCCCC GATCTGGGGG CGTTGCGCGA CCTCGGCTTC
ACCGGCGACG ACGCCGGTGT GCTCGCCCAG GCGGCACGCC AGGCGCCGCA ACTGCTGCGC
GCGGTATGTT CGGCCTCGTC GATGTGGACG GCGAATGCCG CCACCGTCAC ACCCAGCCTG
GATGCGCCGG ACGGGCGCGT CCACTTCACC GCCGCCAATC TGCAGTCGAG CTTTCACCGC
TATCTCGAGC CGCGTACCAC GGCGCGGGTG CTGGCGGCGA TGTTCCACGA CCCGGCGCAT
TTCGCGCATC ATCCGGTGCT GCCGGCCACG CCGACATTCT CCGACGAGGG GGCCGCCAAC
CACACCCGCT TGTGCGGCGA TCACGACGAA CCGGGCGTGC ACCTCTACGT ATACGGACGC
CAGGCCTTCG GTGGCGAGCA CGGGCCCAAG CGGTATCCGG CCCGCCAGAC GCTGGAGGCC
AGCCAGGCGA TCGCCCGGCA GCATGGCCTG GACGACACCC GCACGGTCTT CGCCCAGCAG
CATCCTGATG CCATCGACGC CGGCGTTTTC CACAACGATG TCATCGCCGT CGGCAACGGC
CCGGTGCTGC TCTATCACGA GATGGCGTTC CGCGACGAGA CGGCCACGCT GGAGGCGCTG
CGCGCCCGCA TGTCGACGCC GTTGATTCCG GTGCGGGTGC CGAGCGAGGC GATCAGCCTG
GAGGATGCGG TAGCGACCTA TCTCTTCAAC TCGCAGTTGC TCTCCAACCC TGACGGCAGC
ATGACGCTGG TCGTGCCCGG CGAGTGCCAG GAAAACGAGA CTGTCTGGCG CACGATCCAG
GACTTGTTGC TGGGGGGGAA CAACCCGATC AGCGAGGTGC TGGTCAAGGA CGTCAAGCAG
AGCATGCGCA ACGGTGGGGG GCCGGCGTGC CTGCGCCTGC GGGTGGCGCT GGCGGCACGC
GAGCGCCAGG CGCTGACCGG CCGTGTACTG CTCGACGAGG CGTTGCACGA CGACCTGGCG
GCGTGGGTCG AGCGGCATTA CCGCGATCGG CTGGCGCCTG AGGATCTCGC CGATCCGCTG
CTTGTCCGGG AATCGCTCAC CGCTCTCGAC GAGTTGACGC AGCTGCTGGG CATCGGCGCG
GTGTATCCGT TTCAGTTGAA CTGA
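
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before computing statistics such as length or GC content. A minimal sketch, using only the first row of the sequence as sample input; the helper names are hypothetical, and GC content is assumed to mean the fraction of G and C bases over the whole sequence:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed (6 blocks of 10 bases).
first_row = "ATGAGCCAGG ACGTGCGCGA GGTCAATTTC GATGGCCTGG TGGGGCCCAC CCACAACTAT"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC content of this row only
```

Pasting the full gene sequence into `clean_sequence` in place of `first_row` would give the figures for the whole gene.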
 
Protein sequence
MSQDVREVNF DGLVGPTHNY AGLAHGNVAS MRHGGLTANP REAALQGLAK MKSLMEAGFA 
QGVLPPQQRP DLGALRDLGF TGDDAGVLAQ AARQAPQLLR AVCSASSMWT ANAATVTPSL
DAPDGRVHFT AANLQSSFHR YLEPRTTARV LAAMFHDPAH FAHHPVLPAT PTFSDEGAAN
HTRLCGDHDE PGVHLYVYGR QAFGGEHGPK RYPARQTLEA SQAIARQHGL DDTRTVFAQQ
HPDAIDAGVF HNDVIAVGNG PVLLYHEMAF RDETATLEAL RARMSTPLIP VRVPSEAISL
EDAVATYLFN SQLLSNPDGS MTLVVPGECQ ENETVWRTIQ DLLLGGNNPI SEVLVKDVKQ
SMRNGGGPAC LRLRVALAAR ERQALTGRVL LDEALHDDLA AWVERHYRDR LAPEDLADPL
LVRESLTALD ELTQLLGIGA VYPFQLN
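
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the standard codon table, whose amino acid assignments translation table 11 shares, and translates the first and last stretches of the gene. It is an illustration under stated assumptions, not the database's own method: it does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position
# varies fastest). Table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 24 bases of the gene sequence, as displayed.
print(translate("ATGAGCCAGG ACGTGCGCGA GGTCAATTTC"))  # MSQDVREVNF
print(translate("GTGTATCCGT TTCAGTTGAA CTGA"))        # VYPFQLN
```

Both outputs match the start and end of the protein sequence listed above.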