Gene Csal_1464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1464 
Symbol 
ID4029178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1657229 
End bp1659310 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content67% 
IMG OID637966647 
Productoligopeptidase B 
Protein accessionYP_573516 
Protein GI92113588 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAC AGCCGCACAT CCCTGCGCTT CACGACGCGA CACAACTCTC GCCGGTGGCA 
CGCTTCCAAC GCAAGGAAGA TCCGGACTGG CACTGGCTGG AAAACCGTGA CGACCCCGAG
GTCGTGGCCT TTCTCGAACA GGCCAACCAG GAATTTTCGC AATGGTCCGC CCCCCTCGCA
CCGCTGGTCG AGGCGCTCTA TTCCGGCCAT CTCGCGCGAC GCGAACTGGC CGTCGAAGGA
CTCGGCACGC CGCTCGACCA TTACACCTAC TGGAGCGAGA CGGCCCGCGA CGCCGACTAC
CCCGTCTGGT GGCGTCACCC CAATCAGGTA CCGTCACGCC GCGAATGCGT GCTCGACCTC
CAGGCACGCG CCAGCGAGCA GACCTTCATG GAACTGGGCG ATATCGCCAT CGCCCCCGAT
GAACACTGGC TCGCCTGGAC CGAGGACACC AGTGGCGACG AGTATTTCAC GCTCTTTCAT
CGTGCGCTGC CCGATGGCAC GCCACGGCGC CTGCTGACCG ACATCGGCCC CGAGCTGTGC
TGCGCCGAGG ACAATCGCAC CCTGTTCTTC ACCCGCTACG ATGATACCCA ACGCCCTGAC
AGCGTCTGGC GCCTGGACAT CGAAAGCGGC GAGATCGCGT GCGTCTTCCG CGAGACGGAC
CCCGAGTTCT GGGTCGGCGT GGGCAAGACG CGTTCGCGTG AATGGCTGGT GCTGGAAACC
GCCTCCAAGG ACACCTCGGA ATGCCACCTG GTGCCTGCCG CGCACCCGCA TGTGCCACCG
CGCTGCGTTC GCGAGCGCAT CAAGGGCATC GAATACGCCC TCGAGCACCG TCCCGGGCAC
TTTTACATCC TGCACAACCA GGACGCACCG CACTTCCGGC TCGATGTCGC CGACGAAACG
GCGCCCGATA CCTGGCATCC GCTGGTGGCG CACGATGCGC AGTTGACTCT GGAAAGCATC
GACGCCTTCG CCTGGGGACT GGTCATCACC GAGCGCGATC ATCGCGAAGC CCAGGTCCAT
CTCAGCGTGC TCGACCTCGA CGCGCCGGCA CCGACCCGAC GCCGACTGCC CCTGCCCGAG
GCCCCGTGCA GCCTGATGCT GGGCGACACA CCGGATTTTC ATACCCGGCG GCTGCGGCTG
CACGAAGAGT CGTTCACCCT GCCGTCACGC TGGATCGAAC ACGATCTCGA CAACGACGCC
CGCCGACTGC TCAAGGTCCA ACCGGTCTAC GGCGACCTGC CTCCCGAGCG GCTGGTCTGC
CGGCGCGTGT GGGCAGAGGC GCACGACGGC GAGCGAATTC CGGTCTCCGT GGTGGCGCGA
GACGATCTCT GGGCACAGGG GCCCATGCCC ACGCTGCTTT ACGGCTATGG CGCCTACGGG
GAAGTGCTCG ATCCATGGTT CTCAGTGGCA CGCCTCGAGT TGCTGTCACG CGGCGTGGCC
TTCGCCGTCG CCCATGTTCG CGGCGGCGGC GATCGCGGCG AACCCTGGTA TCTCGCCGGC
AAGCTGGAGC ACAAGGAAAA CAGCTTCCGC GATTTCCTGG CCGCACGGCA CGCGCTGGTC
GAACACGGCA TCGCGGATGG CGAACGCATC GCCGCCTACG GCGCCAGCGC CGGCGGCCTG
CTGGTCAGCG CCAGCCTCAA TCTCGACCCC ACGGCGTTCT GCGCCGCCGT GCTCGACGTG
CCCTTCGTCG ACGTGCTGCG CACCATGGAA AACCCGGACC TGCCCTTGAC CACGGCGGAG
TACAGCGAAT GGGGCAATCC CAGCGAGCCC GAGGCGCACC GGCGCATTCG CGATTACTCG
CCACTCGACA ACCTCGTCGC GCGGCCCTAC CCCACTCTCT TCCTGCAGGG CAGTTGGCAC
GACTCCCGCG TTCCCTACTG GGAGCCGGCC AAGCTGTACG CACGCTTGAC CGAGATCGTC
GCCCAGCTCC CCGCCGCCGA GCGCCGTCCG ATCATGCTAC GCACCGACAT GGCGGCCGGG
CACGGCGGCG CCTCGGGCCG ATTCAAGGCC TGGCACGACA ATGCCCGTCA GGATGCCTTC
ATCCTCTGGG CACTGGGCCT CGCCGAGACT GCCGCCCCCT GA
 
Protein sequence
MKAQPHIPAL HDATQLSPVA RFQRKEDPDW HWLENRDDPE VVAFLEQANQ EFSQWSAPLA 
PLVEALYSGH LARRELAVEG LGTPLDHYTY WSETARDADY PVWWRHPNQV PSRRECVLDL
QARASEQTFM ELGDIAIAPD EHWLAWTEDT SGDEYFTLFH RALPDGTPRR LLTDIGPELC
CAEDNRTLFF TRYDDTQRPD SVWRLDIESG EIACVFRETD PEFWVGVGKT RSREWLVLET
ASKDTSECHL VPAAHPHVPP RCVRERIKGI EYALEHRPGH FYILHNQDAP HFRLDVADET
APDTWHPLVA HDAQLTLESI DAFAWGLVIT ERDHREAQVH LSVLDLDAPA PTRRRLPLPE
APCSLMLGDT PDFHTRRLRL HEESFTLPSR WIEHDLDNDA RRLLKVQPVY GDLPPERLVC
RRVWAEAHDG ERIPVSVVAR DDLWAQGPMP TLLYGYGAYG EVLDPWFSVA RLELLSRGVA
FAVAHVRGGG DRGEPWYLAG KLEHKENSFR DFLAARHALV EHGIADGERI AAYGASAGGL
LVSASLNLDP TAFCAAVLDV PFVDVLRTME NPDLPLTTAE YSEWGNPSEP EAHRRIRDYS
PLDNLVARPY PTLFLQGSWH DSRVPYWEPA KLYARLTEIV AQLPAAERRP IMLRTDMAAG
HGGASGRFKA WHDNARQDAF ILWALGLAET AAP