Gene Csal_0323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0323 
Symbol 
ID4026638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp361873 
End bp363159 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content64% 
IMG OID637965472 
Productprotocatechuate 4,5-dioxygenase 
Protein accessionYP_572384 
Protein GI92112456 
COG category[S] Function unknown 
COG ID[COG3384] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02792] protocatechuate 4,5-dioxygenase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.306263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGTA TCATCGGCGG ACTCGCCGTT TCCCACACCC CGACCATCGG CTTCGCCGTC 
GACCATGAAA AACAGCAGGA GGCCGCCTGG GCGCCGATCT TCGAAAGCTT CGCGCCAATG
ACGCAGTGGC TGCAGGACAA GCGCCCCGAC GTGCTTTTCT ACATCTTCAA CGACCACGTG
ACGTCGTTTT TCTTCGACCA CTACTCTGCC TTCGCGCTGG GGGTCGACGA CCATTACGCG
GTCGCCGACG AAGGCGGCGG ACCGCGTGAC CTGCCCGCCA TCGGCGGACA CGCGGCGCTC
TCGCGGCATA TCGGCGAGAG CCTGATGGCC GACGAGTTCG ACATGGCGTT CTTTCAGGAC
AAGCCGCTCG ATCACGGGCT GTTCTCGCCG ATGTCGGCCC TGCTGCCGTT CGAGGAGGGC
TGGCCGGTCG AGGTCGTGCC CCTGCAGGTG GGCGTCCTGC AATTCCCGAT CCCCAGCGCG
GCGCGTTGCT ACAAACTGGG GCAGGCGCTG CGCCGTGCCA TCGAAAGCTA TCCGGAGGAT
CTCGACGTCG CCATCGTCGC TACCGGCGGC GTCTCGCACC AGGTACACGG CGAGCGCGCC
GGGTTCAACA ATCCAGAGTG GGATGCGCAA TTCCTCGATC TGCTGGTCGA CGATCCACAG
CAGCTCACCC AGATGACCCA GGCGGAGTTC GCTACCCTGG GCGGCCTGGA AGGCTCGGAA
GTGATCACCT ACCTGATCAT GCGCGGCGCG TTGTCCCACA CCGTGATCAA GCGCCACCAG
GATTATTACC TACCGTCGAT GACCGGCATC GCCACGTTGA TTCTCGAGAA TCAGGCGCGG
CCCAACCCCG TCGACCTGAG CGAGCGCTAC CGCCAGCACA GCCGCCATCA GCAGGAAGGC
ATCGAGGCAC TGGAAGGCAC TTACCCATTC ACCCTCGAGC GCAGCCGCAA AGGCTATCGC
ATCAACCGTT TCCTGCATCG CCTGATCGAG CCCGACTGGC GCGAGCGCTT TCTCGCCGAC
CCTGAAGCGT TGTTCGACGA GGCGGCTCTC AGCGAGGAGG AACGCCGCTT GATCCGCGAG
CGCGATTGGC GGGGCATGAT CCATTACGGC GTCATCTTCT TCCTGCTGGA GAAGCTCGGT
GCCGTCATCG GCACGTCCAA TCTCCATATC TACGCCGCCA TGCGCGGCGA GAGCCTGGAG
GAGTTCCAGA AGACCCGCAA TCAACAGGTC ACCTACTCGG TGGCCCGCCG CCAGGGTTCC
GGTGACGCAT CCTCTGCCGA GACTTGA
 
Protein sequence
MARIIGGLAV SHTPTIGFAV DHEKQQEAAW APIFESFAPM TQWLQDKRPD VLFYIFNDHV 
TSFFFDHYSA FALGVDDHYA VADEGGGPRD LPAIGGHAAL SRHIGESLMA DEFDMAFFQD
KPLDHGLFSP MSALLPFEEG WPVEVVPLQV GVLQFPIPSA ARCYKLGQAL RRAIESYPED
LDVAIVATGG VSHQVHGERA GFNNPEWDAQ FLDLLVDDPQ QLTQMTQAEF ATLGGLEGSE
VITYLIMRGA LSHTVIKRHQ DYYLPSMTGI ATLILENQAR PNPVDLSERY RQHSRHQQEG
IEALEGTYPF TLERSRKGYR INRFLHRLIE PDWRERFLAD PEALFDEAAL SEEERRLIRE
RDWRGMIHYG VIFFLLEKLG AVIGTSNLHI YAAMRGESLE EFQKTRNQQV TYSVARRQGS
GDASSAET