Gene Csal_0292 details


Gene Information

Locus tag: Csal_0292
Symbol:
ID: 4027062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 330650
End bp: 332497
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 62%
IMG OID: 637965442
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_572354
Protein GI: 92112426
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
        [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
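
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 330650
end_bp = 332497
protein_length_aa = 615

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp)        # 1848
print(expected_protein_aa)   # 615
```

Both results match the Gene Length and Protein Length fields of the record.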


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGCCA TTGCAACCGT ATGCTTGAGC GGCGATCTGC GTAGCAAGCT AGAGGCCATT 
GCCCGGGCAG GTTATAGCGG TGTGGAGATA TTCGAGAATG ATCTCCTGAC CTTCGACGGT
TCGCCGAGCG ACGTTCGCAG CCTCTGCGAG TCGTTGGGGT TGGCTATCCT CGCGTTTCAG
CCGTTCCGTG ACTTCGAGTC GATGCCCGAG CCTCAGCGGC GGCGCAATTT CGAGCGTGCC
GAACGCAAGT TCGACCTGAT GGAGGAACTG GGCACCGACT TCCTGCTGGT CTGCAGCAAC
GTCTCGCCGC AGGCGTTCGA CGACCTTGGT CGCGCTGCCG AGGATCTGCG CGAGCTGGCC
GAGCGCGCCG CGCGCCGCGG ACTCCGCATT GGCTTCGAGG CGCTCGCATG GGGGCGACAT
ATCAGTGATT ACCGGGACGC CTGGGATGTC GTAAAGCGCG CTGACCACCC GGCACTGGGC
ATCGTGCTGG ACAGTTTCCA CATTCTTGCG CGTGGCCACG AATTGGAGAC CATGGCCGAT
ATTCCCGCCG AGAAGATTGC TTTCGTGCAG ATCGCCGATG CGCCGCTGCT CGACATGGAT
GTGCTGCAGT GGAGCCGCCA TTTCCGCTGT TTCCCCGGCC AGGGGCGGTT ACCTCTGGCG
TCCTTCATGC AGGCGCTGGC GCGCACCGGC TATGCGGGAC CGCTGTCGTT GGAGATCTTC
AATGATGCCT TCCGCGCGGC GCCCACCGAA GCTACTGCGA TCGATGGACT GCGTTCGCTG
ATCTGGATCG AGGAACTCGC CGAGGGCGCG GCCTGGTCCG AGGCGTCGCC GCCCGCGGTG
GGCTACGATG GCGTCCATTT CATCGAATTT ACGCTCGACG AGGAGAGTGC CGCGCCGCTC
GGCGAGTTCT TCTCGGCGTT GGGCTTTCGC CATATCGGCC GCCATCGCTC GAAGAACGTC
GAGCTATGGC ATCAGGGCGA CATTCACCTG GTACTCAACT TCGAGACCGA TAGCTTCGCG
CATACCTTTC GGCTGCTGCA CGGTACGTCG GTTTGCGCGG TGGGAGTCAG GGTCAACGAT
CTCGATTCGG CGGTCACTCG AGCGGCGCAC TACAAGGCGC AATGGTTTCG GGGGCCAGTA
GGCGAAGGAG AGATGGAGAT CCCGGCGCTG CGCGGCATCG AGGGCAGCCT GGTCTATCTG
GTCGACGACG CCCAGGCACG CGAGATGCAG TGGAAGACCG ATTTCCACCT CTTCGAGGAT
GGCCAGGATG ATGATGCGGG ATTGATCAAC ATCGACCACA TCTCCTACGT ATTGCCACCA
ACCCAATTGC TGAGCTGGTT GCTGTTCCAC CGCACGGTGT TTGGCTTCGA CGCCGGCCCC
GAGCACGAGA TCGCCGATCC GCACGGCATG GTGGTCAGTC AGACCGTAAC CAGCCCTGAC
AACTCGGTTC GCATTCCGCT GACCGTGTCT TCGGCGCGCG AGACCTTGCC GGGTCGCTTC
CTGTCGGAGC ACCAGGGCGG CGTGCAGCAG ATCGCCTTCG CCAGTGGCGA CATCTTCGCC
ACCATTGATG CGATGCTTGC GAGAGGTTTG CCGATGCTGC GCATTCCCGC CAACTACTAC
GACGACCTGG CGGCCCGCTT TGACCTCGAT GATGCATTGC TCGAGGCGAT GCGCAGCCGC
AATATTCTCT TCGATCGCAA CGACGATGGT GACTTCTTCC ACGCCTATAC CGAAACCTTC
ATGGGCCGCT TCTTTTTCGA GGTGGTCGAG CGGCGTGGCA GCTATTCGCA GTTCGGTGCC
GTCAATGCGC CGATCCGCCT GGCGGCCCAG GCCGGCCAGC AACGTTGA
 
Protein sequence
MRAIATVCLS GDLRSKLEAI ARAGYSGVEI FENDLLTFDG SPSDVRSLCE SLGLAILAFQ 
PFRDFESMPE PQRRRNFERA ERKFDLMEEL GTDFLLVCSN VSPQAFDDLG RAAEDLRELA
ERAARRGLRI GFEALAWGRH ISDYRDAWDV VKRADHPALG IVLDSFHILA RGHELETMAD
IPAEKIAFVQ IADAPLLDMD VLQWSRHFRC FPGQGRLPLA SFMQALARTG YAGPLSLEIF
NDAFRAAPTE ATAIDGLRSL IWIEELAEGA AWSEASPPAV GYDGVHFIEF TLDEESAAPL
GEFFSALGFR HIGRHRSKNV ELWHQGDIHL VLNFETDSFA HTFRLLHGTS VCAVGVRVND
LDSAVTRAAH YKAQWFRGPV GEGEMEIPAL RGIEGSLVYL VDDAQAREMQ WKTDFHLFED
GQDDDAGLIN IDHISYVLPP TQLLSWLLFH RTVFGFDAGP EHEIADPHGM VVSQTVTSPD
NSVRIPLTVS SARETLPGRF LSEHQGGVQQ IAFASGDIFA TIDAMLARGL PMLRIPANYY
DDLAARFDLD DALLEAMRSR NILFDRNDDG DFFHAYTETF MGRFFFEVVE RRGSYSQFGA
VNAPIRLAAQ AGQQR
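
The sequences above are listed in blocks of ten characters, so they need to be joined before any further analysis. The following is a minimal sketch, not part of the original record, of one way to strip the block spacing and compute a GC fraction such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as a sample.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the block spaces and the line breaks.
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a small sample.
sample = "ATGCGAGCCA TTGCAACCGT ATGCTTGAGC GGCGATCTGC GTAGCAAGCT AGAGGCCATT"
seq = clean_sequence(sample)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence listing to the same two functions would give the value to compare with the GC content field.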