Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0292 |
Symbol | |
ID | 4027062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 330650 |
End bp | 332497 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637965442 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_572354 |
Protein GI | 92112426 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCCA TTGCAACCGT ATGCTTGAGC GGCGATCTGC GTAGCAAGCT AGAGGCCATT GCCCGGGCAG GTTATAGCGG TGTGGAGATA TTCGAGAATG ATCTCCTGAC CTTCGACGGT TCGCCGAGCG ACGTTCGCAG CCTCTGCGAG TCGTTGGGGT TGGCTATCCT CGCGTTTCAG CCGTTCCGTG ACTTCGAGTC GATGCCCGAG CCTCAGCGGC GGCGCAATTT CGAGCGTGCC GAACGCAAGT TCGACCTGAT GGAGGAACTG GGCACCGACT TCCTGCTGGT CTGCAGCAAC GTCTCGCCGC AGGCGTTCGA CGACCTTGGT CGCGCTGCCG AGGATCTGCG CGAGCTGGCC GAGCGCGCCG CGCGCCGCGG ACTCCGCATT GGCTTCGAGG CGCTCGCATG GGGGCGACAT ATCAGTGATT ACCGGGACGC CTGGGATGTC GTAAAGCGCG CTGACCACCC GGCACTGGGC ATCGTGCTGG ACAGTTTCCA CATTCTTGCG CGTGGCCACG AATTGGAGAC CATGGCCGAT ATTCCCGCCG AGAAGATTGC TTTCGTGCAG ATCGCCGATG CGCCGCTGCT CGACATGGAT GTGCTGCAGT GGAGCCGCCA TTTCCGCTGT TTCCCCGGCC AGGGGCGGTT ACCTCTGGCG TCCTTCATGC AGGCGCTGGC GCGCACCGGC TATGCGGGAC CGCTGTCGTT GGAGATCTTC AATGATGCCT TCCGCGCGGC GCCCACCGAA GCTACTGCGA TCGATGGACT GCGTTCGCTG ATCTGGATCG AGGAACTCGC CGAGGGCGCG GCCTGGTCCG AGGCGTCGCC GCCCGCGGTG GGCTACGATG GCGTCCATTT CATCGAATTT ACGCTCGACG AGGAGAGTGC CGCGCCGCTC GGCGAGTTCT TCTCGGCGTT GGGCTTTCGC CATATCGGCC GCCATCGCTC GAAGAACGTC GAGCTATGGC ATCAGGGCGA CATTCACCTG GTACTCAACT TCGAGACCGA TAGCTTCGCG CATACCTTTC GGCTGCTGCA CGGTACGTCG GTTTGCGCGG TGGGAGTCAG GGTCAACGAT CTCGATTCGG CGGTCACTCG AGCGGCGCAC TACAAGGCGC AATGGTTTCG GGGGCCAGTA GGCGAAGGAG AGATGGAGAT CCCGGCGCTG CGCGGCATCG AGGGCAGCCT GGTCTATCTG GTCGACGACG CCCAGGCACG CGAGATGCAG TGGAAGACCG ATTTCCACCT CTTCGAGGAT GGCCAGGATG ATGATGCGGG ATTGATCAAC ATCGACCACA TCTCCTACGT ATTGCCACCA ACCCAATTGC TGAGCTGGTT GCTGTTCCAC CGCACGGTGT TTGGCTTCGA CGCCGGCCCC GAGCACGAGA TCGCCGATCC GCACGGCATG GTGGTCAGTC AGACCGTAAC CAGCCCTGAC AACTCGGTTC GCATTCCGCT GACCGTGTCT TCGGCGCGCG AGACCTTGCC GGGTCGCTTC CTGTCGGAGC ACCAGGGCGG CGTGCAGCAG ATCGCCTTCG CCAGTGGCGA CATCTTCGCC ACCATTGATG CGATGCTTGC GAGAGGTTTG CCGATGCTGC GCATTCCCGC CAACTACTAC GACGACCTGG CGGCCCGCTT TGACCTCGAT GATGCATTGC TCGAGGCGAT GCGCAGCCGC AATATTCTCT TCGATCGCAA CGACGATGGT GACTTCTTCC ACGCCTATAC CGAAACCTTC ATGGGCCGCT TCTTTTTCGA GGTGGTCGAG CGGCGTGGCA GCTATTCGCA GTTCGGTGCC GTCAATGCGC CGATCCGCCT GGCGGCCCAG GCCGGCCAGC AACGTTGA
|
Protein sequence | MRAIATVCLS GDLRSKLEAI ARAGYSGVEI FENDLLTFDG SPSDVRSLCE SLGLAILAFQ PFRDFESMPE PQRRRNFERA ERKFDLMEEL GTDFLLVCSN VSPQAFDDLG RAAEDLRELA ERAARRGLRI GFEALAWGRH ISDYRDAWDV VKRADHPALG IVLDSFHILA RGHELETMAD IPAEKIAFVQ IADAPLLDMD VLQWSRHFRC FPGQGRLPLA SFMQALARTG YAGPLSLEIF NDAFRAAPTE ATAIDGLRSL IWIEELAEGA AWSEASPPAV GYDGVHFIEF TLDEESAAPL GEFFSALGFR HIGRHRSKNV ELWHQGDIHL VLNFETDSFA HTFRLLHGTS VCAVGVRVND LDSAVTRAAH YKAQWFRGPV GEGEMEIPAL RGIEGSLVYL VDDAQAREMQ WKTDFHLFED GQDDDAGLIN IDHISYVLPP TQLLSWLLFH RTVFGFDAGP EHEIADPHGM VVSQTVTSPD NSVRIPLTVS SARETLPGRF LSEHQGGVQQ IAFASGDIFA TIDAMLARGL PMLRIPANYY DDLAARFDLD DALLEAMRSR NILFDRNDDG DFFHAYTETF MGRFFFEVVE RRGSYSQFGA VNAPIRLAAQ AGQQR
|
| |