Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_4079 |
Symbol | |
ID | 4447721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4601652 |
End bp | 4603541 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691910 |
Product | xylose isomerase domain-containing protein |
Protein accession | YP_833554 |
Protein GI | 116672621 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCACCG GAATCGCCAC CGTGTGCCTG TCCGGCACGC TCAAGGAAAA AATGCAGGCA TGCGCCATCG CCGGCTTCGA CGGAATCGAA ATCTTCGAGC AGGACCTGGT GACGTCCTCC CTTAGCCCCG AGGACATCCG GAAGACAGCC GCGGACCTGG GCCTGACACT GGACCTCTAC CAGCCGTTCC GCGACTTCGA CAGCGTCCCC GAGGACCTCC TCGCCGCCAA CCTCCGCCGG GCCGGGGCCA AGTTCAAGCT GATGTCCCGC CTGGGCATGG ACACCATTTT GGTCTGCTCC AACGTCGGCA CGGCGACCAT CGACGACGAC TCCCTGCGCG CCGAACAGCT GGCCCGGCTC GCCGGACTCG CCGCGGACCA CGGCGTGAAG GTGGCCTACG AGGCACTCGC CTGGGGCAAG TACGTCAATG ATTACGAGCA CGCCCACCGC CTGGTGGAGA CCGTGGACCA CCCGAACCTG GGCACCTGCC TGGATTCCTT CCACATCCTT TCCCGGGACT GGGACACGGC GCCCATCGAA GCGTTCAGTG CGGACAAGAT TTTCTTCGTC CAGGTGGCCG ACGCTCCAAA GCTGTCCATG GACGTCCTGT CCTGGAGCCG CCATTACCGG GTCTTCCCGG GCGAGGGCCA GTTTGAGCTC GCCAAATTCA TGGGCCACGT GGTGCGCGCC GGATACACCG GACCGGTCTC GCTGGAGGTC TTCAATGACG TCTTCCGCCA GTCCGACGTC GAACGCACGG CAGTGGACGC CATGCGCTCG CTGATCTGGC TGGAGGAGCA AAGTGCCAAA TGGCTGGACG CAAACGAAAA AGCGGCCGGC CGCCATCGCT ATCCCATGGA ACTGGCCACC CTCCCGCAGG TGGCCGAACC GGCCGGTTTC AACTTCGCCG AGGTCAAAGC GGCCGATACC GCGGGCCTGG AAAAGGTGCT GGGACAACTA GGATTTGAAT TCAACGGCAG ACACCGCACC AAGGACGTGC AATTGTGGAG CATGGGCCAC GCACGCGTGA TCATCAATGA GGCTTCGGCA GGCGCCGGGG ACTCTTCGCC AGCGATTGCC GCCCTCGGCT TCGATGTCGA TTCTCCCGTG ATCGCCGCGG CCCGCGCCCA GCAGCTCAAG GCGCCCGCCG TGCCCCGCAA GAGCCAGGCC GACGAAGAAG TGTTCCAGGG ATTCGCTGCG CCGGACTCCA CCGAGATCTT CCTCTGCCAG GGCAGCCCGG ACGGCACCGC AGCCTGGACC CGCGAGTTCG GCGAAGGGCT GGAGTTTCCG GGCGCCGGCG GACGCAACGC GGTGATCGAC CACGTGAACC TCGCCCAGCC GTGGCAGCAC TTTGACGAAG CTGTGCTGTT CTACACCAGC GCCCTGGCCC TGGAGCCGCA GCCGTTCGCG GAGGTGCCCA GCCCCAGCGG ACTGGTGCGC TCCCAGGTGA TGCTGACGGC CGACCGTGCC GTGCGCCTGG TGCTGAACCT TGCCCCGGTG ATCCAGCAGG ACGGCGCGGA TTCGGGCACC GCGCACCGGA AGACCTACCA GGAGCACATC GCCTTCGCCG TGGACGACCT CGTGGAGGCA GCCCGTGCAG CCCGGGACCG GGGCCTGGAT TTCCTGCAGA TCCCGGCCAA CTACTACGAG GACCTGGACG CGCGGTTCGA CCTCGACCCC GCCTTCCTGG CCACGCTCCG GGAGCTCAAC CTCTTGTACG ACCGCGACGC CGACGGTGAG TTCCTGCACT TCTACACCGC CACCGTGGGC AGCGTCTTCT TCGAAATGGT GGAGCGCCGC GGCGGCTACG ACGGTTATGG GGCGCCCAAC GCGCCGGTCC GGCATGCCGT CCAGTACGAC CACCTGCACC GGCTGGGCCG CACCAGCTGA
|
Protein sequence | MRTGIATVCL SGTLKEKMQA CAIAGFDGIE IFEQDLVTSS LSPEDIRKTA ADLGLTLDLY QPFRDFDSVP EDLLAANLRR AGAKFKLMSR LGMDTILVCS NVGTATIDDD SLRAEQLARL AGLAADHGVK VAYEALAWGK YVNDYEHAHR LVETVDHPNL GTCLDSFHIL SRDWDTAPIE AFSADKIFFV QVADAPKLSM DVLSWSRHYR VFPGEGQFEL AKFMGHVVRA GYTGPVSLEV FNDVFRQSDV ERTAVDAMRS LIWLEEQSAK WLDANEKAAG RHRYPMELAT LPQVAEPAGF NFAEVKAADT AGLEKVLGQL GFEFNGRHRT KDVQLWSMGH ARVIINEASA GAGDSSPAIA ALGFDVDSPV IAAARAQQLK APAVPRKSQA DEEVFQGFAA PDSTEIFLCQ GSPDGTAAWT REFGEGLEFP GAGGRNAVID HVNLAQPWQH FDEAVLFYTS ALALEPQPFA EVPSPSGLVR SQVMLTADRA VRLVLNLAPV IQQDGADSGT AHRKTYQEHI AFAVDDLVEA ARAARDRGLD FLQIPANYYE DLDARFDLDP AFLATLRELN LLYDRDADGE FLHFYTATVG SVFFEMVERR GGYDGYGAPN APVRHAVQYD HLHRLGRTS
|
| |