Gene Arth_4079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4079 
Symbol 
ID4447721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4601652 
End bp4603541 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content66% 
IMG OID639691910 
Productxylose isomerase domain-containing protein 
Protein accessionYP_833554 
Protein GI116672621 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACCG GAATCGCCAC CGTGTGCCTG TCCGGCACGC TCAAGGAAAA AATGCAGGCA 
TGCGCCATCG CCGGCTTCGA CGGAATCGAA ATCTTCGAGC AGGACCTGGT GACGTCCTCC
CTTAGCCCCG AGGACATCCG GAAGACAGCC GCGGACCTGG GCCTGACACT GGACCTCTAC
CAGCCGTTCC GCGACTTCGA CAGCGTCCCC GAGGACCTCC TCGCCGCCAA CCTCCGCCGG
GCCGGGGCCA AGTTCAAGCT GATGTCCCGC CTGGGCATGG ACACCATTTT GGTCTGCTCC
AACGTCGGCA CGGCGACCAT CGACGACGAC TCCCTGCGCG CCGAACAGCT GGCCCGGCTC
GCCGGACTCG CCGCGGACCA CGGCGTGAAG GTGGCCTACG AGGCACTCGC CTGGGGCAAG
TACGTCAATG ATTACGAGCA CGCCCACCGC CTGGTGGAGA CCGTGGACCA CCCGAACCTG
GGCACCTGCC TGGATTCCTT CCACATCCTT TCCCGGGACT GGGACACGGC GCCCATCGAA
GCGTTCAGTG CGGACAAGAT TTTCTTCGTC CAGGTGGCCG ACGCTCCAAA GCTGTCCATG
GACGTCCTGT CCTGGAGCCG CCATTACCGG GTCTTCCCGG GCGAGGGCCA GTTTGAGCTC
GCCAAATTCA TGGGCCACGT GGTGCGCGCC GGATACACCG GACCGGTCTC GCTGGAGGTC
TTCAATGACG TCTTCCGCCA GTCCGACGTC GAACGCACGG CAGTGGACGC CATGCGCTCG
CTGATCTGGC TGGAGGAGCA AAGTGCCAAA TGGCTGGACG CAAACGAAAA AGCGGCCGGC
CGCCATCGCT ATCCCATGGA ACTGGCCACC CTCCCGCAGG TGGCCGAACC GGCCGGTTTC
AACTTCGCCG AGGTCAAAGC GGCCGATACC GCGGGCCTGG AAAAGGTGCT GGGACAACTA
GGATTTGAAT TCAACGGCAG ACACCGCACC AAGGACGTGC AATTGTGGAG CATGGGCCAC
GCACGCGTGA TCATCAATGA GGCTTCGGCA GGCGCCGGGG ACTCTTCGCC AGCGATTGCC
GCCCTCGGCT TCGATGTCGA TTCTCCCGTG ATCGCCGCGG CCCGCGCCCA GCAGCTCAAG
GCGCCCGCCG TGCCCCGCAA GAGCCAGGCC GACGAAGAAG TGTTCCAGGG ATTCGCTGCG
CCGGACTCCA CCGAGATCTT CCTCTGCCAG GGCAGCCCGG ACGGCACCGC AGCCTGGACC
CGCGAGTTCG GCGAAGGGCT GGAGTTTCCG GGCGCCGGCG GACGCAACGC GGTGATCGAC
CACGTGAACC TCGCCCAGCC GTGGCAGCAC TTTGACGAAG CTGTGCTGTT CTACACCAGC
GCCCTGGCCC TGGAGCCGCA GCCGTTCGCG GAGGTGCCCA GCCCCAGCGG ACTGGTGCGC
TCCCAGGTGA TGCTGACGGC CGACCGTGCC GTGCGCCTGG TGCTGAACCT TGCCCCGGTG
ATCCAGCAGG ACGGCGCGGA TTCGGGCACC GCGCACCGGA AGACCTACCA GGAGCACATC
GCCTTCGCCG TGGACGACCT CGTGGAGGCA GCCCGTGCAG CCCGGGACCG GGGCCTGGAT
TTCCTGCAGA TCCCGGCCAA CTACTACGAG GACCTGGACG CGCGGTTCGA CCTCGACCCC
GCCTTCCTGG CCACGCTCCG GGAGCTCAAC CTCTTGTACG ACCGCGACGC CGACGGTGAG
TTCCTGCACT TCTACACCGC CACCGTGGGC AGCGTCTTCT TCGAAATGGT GGAGCGCCGC
GGCGGCTACG ACGGTTATGG GGCGCCCAAC GCGCCGGTCC GGCATGCCGT CCAGTACGAC
CACCTGCACC GGCTGGGCCG CACCAGCTGA
 
Protein sequence
MRTGIATVCL SGTLKEKMQA CAIAGFDGIE IFEQDLVTSS LSPEDIRKTA ADLGLTLDLY 
QPFRDFDSVP EDLLAANLRR AGAKFKLMSR LGMDTILVCS NVGTATIDDD SLRAEQLARL
AGLAADHGVK VAYEALAWGK YVNDYEHAHR LVETVDHPNL GTCLDSFHIL SRDWDTAPIE
AFSADKIFFV QVADAPKLSM DVLSWSRHYR VFPGEGQFEL AKFMGHVVRA GYTGPVSLEV
FNDVFRQSDV ERTAVDAMRS LIWLEEQSAK WLDANEKAAG RHRYPMELAT LPQVAEPAGF
NFAEVKAADT AGLEKVLGQL GFEFNGRHRT KDVQLWSMGH ARVIINEASA GAGDSSPAIA
ALGFDVDSPV IAAARAQQLK APAVPRKSQA DEEVFQGFAA PDSTEIFLCQ GSPDGTAAWT
REFGEGLEFP GAGGRNAVID HVNLAQPWQH FDEAVLFYTS ALALEPQPFA EVPSPSGLVR
SQVMLTADRA VRLVLNLAPV IQQDGADSGT AHRKTYQEHI AFAVDDLVEA ARAARDRGLD
FLQIPANYYE DLDARFDLDP AFLATLRELN LLYDRDADGE FLHFYTATVG SVFFEMVERR
GGYDGYGAPN APVRHAVQYD HLHRLGRTS