Gene Arth_1674 details

Gene Information

Locus tag: Arth_1674
Symbol:
ID: 4445809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1869206
End bp: 1870366
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 70%
IMG OID: 639689495
Product: diaminohydroxyphosphoribosylaminopyrimidine deaminase / 5-amino-6-(5-phosphoribosylamino)uracil reductase
Protein accession: YP_831168
Protein GI: 116670235
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase
        [COG1985] Pyrimidine reductase, riboflavin biosynthesis
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain
            [TIGR00326] riboflavin biosynthesis protein RibD
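
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Arth_1674.
length = gene_length(1869206, 1870366)
print(length, protein_length(length))  # prints "1161 386"
```

Both results match the Gene Length and Protein Length fields of the record.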


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.164227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGCAG CCCTCGACGC TGCGCTCCTG GGGCCGCGCG GGGCAAACCC GCTGGTTGGC 
GCCGTCGTCA TCGGCCCTGA TGGCCGGCAA CTGGTGACCG GCTATCACCG CGGCGCCGGG
ACGGCGCATG CCGAGGCTGA CGCCATTGCC CAGGCGGGCA GGCAGGGCCT GGACCTGACC
GGGTCAACCA TGGTGGTCAC CCTCGAGCCG TGCAACCACT GCGGACGGAC CGGCCCCTGT
GCCCAGGCGA TCATCGACGC CGGCATTGCT TCGGTGGTGT ACGCAGTAGA CGACCCCCAT
GACCCCGCCG CCGGCGGAGC CGCAACGCTG CGGGCCGCCG GAGTCAGCGT ACGTTCCGGC
CTGTCCGCCC GCGCCGCGTT CGAGCTTAAC CGTCGCTGGT TTGAAGCCGT CCTCGGCCAG
CGCCCCTTCG TGACACTCCA CATCGCCCAG ACCCTGGACA GCCGCATCGC GGCAGCGGAC
GGTACCAGCC AATGGATCTC CAGCCCGGAG TCCCTCGCGG ACAACCACGG GCTCCGCAGC
CGGGTGGATG CCATCCTGGT GGGTACGCAA ACGCTTCTCG TGGACAACCC CCGGCTCACG
GCCCGGGACG CGTCCGGCAA GCCGGCCGGG AACCAGCCGT TGCGCGCAGT CATGGGGCTC
CGGGGAATTC CCGACGACGC CGCTATCCAC GGCGACGACG GCCGCGTCCT GCACCTGCCC
ACCCGGGATC CGCACGAGGC ACTGGAGAGG CTCTTCTCCG CCGGTGTCCG GCACGTCATG
GTGGAAGGCG GATCCAGCAT CCTGAGCGCC TTCCTCGCCG CCGGCCTCGT GGACGAACTG
ATCGTCTATC TGGCGCCCAC CCTGCTCGGA TCCGGGACTG CCGCCCTTGG CGACCTCGGC
ATCACGACCC TCGCCGACGC CCAGGCCTGG GACTGGGACC AGGCATCCGG GGGAGCCGTG
CAGAGCCTGG GCCGGGACCT TAGGCTCCAC CTCTTTCCAG GGAGCGTCGA ACCGGCAGCA
AGAACGGCCG CGTCCACGGA CCCGGCCGAA GCAGATCTGC CGGCAGCACC GGCCGCGCCC
GAATCCTTTC TATCCTTGAC CAGCTCACCC GAATCCTTAC CGCACCGCAC CGGCGCGGGC
ACCGCCACAG GAGGCAACTG A
 
Protein sequence
MDAALDAALL GPRGANPLVG AVVIGPDGRQ LVTGYHRGAG TAHAEADAIA QAGRQGLDLT 
GSTMVVTLEP CNHCGRTGPC AQAIIDAGIA SVVYAVDDPH DPAAGGAATL RAAGVSVRSG
LSARAAFELN RRWFEAVLGQ RPFVTLHIAQ TLDSRIAAAD GTSQWISSPE SLADNHGLRS
RVDAILVGTQ TLLVDNPRLT ARDASGKPAG NQPLRAVMGL RGIPDDAAIH GDDGRVLHLP
TRDPHEALER LFSAGVRHVM VEGGSSILSA FLAAGLVDEL IVYLAPTLLG SGTAALGDLG
ITTLADAQAW DWDQASGGAV QSLGRDLRLH LFPGSVEPAA RTAASTDPAE ADLPAAPAAP
ESFLSLTSSP ESLPHRTGAG TATGGN
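
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. A minimal sketch for turning such a block back into a contiguous string and computing its GC fraction follows. The helper names are hypothetical, and how the database rounds the GC content field is not stated in the record, so the result is best read as an approximate check.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage: paste the full gene sequence block in place of this first line.
block = "ATGGATGCAG CCCTCGACGC TGCGCTCCTG GGGCCGCGCG GGGCAAACCC GCTGGTTGGC"
seq = clean_sequence(block)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the length should equal the Gene Length field and the GC fraction can be compared with the GC content field.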