Gene Acid345_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3337 
Symbol 
ID4071255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3958724 
End bp3959923 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content63% 
IMG OID637985359 
Product5-amino-6-(5-phosphoribosylamino)uracil reductase / diaminohydroxyphosphoribosylaminopyrimidine deaminase 
Protein accessionYP_592412 
Protein GI94970364 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.557389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTGATT ACGCGCCGCG CATGATCTCT ATCAACGCCA GCCCCGATGA GCGCTTCACG 
CTGCGTGCGC TGGAACTCGC GAGCAAGGGC ATTGCCATGG CTTCGCCGAA TCCGCGCGTC
GGAGCCGTGG TTGTCAACGC CGATGGCGAA GTTATCGGCG AAGGCTTTCA CATGTACGAC
GGCCTGAAGC ACGCGGAAGT TCTCGCGCTG GAACAAGCGG GAAACGCTGC GCGCGGCGCC
ACGCTCTACT TGAACTTGGA ACCGTGTTCG CACGAGGGCC GCACCGGGCC ATGCGCGGAT
GCGGTGATCG CCGCGGGAAT CAAGCGCGTG GTGTGCTCGA TGCCGGACCC GAATCCGCTG
GTCGCTGGGC GCGGTTTCGC GAAGTTGCGC GAGGCCGGAG TCGAACTGCT CGTGGGCGTT
TTCGCTGACT ACGCGCGCAA GTTGAACGAA GGCTTCTCCA AACGCATCCG CACTGGGCTG
CCGCTGGTGA CGCTGAAGGC CGCGATGACG CTTGACGGCA AAATCGCGCC GCCTCCTGCC
GAAGTGACCG ATATCGGCTT CTCGCAGCCG GGATGGATCA CCAGCGAGAT CGCGCGCGCA
CACGTGCAGG AGCTGCGCCA TGCAGCGGAC TCCATCATGA TCGGCGTAGG CACGGTGGTC
AGCGATAATC CGCTGCTCAC CGATCGCACG GGGCTGCCAC GGCGGCGTCC GCTGCTGCGC
GTGGTGCTTG ATTCGCAATT GCGGCTGCCA CTGGATTCGC GGCTGGTGAA GACCGCGCAT
CATGATGTGC TCGTCTTCTG CTCATTTGCG GAAGAGAAGA AGCGCGCGGC GCTGGAAGAA
CGCGGCATTC AAGTGGAACA AGTGCCGCTC GCGAAGAACG AACCCGGTGG AGTTGCCGTT
GCCGGCGGGC GTCCGGATTT GAATGCCGTC GTGGAACGAC TAGGCGCGCG CGAGTTGAAT
AGCCTCATCG TTGAAGGCGG CGCGGCGGTG AACTGGGCTG CGTTGGCTGC GGGCATCGTG
GACAAAGTTT TCTTGTATTA CGCGCCGAAG ATTCTCGGTG GCGGCAATGC CGTCCCGTTC
GCACTGGGAC CGGGATACGC CCGCATGGAT GAGGCCGCGT ATTTGCGCAA TCTCGAACTG
CACCGCTTCG GCGAGGACTT CGCAGTCGAA GGGTATTTGC GCGATCCTTA CAGCGACTGA
 
Protein sequence
MPDYAPRMIS INASPDERFT LRALELASKG IAMASPNPRV GAVVVNADGE VIGEGFHMYD 
GLKHAEVLAL EQAGNAARGA TLYLNLEPCS HEGRTGPCAD AVIAAGIKRV VCSMPDPNPL
VAGRGFAKLR EAGVELLVGV FADYARKLNE GFSKRIRTGL PLVTLKAAMT LDGKIAPPPA
EVTDIGFSQP GWITSEIARA HVQELRHAAD SIMIGVGTVV SDNPLLTDRT GLPRRRPLLR
VVLDSQLRLP LDSRLVKTAH HDVLVFCSFA EEKKRAALEE RGIQVEQVPL AKNEPGGVAV
AGGRPDLNAV VERLGARELN SLIVEGGAAV NWAALAAGIV DKVFLYYAPK ILGGGNAVPF
ALGPGYARMD EAAYLRNLEL HRFGEDFAVE GYLRDPYSD