Gene Sare_1867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1867 
Symbol 
ID5704825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2154382 
End bp2155443 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content75% 
IMG OID641271368 
Productriboflavin biosynthesis protein RibD 
Protein accessionYP_001536743 
Protein GI159037490 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGTG TCTCCGTCGA CGAGGCAATG CGCCGTGCGG TCGAGCTCGC CGCGCGCGGC 
CTCGGCACCA CCAGCCCCAA CCCGGTCGTC GGCTGCGTGC TGCTCGACCC GGGCGGCCGG
GTCGTCGGTG AGGGCTTCCA CGCGTACGCG GGCGGGCCGC ACGCCGAGAT CGTCGCGCTC
GCCCAAGCGG GAGAACGGGC GCGCGGCGGC ACCGCGGTGG TAACTCTGGA ACCCTGCGAC
CACACCGGCC GCACCGGCCC CTGCAGCACC GCTCTGGTAC AGGCCGGGAT CGCCCGGGTG
GTCATCGCCG TACCCGACCC GAACCGGGCC GCCTCGGGCG GCGCTGCCAC GCTGCGTGCC
GCCGGGATCC GGGTCGACCT GGGGGTACGC GCCGCAGAGG CGGAGGCGGG CAACGTCGCG
TGGCTCACCT CGACCCGCCG GGGCTGGCCG TACGTCATCT GGAAGTACGC GGCGACACTG
GACGGGCGGT CAGCGGCTGC CGACGGTACC AGCATGTGGA TCACCTCCGA GGTGGCCCGG
ATGGACGTGC ACGCGTTGCG CGGCACCGTG GACGCCGTGC TCGCCGGGGT GGGCACCGTG
CTCGCCGACG ACCCGCGCCT GACCGTCCGG AACCTGCGCG ACGGAAGCCT CGCCATCCGG
CAGCCGCTAC GGGTGGTGGT GGACTCGACG GGCCGGACCC CGCCGGGGGC GCGGGTCCGG
GACGACGCCG CGCCCACCTG GATCGCCACC GCCGAGGAGG TCGGTGTCGA CCCGGAGGGC
CGGGTCGAGC TGGCGGGGCT GCTCGCCGCG CTGCACCAGC GCGGGGTACG CGCCGCGCTG
GTGGAGGGAG GGCCGCGGTT GGCCGGCGCG TTCCTCGCCG CCGGGCTCGT CGACAAGATC
GTCGGCTACG TCGCGCCCCG GCTGCTCGGC GCCGGCCCGG CCGCACTCAC GGAGGCGGGC
GTGACCACGA TCACCGAGGC CATCGACTGC GAGGTCGTTG ACGTTACCCA GGTCGGTCCC
GACCTGCGGA TCACCGCACT GCCCCGGAAG AGGGAGGGCT GA
 
Protein sequence
MAGVSVDEAM RRAVELAARG LGTTSPNPVV GCVLLDPGGR VVGEGFHAYA GGPHAEIVAL 
AQAGERARGG TAVVTLEPCD HTGRTGPCST ALVQAGIARV VIAVPDPNRA ASGGAATLRA
AGIRVDLGVR AAEAEAGNVA WLTSTRRGWP YVIWKYAATL DGRSAAADGT SMWITSEVAR
MDVHALRGTV DAVLAGVGTV LADDPRLTVR NLRDGSLAIR QPLRVVVDST GRTPPGARVR
DDAAPTWIAT AEEVGVDPEG RVELAGLLAA LHQRGVRAAL VEGGPRLAGA FLAAGLVDKI
VGYVAPRLLG AGPAALTEAG VTTITEAIDC EVVDVTQVGP DLRITALPRK REG