Gene Cag_0623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0623 
Symbol 
ID3746938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp897461 
End bp898561 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content50% 
IMG OID637773159 
Productriboflavin biosynthesis protein RibD 
Protein accessionYP_378939 
Protein GI78188601 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00992131 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCAC TTGAGCATAC GTTTTATATG CAGCGTGCTT TGGAGTTAGC GTTGCGTGGT 
GCTGGAAGGG TAAGCCCCAA TCCAATGGTG GGCGCTCTCT TAGTTCAAGA GGGGGAAATT
ATTGGCGAAG GGTGGCATGA GCGTTATGGC GAAGCTCATG CAGAGGTGAA TGCGATTGCT
GCCGTTACCA ATGAGGCATG GTTGCGTGAA GCGACGTTGT ACGTAACCTT AGAGCCATGT
TCGCATTTTG GCAAAACACC TCCTTGCAGC GATTTAATTA TTGCAAAGCA AATTCCGCGC
GTTGTGGTTG GCTGTCGCGA TCCATTTCCT GCTGTAGCAG GACGAGGTAT TGCAAAATTG
CGTGCTGCGG GCATTGAGGT TATTGAAGGC GTTTTAGAAG CAGAATGTTT ACAAAGCAAC
GAAGCGTTTA TCAAAAGCCA CACCGTTGGA TTGCCATTTG TAACGCTGAA GTTAGCGCAA
ACTCTTGATG GCAAGTTAGC CACGGTAACG GGTGCATCGC GTTGGATTAC CGGAGAAGAG
GCTCGTGCTG AGGTGCACCG TTTGCGAAGT GTGTATGATG CGGTGCTGGT GGGTGGCGCT
ACAGCACTTG CCGATAATTC ACAACTTACG GTTCGCCAAG CCAACGGGCG CAATCCATTG
CGCGTTGTGC TTGATCGTTC ACTTCAGTTG CCGCTTGAAA GCCTTATCTT TAACCATGAA
GCGCCAACCT TGCTTTTTAC TTCTCTCTCT CAGCAGCACT CTCCAAAAGT GGAGGCGTTA
CAAAAATTGG GCGTAAGCGT TCATGCTGTT AGCGAAAGTG CCGAGGGGTT GCAACTGCGT
GAAGTGCTGG AAGAGCTGCA TCATCGGCAC ATCCTTTCCG TATTAGTAGA GAGTGGCAGT
CGCCTTGGTG CTGCACTGTT GCAAGCAGGT TTTGTTGATA AACTCTTGAT TTTTATAGCG
CCAAAGCTCT TTGGTGGCGA TGGATTAAGT GCCTTTGCTC CGCTTGGCGT AACGGTGCCC
GACGAAGCAA TTGCACTACG CTTTGAGTTG CCACGCTTTT TTGGAAAAGA TTTGTTGCTT
GAGGCTTACA TTAACTCTTA G
 
Protein sequence
MPALEHTFYM QRALELALRG AGRVSPNPMV GALLVQEGEI IGEGWHERYG EAHAEVNAIA 
AVTNEAWLRE ATLYVTLEPC SHFGKTPPCS DLIIAKQIPR VVVGCRDPFP AVAGRGIAKL
RAAGIEVIEG VLEAECLQSN EAFIKSHTVG LPFVTLKLAQ TLDGKLATVT GASRWITGEE
ARAEVHRLRS VYDAVLVGGA TALADNSQLT VRQANGRNPL RVVLDRSLQL PLESLIFNHE
APTLLFTSLS QQHSPKVEAL QKLGVSVHAV SESAEGLQLR EVLEELHHRH ILSVLVESGS
RLGAALLQAG FVDKLLIFIA PKLFGGDGLS AFAPLGVTVP DEAIALRFEL PRFFGKDLLL
EAYINS