Gene Cag_1806 details


Gene Information

Locus tag: Cag_1806
Symbol:
ID: 3746921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2327595
End bp: 2328719
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 47%
IMG OID: 637774344
Product: putative oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_380100
Protein GI: 78189762
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
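
The gene length and protein length listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a complete CDS: codon count minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for Cag_1806.
length = gene_length_bp(2327595, 2328719)
print(length)                     # 1125
print(protein_length_aa(length))  # 374
```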


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCCTCT CTCTTTACCT GCACATTCCT TTCTGCCGAG AGCGCTGCCC TTACTGCGAT 
TTTTATTTAA TAACAGGCAC AGGTCAGGTT GAGCCATTTT TTGCAGCTTT AGCGCGTGAA
ACCGCTTTTC GTGCAGATGA GCTACAAGGA GCCACGATCA GCGCTATCCA TATTGGTGGT
GGAACGCCAT CGTTAGTGCC TGTTGCCATG CTCTCGCGCT GGCTTGAGCA ACTTGCTTGT
TATGCAACCT TTGCGCCTAC TATGGAGCTG GCGCTTGAAG CAAATCCCGA AGATATAACG
CCTGCGTTGT TAGACGAGTT ACAATCGCTT GGCGTTAATC GTTTAAGCAT TGGCGTACAA
TCATTCTCCA TTCAAAAGCT CACCGTGCTG GGGCGAAAGC ATAGTGCCAC TGATGCCTTA
CGGGTAACGG AGATGGCGCT TGAGCGTTTT GCTTCTGTAA GCCTTGACTT AATGTGCGGC
TTGCCACATG AAACGCTTGC CGTATGGGAA GGTGATTTGT CCACAGCGTT AGCACTTCAG
CCGCATCACC TCTCTGTTTA TATGCTTTCT ATTGAGCCAA AAACGCGCTT TCATTGGCTT
GTGGCGCGTG GAGAATTGCC TGCGCCAATG GAGGCAGAGC AAGCACTCTT TTATGAAACC
GCAATCCATA CCATAAAGCT GCAAGGCTAT CAGCATTACG AAGTATCGAA CTTTTGCTTG
CCTAACTTTC ATTCACGCTA CAATCTTGCA AGCTGGGAGC GCAAACCATA TCTCGGTTTT
GGGGCAGCAG CTCATAGCTT TATTGTGCAA CAAAACCGTG AAATTCGTCA AGCTAATATT
GAAAGCTTAA GCCGTTATCT TGCCCATCCC GAAAATGCTG TAGCTTTTCG TGAAGAGCTT
GGCTGCAATG AGCGTTTTAC GGAGGAGCTT TTTTTAACCT TGCGTTTAAA TCGAGGATTA
TCAAGGAGCT TTTTTAGTCG AACAGCTTCG AGGGATGTGG TAGAAACCTT GTTTGCTACT
TTTCAAGAGC AAGGCTGGAT GTATGAGGAC AATGAACGTT TTTATTTAAC GGAGCGAGGT
TTTCTTTTTG CCGATTATAT TGCTGAGGAG TTGCTTGCAA AGTAA
 
Protein sequence
MSLSLYLHIP FCRERCPYCD FYLITGTGQV EPFFAALARE TAFRADELQG ATISAIHIGG 
GTPSLVPVAM LSRWLEQLAC YATFAPTMEL ALEANPEDIT PALLDELQSL GVNRLSIGVQ
SFSIQKLTVL GRKHSATDAL RVTEMALERF ASVSLDLMCG LPHETLAVWE GDLSTALALQ
PHHLSVYMLS IEPKTRFHWL VARGELPAPM EAEQALFYET AIHTIKLQGY QHYEVSNFCL
PNFHSRYNLA SWERKPYLGF GAAAHSFIVQ QNREIRQANI ESLSRYLAHP ENAVAFREEL
GCNERFTEEL FLTLRLNRGL SRSFFSRTAS RDVVETLFAT FQEQGWMYED NERFYLTERG
FLFADYIAEE LLAK
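
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the record. Table 11 assigns the same amino acids to codons as the standard code but allows alternative start codons such as GTG, which is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 nucleotides of the gene. It is a minimal example, not the database's own method: the function name `translate_cds` is hypothetical, and the code assumes a complete coding sequence that begins at a valid start codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring the spaces and line breaks used for layout."""
    dna = "".join(dna.split()).upper()
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)]
    if protein:
        # Assumption: the first codon is the initiator, so it is read as Met
        # even when it is an alternative start codon such as GTG.
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_cds("GTGAGCCTCT CTCTTTACCT GCACATTCCT"))  # MSLSLYLHIP
```

The output matches the first ten residues of the protein sequence. Passing the full 1125 bp gene sequence to the same function should yield a 374-residue protein, given the lengths stated in the record.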