Gene Cagg_3367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3367 
Symbol 
ID7267107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4080306 
End bp4081733 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content59% 
IMG OID643568176 
Productprotoporphyrinogen oxidase 
Protein accessionYP_002464647 
Protein GI219850214 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR00562] protoporphyrinogen oxidase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.55383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000608505 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGATGG CAAACTACGA CAGTGTCGTG ATCGGGGGTG GTATTGGCGG GTTGGCGGCA 
GCCTATACCC TCTATAAACG AGGGTACCGC GTGTTGGTGA TCGAAGCTGC CAATCGGGTC
GGTGGCGTGA TCCACAGCAT TACCACGCCC GAAGGTTTCA CACTCGACTG CGGGCCAAAT
ACGATTGGGA CGAATGACGT GCGTCTGTGG CAGGAGTTGA TCGATCTCGG TCTGCGCGAT
CGGATCAGAC CGGCAGCACG GTGTGGCAGA CGACGATACA TTTTGATCAA CGGGACGCCG
ATTGAGATTC CCTCGTCGCC GGTGGGACTG ATCACGACCC GTTTGCTCTC GTGGCGTGGT
AAATTGCGTG TGCTGGGCGA ACCATTTGTC AATATCGGTA CACCTACCGG TGAAGAGAGT
GTTGCCGCTT TCTTCAGCCG ACGGATCGGC CATGAAGCAG TTGCTCACTT GCTCGATCCG
TTTGTGGCCG GAGTCTACGC CGGCGATCCC AATCAACTGT CGGCGGCAGC AGTCTTCCCA
TCGCTGTGGG AAGCGGTGCA GCGCGGTGGT AGTATCGTGC GCGGGATGCT GAGGCGTCCG
AAGCAAAAAA CGCTTATCAG CGAACCCAAG ATGCGGAGCC GAACCTTCAG TTTTCAAGGC
GGATTAGCCG ATTGGCCGCG AGCTATTGCC CGCGCCCTTG GCACCGGCAA TGTCTGGACG
GGGCGTAGGG CTGTCGGCCT GCGTGATCTC GGCACGTATT GGGAAGTGAC GGTTGATGGA
ACAGGCCGTC TTGAGACGAT CACGACGCGC AGTGTGATCA TCGCGACACC GGCCTACGTC
GCCGCCGAAC TCGTTGAAGC GCTTGATCCG GCGGCAGCGA GCGCGCTCCG CAGCATCCCA
TACGCACCGG TATCCGTCGT TCACCTCGGT TTTCGCCGCG ATCAGCTCTC GCACGAACTG
AACGGATTTG GGGTTTTAGC CCCTTCAAGC GAACGTCGGC AGTTTTTGGG GATTCTATGG
GCGTCTAGCC TTTTCCCACA CGTTGCCCCG CCTGACCGTG TCTTAACGAT CACGTTGTCG
GGTGGTGCGA TCCGACCAGA AGTGGCCGAA CAGAGCGAAG AGGCGCTGAT CGAATCGGCC
ATCCGTGACA ATCAAGAAGT GTTGGGCATT CGGGGCCAAC CGCTGCTGAC CCACGTCACG
CGCTGGCACC ATGCAATTGC ACAATACACG CTCGGTCATC GTGAGCGGAT TGCAACCCTC
GAACGGCTCG AACAGCGGCG TCCAACATTG CAATTGACCG GTAGCTACCG CGGTGGGATC
GGTATCCCGA AGACGTGGGC TAGCGGTGTT GGAGCGGGCG AACGGATTGC AGCAGCGCTC
GATGCGCAAG GCACGACTGC CGATACGCTG GAACAGGCAC GCGGGTAA
 
Protein sequence
MMMANYDSVV IGGGIGGLAA AYTLYKRGYR VLVIEAANRV GGVIHSITTP EGFTLDCGPN 
TIGTNDVRLW QELIDLGLRD RIRPAARCGR RRYILINGTP IEIPSSPVGL ITTRLLSWRG
KLRVLGEPFV NIGTPTGEES VAAFFSRRIG HEAVAHLLDP FVAGVYAGDP NQLSAAAVFP
SLWEAVQRGG SIVRGMLRRP KQKTLISEPK MRSRTFSFQG GLADWPRAIA RALGTGNVWT
GRRAVGLRDL GTYWEVTVDG TGRLETITTR SVIIATPAYV AAELVEALDP AAASALRSIP
YAPVSVVHLG FRRDQLSHEL NGFGVLAPSS ERRQFLGILW ASSLFPHVAP PDRVLTITLS
GGAIRPEVAE QSEEALIESA IRDNQEVLGI RGQPLLTHVT RWHHAIAQYT LGHRERIATL
ERLEQRRPTL QLTGSYRGGI GIPKTWASGV GAGERIAAAL DAQGTTADTL EQARG