Gene Cagg_3837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3837 
Symbol 
ID7266317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4675611 
End bp4676681 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content59% 
IMG OID643568648 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_002465108 
Protein GI219850675 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000541635 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTACCGTG TGTTGTTGCG CCCGATCTTG TTTCGGTTGG GTGGAGGTGA TGCCGAGACG 
GCTCACGAAC GTACCTTGCA TCTGTTGGCG CTGCTGAGCC GTTCACGTCC GTTGTACCGC
ACGCTGGAGC TGCTGACGAC CATTCGAGAT CAACGACTGT CGCGTACCGT CTTCGGGGTT
CAGTTTCCCA ATCCGGTCGG TTTGGCTGCC GGTATGGATA AAGACGGTGT GGCGTTGCCG
GCATGGGCAG CCCTTGGCTT TGGTTTTGTC GAGGTGGGGA CTGTGACGCA CCTTCCGCAG
CCCGGTAACC CCCGCCCGCG TCTGTTTCGG TTACCGACCC ACGAGGCATT GATCAACCGG
ATGGGGTTCA ATAATGCCGG TGCAGCCGCC CTGGCCCATC GGTTGGCGTC CTTACAGCCG
GCCCCTGTTC CGGTTGGTGT CTCGATTGGT AAGTCGAAGG TGACACCACT CGAACACGCC
ATTGACGATT ATTGCGCTTC GTTTCGCGTG CTGTTTCCCT ATGCCGCATA TGTGGCGATT
AACGTAAGCT CGCCGAATAC GCCCGGTCTC CGCCAATTGC AAGATGCCGA TCATTTGCGC
GCATTGTTGG CAGCTCTGCA ACGTGTCAAC ACCGAGTTGG GGCGTACCCA TTCGCGTGGA
CCGCTTCCGC TATTGGTCAA GATTGCTCCC GATCTCAGTG AACCGGCGTT GGATGAACTC
TTGACCGTTT GTGCCGACCA TGCCGTTGCC GGCATTATTG CGACCAATAC GACAATTAGT
CGTCACGGTT TGGCCGGTGC TGACCCGGCG TTGGTTGTCG AGACCGGTGG CCTCAGTGGT
CGACCACTGA CATTACGTGC CCGGCAGCTA GTGCAGTATG TTGCCCGTGC AACCGGTGGT
CGGTTGCCGA TTATCGGGGT CGGTGGAATT CATTCACCGG ACGATGCACT GCGGATGTTC
GAGGCCGGGG CGGCGTTGAT CCAACTCTAC ACCGGGTTGG TGTATCACGG GCCGCTACTG
CCGCGGCGAA TCAACCATGC TCTGCTGTCG TATCGTAAGG GAGCTGCATG A
 
Protein sequence
MYRVLLRPIL FRLGGGDAET AHERTLHLLA LLSRSRPLYR TLELLTTIRD QRLSRTVFGV 
QFPNPVGLAA GMDKDGVALP AWAALGFGFV EVGTVTHLPQ PGNPRPRLFR LPTHEALINR
MGFNNAGAAA LAHRLASLQP APVPVGVSIG KSKVTPLEHA IDDYCASFRV LFPYAAYVAI
NVSSPNTPGL RQLQDADHLR ALLAALQRVN TELGRTHSRG PLPLLVKIAP DLSEPALDEL
LTVCADHAVA GIIATNTTIS RHGLAGADPA LVVETGGLSG RPLTLRARQL VQYVARATGG
RLPIIGVGGI HSPDDALRMF EAGAALIQLY TGLVYHGPLL PRRINHALLS YRKGAA