Gene Cagg_3360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3360 
Symbol 
ID7267100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4073050 
End bp4074516 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID643568169 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_002464640 
Protein GI219850207 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.321157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000187525 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGTATC TCATAAAAAA CGGCACAATT ATCGATCCGG CCAACCGAGT GGCAACTATC 
GGTGATATTT TGGTCGCCGA CGGCAAAGTT GAGCGATTGT ACGATCTGGC CGATCTCCAC
AGCGATCGCG AACCCATCGG GCCGGACGTG GAAGTGATTA ATGCTCGTGG CTGTGTCGTC
GCACCGGGTT TTACCGATCT CCACACCCAC CTGCGACAGC CGGGTGAAGA ACATCGAGAG
ACGATTACCA GCGTGAGCGC GGCAGCAGCC GTTGGCGGCT TTACCACGCT GTGCGCCCGT
CCCACAACGC ACCCCACCCC GGATAACGCG GCGGCAATTC GGCAGTTGCG TGAATTGGTC
GCGCACTTTG GGAGTGTGCG GATCGATGTG ATCGGCGCGT TGACGTTGGG GAACGAAGGG
CGGATCTTGA GTGAGATGCG CGAACTGGCC GAAGCCGGCT GTATTGCGTT TAGTGACGGT
GGACGGACCA TCGCCGACGC GGCGCTGATG CGGCATGCGT TATCGTATGC GGCAGCGCTC
AATTTGCCGG TGATGGTGAC GTGTCAAGAC CCGTCGTTGG CTGCCGGTGG TGTTGCCCAT
GAAGGGGCAG TGAGTGTACG TTTAGGTCTG CCGGGCATCC CTGCAGCCGC CGAAGAAGCC
ATTGTAGCCC GCGATATTGC CCTCGCCGAA GCGACCGGTG CTCATTTGCA CATCAGCCGA
GTAAGTACGG CCGGCAGCGT CGCGCTGATC CGAGCTGCAC GAGCGCGTGG GGTGCGGGTG
ACGGCAGAAG TGACGCCGCA CCACCTGACA CTGACCGACC GCTGGCTGCT GGGCTGGCTG
GAAGAGCGAA ACGAGATCGA AACCGGCCGC GCCGGTGCCC ATCCCGATCT GAGCTTACCA
TCGTGGCTTG AGCCAAGCCT ATTACCGCCA TACGACAGTT CAACGCGGGT TGAACCGCCC
TTACGCAGCA TCGAACATGT TGAAGCGTTG GTGGCCGGCT TGCGTGATGG CGTGATCGAT
GCGATTGCAG TTGATCACGC GCCGCTGGCA CTTGTTGACC GTGAGTGTGA GTACGGGATT
GCCCCACCCG GCATCAGCGG TCTGGAGACG GCACTTGCCC TTACGCTGAC TCTCGTCCAT
CGCGGTGAGA TGGATATTGT CAACCTGATT GCGAAACTCA CCGAGGGGCC GGCGCAGGTA
CTCAACCGGT CGCCGGCGAA CTTGCGGCCC GGGGCAACCG CCGACATCGT GATCTTCGAT
CCTGAGCGGA GCTGGGTGGT AGACCCCGAT CACTTCCGGT CACGTGGGCG TAACACGCCG
CTACGCGGCC AACGGTTGAA GGGACAGGTG ATGTTGACGA TGGCTGCCGG CAAGATTGTG
TTCCGTCGCG ACAATTTTGG CCGGCAAGGA CAAGCAGCAC CACAACCCTC ACGACTCGAA
GGTATTTTGG AGAGTGAAGA GACATAA
 
Protein sequence
MRYLIKNGTI IDPANRVATI GDILVADGKV ERLYDLADLH SDREPIGPDV EVINARGCVV 
APGFTDLHTH LRQPGEEHRE TITSVSAAAA VGGFTTLCAR PTTHPTPDNA AAIRQLRELV
AHFGSVRIDV IGALTLGNEG RILSEMRELA EAGCIAFSDG GRTIADAALM RHALSYAAAL
NLPVMVTCQD PSLAAGGVAH EGAVSVRLGL PGIPAAAEEA IVARDIALAE ATGAHLHISR
VSTAGSVALI RAARARGVRV TAEVTPHHLT LTDRWLLGWL EERNEIETGR AGAHPDLSLP
SWLEPSLLPP YDSSTRVEPP LRSIEHVEAL VAGLRDGVID AIAVDHAPLA LVDRECEYGI
APPGISGLET ALALTLTLVH RGEMDIVNLI AKLTEGPAQV LNRSPANLRP GATADIVIFD
PERSWVVDPD HFRSRGRNTP LRGQRLKGQV MLTMAAGKIV FRRDNFGRQG QAAPQPSRLE
GILESEET