Gene Cagg_2635 details

Gene Information

Locus tag: Cagg_2635
Symbol:
ID: 7267226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3229496
End bp: 3230563
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 56%
IMG OID: 643567461
Product: dihydroorotate dehydrogenase
Protein accession: YP_002463940
Protein GI: 219849507
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID:
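
The start and end coordinates, the gene length, and the protein length in the table above should agree with one another. The following is a minimal sketch of that check. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon, neither of which is stated in the record; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3229496
end_bp = 3230563

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed lengths match the 1068 bp and 355 aa listed in the table, and the remainder of zero indicates a whole number of codons.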


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACC TTAGTACCAC CTATCTGGGC ATGAAGCTGC GCACACCAAT CGTGGCCGCA 
GCCTCACCCA TCAGCCGCAA TGTCGAACTT GTCCGCCAAC TGGAAGAGGC CGGACTAGGG
GCAGTGGTGA TGTACTCGCT CTTCGAAGAG CAGATCATCC AGCAGAGCCT CGAGCTGGAC
CGTATGCTGA GCCATGGAGC CGAGAGCTTC GCCGAAGCCC TCAGCTATCT GCCCGAACAC
GGAGCATATA GCACCGGCCC CGAACGCTAC CTCGAGCAGG TAGCCGCCCT AAAACAGGGC
CTGAGCATTC CGGTCATCGG TAGCCTCAAC GGCGTCTCGA AGGGAGGTTG GGTGCATTAC
GCACGCTTGA TCCAGGAAGC CGGTGCTGAT GCGCTCGAAC TCAACATCTA CTTTGTGCCA
ATCGATACCA ACATCACCAG CTCCGAACTT GAAGACATTT ATGTCGATTT GGTTAAAGCA
GTGCGTGCTG AGATCAGTAT CCCACTGGCG GTGAAGATCG GCCCCTACTT CACCGCCCTC
CCCAACTTCG CATGGCGACT GATGGAAGCG GGAGCAAATG CGTTGGTATT GTTCAACCGC
TTCTACCAAC CTGATTTCGA TCTCGAACAG CTCTCGGTGC GCCCCAATTT GCAACTGAGC
ACTTCAGCAG AATTGCGGCT ATCACTGCGC TGGATCGCTT TACTCTACGG ACGCATCCCG
TTAGAGTTTG CCCTGAGCAG CGGTGTTCAC AATGCCATCG ACGTACTCAA AGGCCTGATG
GCCGGGGCCA ACGTAACGAT GATCGCATCG GCGTTCCTGC GAGGACGCGC TACCGATGTC
CTACGCACGA TTTTGCACGA CATGGAGTTG TGGCTCACCG AACACGAATA TGAATCGATA
GCACAACTGC ACGGCAGCAT GAGCCAGCGC GCCGTCGCCG AACCGGCCGC CTTTGAGCGC
GCAAACTACA TTCGCGTCCT CGATGATTAT CGTCCGCCTT ACGCCCTTGG GAGCCATACC
GATCTGACCG GACGGATGTT GTATCCGTTC CTCGGTGATG AAGTATAA
 
Protein sequence
MIDLSTTYLG MKLRTPIVAA ASPISRNVEL VRQLEEAGLG AVVMYSLFEE QIIQQSLELD 
RMLSHGAESF AEALSYLPEH GAYSTGPERY LEQVAALKQG LSIPVIGSLN GVSKGGWVHY
ARLIQEAGAD ALELNIYFVP IDTNITSSEL EDIYVDLVKA VRAEISIPLA VKIGPYFTAL
PNFAWRLMEA GANALVLFNR FYQPDFDLEQ LSVRPNLQLS TSAELRLSLR WIALLYGRIP
LEFALSSGVH NAIDVLKGLM AGANVTMIAS AFLRGRATDV LRTILHDMEL WLTEHEYESI
AQLHGSMSQR AVAEPAAFER ANYIRVLDDY RPPYALGSHT DLTGRMLYPF LGDEV
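
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before use. The sketch below shows one possible way to strip the spacing, compute GC content, and translate the gene sequence into protein. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tool. It uses the standard codon-to-amino-acid assignments, on the understanding that translation table 11 differs from the standard code only in its permitted start codons; treat that as an assumption to verify against the NCBI genetic code tables.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    An incomplete trailing codon is ignored.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence, copied as printed on the page.
fragment = "ATGATCGACC TTAGTACCAC"
print(translate(fragment))
```

The fragment translates to the first six residues of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` should give values comparable to the GC content and protein sequence reported in the record.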