Gene Clim_1863 details

Gene Information

Locus tag: Clim_1863
Symbol:
ID: 6355204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2047125
End bp: 2048084
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 57%
IMG OID: 642669466
Product: dihydroorotate dehydrogenase family protein
Protein accession: YP_001943880
Protein GI: 189347351
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein
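
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon (neither is stated in the record; the variable names are illustrative only).

```python
# Coordinates as listed in the record; assumed 1-based and inclusive.
start_bp = 2047125
end_bp = 2048084

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codon_count = gene_length // 3
protein_length = codon_count - 1      # assumes the final codon is a stop

print(gene_length, codon_count, protein_length)  # 960 320 319
```

Under those assumptions the result agrees with the listed gene length (960 bp) and protein length (319 aa).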


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCCG ACGCTTCGCC TCAAATAGTC AAGCCCGCTG CCGCCGTATC GCTCGGCCGC 
GGGCTTGACC TCAGGTCTCC GGTCATGCTT GCTTCAGGCA CCGTATCTTA CGGCGAAGAG
CTCAGCAGGC TCTGCGACCT TCGAAAAATC GGGGGAATCG TCACCAAAGC GATCTCTCTC
GAACCCAGAA CCGGAAATCC TCCCCAGCGC ATTGCCGAAA CCCCGTCCGG CATGATCAAC
GCCATCGGGC TTGCCAATGT CGGGGTTGAA CGATTTATCG CCGAAAAAGT CCCCTTTCTG
CGGGGACTCG GCACGGCGGT CATCGTCAAC ATCGCCGGCC GCTCCATCGA CGACTACTGC
GAAGTGGTCT CCAGGCTCGA CACCGTCGAA GGCCTCCACG CATACGAAAT CAATCTCTCC
TGTCCCAACG TCAAAGGCGA ATGCATGATC ATGGGCGTCA GCCGCGACGC AACCTTTGAA
ATCGTCTCCG AACTCCGCAA GCTGACCCGG CGCCACCTCA TGATCAAACT GACGCCCAAC
GTCACATCCA TCAGCAGCAT AGCCCTTGCC GCCCAGGAAG CCGGAGCCGA CTCCGTATCG
CTCATCAACA CCCTCGTCGG CATGGCCGTC AACTACAAAA CCCGAAAACC GCTCATTAAA
AACCGTCACC GGAGGCCTCT CAGGACCGGC AATAAAACCC GTAGCACTTG CAAAAGTCTG
GGAAGTCTAC AACGCCGTAA ATATTCCGGT AGTAGGCATG GGAGGCATAG GCAGCTTCGA
AGACGCCATG GAATTCCTGC TCGTCGGTGC AAGCGCAATA CAGATAGGCA CCATGAACTT
CGTCTACCCC GACATCAGCC AGCGAATCGC CCAAGCCATC GAAACCCACT TCTCCGCACC
AAACGCCCCG GCATACCAGG ATTATGTGGG AAGCCTGATT GTTTAAATGC CGTTGGCTGA
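
The GC content field in the record is the percentage of G and C bases in a sequence like the one above. The sketch below shows one way to compute it for a whitespace-grouped sequence; `gc_percent` is a hypothetical helper name, and the example input is only the first ten bases of the gene, not the full sequence.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)

# First ten bases of the gene sequence: 4 of 10 are G or C.
print(gc_percent("ATGAATTCCG"))  # 40.0
```

Passing the full 960 bp sequence to the same function would give a figure to compare against the listed GC content.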
 
Protein sequence
MNSDASPQIV KPAAAVSLGR GLDLRSPVML ASGTVSYGEE LSRLCDLRKI GGIVTKAISL 
EPRTGNPPQR IAETPSGMIN AIGLANVGVE RFIAEKVPFL RGLGTAVIVN IAGRSIDDYC
EVVSRLDTVE GLHAYEINLS CPNVKGECMI MGVSRDATFE IVSELRKLTR RHLMIKLTPN
VTSISSIALA AQEAGADSVS LINTLVGMAV NYKTRKPLIK NRHRRPLRTG NKTRSTCKSL
GSLQRRKYSG SRHGRHRQLR RRHGIPARRC KRNTDRHHEL RLPRHQPANR PSHRNPLLRT
KRPGIPGLCG KPDCLNAVG
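
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand instead of using a library, and relies on the fact that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons). `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino acids
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a whitespace-grouped DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First and last 60 bases of the gene sequence (both start on a codon boundary).
first_line = "ATGAATTCCG ACGCTTCGCC TCAAATAGTC AAGCCCGCTG CCGCCGTATC GCTCGGCCGC"
last_line = "AAACGCCCCG GCATACCAGG ATTATGTGGG AAGCCTGATT GTTTAAATGC CGTTGGCTGA"

print(translate(first_line))  # MNSDASPQIVKPAAAVSLGR
print(translate(last_line))   # KRPGIPGLCGKPDCLNAVG*
```

The two outputs match the first 20 and last 19 residues of the protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon.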