Gene Synpcc7942_2385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2385 
Symbol 
ID3774669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2454974 
End bp2455996 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content59% 
IMG OID637800833 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_401402 
Protein GI81301194 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00885466 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACCTGA GCATGGATTA CCTGGGGCTG AAGTTGCGAT CGCCCTTGGT GGTTGGGGCT 
GCTGCTCCCC TGAGTGAGCG GACGGAACAA CTCCCCGCCT TAGAAGCAGC GGGAGCGAGT
GCGATCGTGC TCCATTCCCT ATTTGAGGAG CAGGTAGAAG CCGAGCATCA GGCCGTTTGG
CAGCAGTTGG AAGTCGGCAG TCATCACTAT GCAGAATCCC TGACCTATGC GCCGGAGCCC
GCTTGGTTCC CGATCGGGCC GGTGCATTAT CTCCGGCAGA TTGAGAAGGC TAAAGCTCAA
GTTCAAATCC CGATCATCGC CAGTTTGAAT GGCACCAGTG ATAACGGCTG GGTTGATTAC
GCCCGCCGAA TTGAAGGTGC TGGCGCGGAT GCACTGGAGC TGAACTTGTA CGCGCTGCCG
GTCGATCCGA ATCAGAGCGG AGCGGAAGTC GAAGCCCAAT ATCTGCGGGT GGTTGAGCAA
GTGCGGGCGG CGACGCAGTT ACCGCTGGCG GTTAAGCTCA GCCCTTTTTT CAGCAGTCCC
GGCCACATGA TCCGGCAGTT TGCCCAAGCG GGGGCTCAGG CGATCGTGCT GTTTAACCGC
TTTTATCAGC CGGATATCGA CATTGAGAGT TTGGACGTCG TGCCGCGCCT GATCTTGAGT
AACCCACAGG ATCAACGGCT GCCGCTGCAT TGGATTGCGC TGCTCTACGG ACAGGTGCCG
GTGGATTTTG CGGCGACGGG TGGCATTCAA CGCGCTGATG ATGTGATCCG CATGGTGATG
GCGGGAGCAG CCACCACGCA AATTGTGGGG GCACTGTTAC GCCATGGCCC CGACGTCTTG
CAGCGGATTG AAGCAGACTT AAAAACGTGG CTGGCGGAGC ATGATTGCCC CGCGTTGTCG
CTGCTGCAAG GTTGCATGAG TCAGCAGTCC TGTCCGGCGC CCGATCGCTT TGAACGGGTG
CAGTATCTGC GATCGCTGCA GAGTGGCACT TGGGCGGTGC CGGAAGTCTT CCCGGGGGGT
TGA
 
Protein sequence
MDLSMDYLGL KLRSPLVVGA AAPLSERTEQ LPALEAAGAS AIVLHSLFEE QVEAEHQAVW 
QQLEVGSHHY AESLTYAPEP AWFPIGPVHY LRQIEKAKAQ VQIPIIASLN GTSDNGWVDY
ARRIEGAGAD ALELNLYALP VDPNQSGAEV EAQYLRVVEQ VRAATQLPLA VKLSPFFSSP
GHMIRQFAQA GAQAIVLFNR FYQPDIDIES LDVVPRLILS NPQDQRLPLH WIALLYGQVP
VDFAATGGIQ RADDVIRMVM AGAATTQIVG ALLRHGPDVL QRIEADLKTW LAEHDCPALS
LLQGCMSQQS CPAPDRFERV QYLRSLQSGT WAVPEVFPGG