Gene Ent638_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1457 
Symbol 
ID5114422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1610450 
End bp1611460 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content47% 
IMG OID640491643 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001176188 
Protein GI146311114 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0360083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCCGAGCG CGCTCATGAA 
TTGACATTCC AGCAGTTACG TCGCATCACA GGAACACCTT TGGAAGCGCT GGTGCGCCAG
AAAGTGCAGG AAAAACCTGT TCAATGTATG GGGCTGACGT TTAAGAATCC CCTGGGTCTG
GCTGCTGGCC TGGACAAGAA CGGCGAGTGT ATTGATGCGC TGGGCGCGAT GGGATTTGGT
TCCATCGAAG TCGGCACGGT CACTCCACGT CCACAAGCGG GTAACGATAA ACCGCGACTG
TTCCGTCTGG TTGAAGCCGA AGGGTTGATC AATCGAATGG GCTTTAATAA TCACGGCGTC
GATCATCTGA TCGAGAACGT AAAAAAAGCG CATTTTGACG GCGTGCTGGG AATTAATATC
GGCAAAAATA AAGACACGCC GGTAGAGCAG GGTAAAGATG ACTATCTGAT TTGTATGGAA
AAAGTCTATG CTTATGCGGG TTATATTGCG GTGAATATCT CATCGCCAAA TACCCCTGGC
TTGCGTACGC TGCAATATGG TGAAGCGCTG GACGATCTGT TATCAGCCAT TAAAAATAAA
CAAAATGAAC TGCAGGAAAT TCACCATAAA TATGTTCCGG TCGCGGTAAA GATCGCTCCG
GATCTTTCCG TTGAAGAATT GATCCAGGTT GCCGATAGTT TGGTTCGCCA TAATATTGAT
GGTGTTATTG CGACCAATAC GACACTCGAT CGTTCGCTGG TAAATGGAAT GAAACATTGT
GATGAAATGG GTGGGTTAAG CGGCCGTCCG GTACAATTAA AAAGCACCGA AATTATTCGC
GCATTGTCCG CAGAATTAAA AGGGCGTTTA CCGATTATTG GCGTGGGTGG TATCGACTCT
GTCATCGCTG CGCGTGAGAA GATGGCTGCG GGTGCGACGC TTGTACAAAT CTATTCTGGT
TTTATTTTTA AAGGCCCTCA ATTGATTAAA GAAATCGTTA ATCATATCTA A
 
Protein sequence
MYYPFVRKAL FQLDPERAHE LTFQQLRRIT GTPLEALVRQ KVQEKPVQCM GLTFKNPLGL 
AAGLDKNGEC IDALGAMGFG SIEVGTVTPR PQAGNDKPRL FRLVEAEGLI NRMGFNNHGV
DHLIENVKKA HFDGVLGINI GKNKDTPVEQ GKDDYLICME KVYAYAGYIA VNISSPNTPG
LRTLQYGEAL DDLLSAIKNK QNELQEIHHK YVPVAVKIAP DLSVEELIQV ADSLVRHNID
GVIATNTTLD RSLVNGMKHC DEMGGLSGRP VQLKSTEIIR ALSAELKGRL PIIGVGGIDS
VIAAREKMAA GATLVQIYSG FIFKGPQLIK EIVNHI