Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0597 |
Symbol | |
ID | 4270376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 650401 |
End bp | 651432 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638125344 |
Product | dihydroorotase |
Protein accession | YP_741441 |
Protein GI | 114319758 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0418] Dihydroorotase |
TIGRFAM ID | [TIGR00856] dihydroorotase, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000000000000356431 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCAACGCA TCACCCTGAC CCGCCCGGAT GACTGGCACC TGCACCTGCG CGACGGCGAG CAGCTGGCCA CCGTGCTGCC CCACACCGCG ACGGTGTTCG GCCGGGCGAT TATCATGCCC AACCTGAAAC CGCCGGTGAC CACGGTGGAC CAGGCCGCCG CCTACCGCGA GCGCATCCTG GCGGCACTGC CCGCCGGCGC GCACTTCGAG CCGCTGATGA CCCTGTACCT GACCGACAAC ACGCCGCCCG CGGAGATCGA GAAGGCGGCG GCCAGCGGCT TCGTGCACGC GGTCAAGCTC TACCCGGCCG GCGCCACCAC CAACTCCGAC GCCGGGGTCA CCGACCTGGC CCGCTGCGAG GAGACCCTGG CCGCCATGGC CGAGCGGGGC CTGCCGCTGT GCGTGCACGG CGAGGTCACC CGCGACGAGG TGGACATCTT CGATCGCGAG GCGCACTTCA TCGACGAGGT GCTCGACCCG CTGGTGCAGC GCCACAGCCG GCTGCGGGTG GTCTTCGAAC ACATCACCAC CCAGGCGGCG GTGGACTACC TGCGCCAGGC CCCCGACCGG GTCGGCGCCA CCCTCACCGT CCAGCACCTG ATGGCCAACC GCAACCACAT GCTGGTGGGC GGCGTGCGCC CGCACTACTA CTGCCTGCCC ATCCTCAAGC GCGAGCGCGA CCGCCAGGCG CTGGTGGAAG CGGCCACCTC CGGCCACCCG CGCTTCTTCC TCGGCACCGA CAGCGCCCCC CACCCCAAGG GCGCCAAGGA GTCGGCCTGC GGCTGCGCCG GCGTCTACAG CGCCCACGCC GCCCTGCCCT TCTACGCCGA GATCTTCGAG GCCGCCGGCG CCCTGGACCG GCTGGAGGGC TTTGCCAGCC ACCACGGGGC CGACTTCTAC GGCCTGCCGC GCAACCGCGA CAGCGTCACC CTGGAGCGGG CCGCCACCCC CATTCCGGAG GCCTTCCCCA TGGGCGACGA CACCCTGGTG CCCTTCCGGG CCGGCGGCGA GGTGGCCTGG CGGGTCGTCT GA
|
Protein sequence | MQRITLTRPD DWHLHLRDGE QLATVLPHTA TVFGRAIIMP NLKPPVTTVD QAAAYRERIL AALPAGAHFE PLMTLYLTDN TPPAEIEKAA ASGFVHAVKL YPAGATTNSD AGVTDLARCE ETLAAMAERG LPLCVHGEVT RDEVDIFDRE AHFIDEVLDP LVQRHSRLRV VFEHITTQAA VDYLRQAPDR VGATLTVQHL MANRNHMLVG GVRPHYYCLP ILKRERDRQA LVEAATSGHP RFFLGTDSAP HPKGAKESAC GCAGVYSAHA ALPFYAEIFE AAGALDRLEG FASHHGADFY GLPRNRDSVT LERAATPIPE AFPMGDDTLV PFRAGGEVAW RVV
|
| |