Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1663 |
Symbol | |
ID | 8415962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1964340 |
End bp | 1965410 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024632 |
Product | dihydroorotate dehydrogenase family protein |
Protein accession | YP_003182020 |
Protein GI | 257791414 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000377649 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00130858 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCGATA GCTTCAACGC AACGCAACCT GTTCGGTCGG GCGGCGGGGA CACCCCTCCA GTCGCTCGGA CAGTCCGCGC TCGGTCGAAC AGTCCGCAGG ACTGTTCGAT TCGCTGCGGA ACTCGCTTGG TGGAGGGGTG TCCCCGCCGC CCTGTTCCGA CTTCGGTGGA CATGCGCGTG AATCTTGGCG GGTTGGAGAT GAAGAACCCG GTGACGGTGG CTTCGGGGAC GTTTGCTGCG GGGCGCGAGT ACGGGGACTT CGTGGACGTG GCGGGCCTCG GGGCCGTCAC GACGAAGGGC GTGTCGCTGA ACGGCTGGGC GGGCAACGCC AGTCCCCGCA TCGCCGAGAC GCCTTCGGGC ATGCTCAACT CCATCGGGCT TCAGAACCCC GGCGTGGCGC ACTTGAAGGA ATGCGATCTG CCGTGGCTGG CCGAGCGCGG CGCGACGGTC ATCGTGAACG TGTCGGGCCA CAGTTTCGAC GAGTACGTGC AGGTGATCGA GGCGTTGGAA GACGTGCCGG TGGACGCGTA CGAGGTGAAC ATCTCGTGCC CGAACGTGGA CGCGGGCGGC ATGACCATCG GCACGTGCAC GGACAGCGTC GAGGCGGTCG TGTCCCGGTG CCGCGCCGCC ACGAAGCGCC CGCTCATCGT GAAACTCACG CCGAACGTCA CCGACGTGAC CGAGATCGCG CGCGCCGCAG TGTCGGCCGG CGCCGACGCG CTGTCGCTCA TCAACACGCT TCTGGGCATG GCCATCGATG CGGAGCGCCG CCGACCGCAG CTCGCGCGCG GCGTGGGCGG ACTGTCGGGC CCGGCCGTCA AGCCTGTGGC GCTGCGCATG GTGTGGGAGG TTCACCAGGC CGTCGACGTG CCGCTGCTCG GCATGGGCGG CATCTCGTGC GCGACCGACG CGGTGGAGTT CATGCTGGCC GGGGCCACGG CGGTGGCCGT CGGCACCGCG AATTTCGTGA ATCCGCACGC CACGGTTGAA ATCATCGACG GAATGGCGCA GTATTGCGAA AGGCACGGCA TCGAAGACGT GCAGCAACTG ATAGGAGCTT TGGAATGGTG A
|
Protein sequence | MGDSFNATQP VRSGGGDTPP VARTVRARSN SPQDCSIRCG TRLVEGCPRR PVPTSVDMRV NLGGLEMKNP VTVASGTFAA GREYGDFVDV AGLGAVTTKG VSLNGWAGNA SPRIAETPSG MLNSIGLQNP GVAHLKECDL PWLAERGATV IVNVSGHSFD EYVQVIEALE DVPVDAYEVN ISCPNVDAGG MTIGTCTDSV EAVVSRCRAA TKRPLIVKLT PNVTDVTEIA RAAVSAGADA LSLINTLLGM AIDAERRRPQ LARGVGGLSG PAVKPVALRM VWEVHQAVDV PLLGMGGISC ATDAVEFMLA GATAVAVGTA NFVNPHATVE IIDGMAQYCE RHGIEDVQQL IGALEW
|
| |