Gene Elen_1663 details

Gene Information

Locus tag: Elen_1663
Symbol:
ID: 8415962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1964340
End bp: 1965410
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 67%
IMG OID: 645024632
Product: dihydroorotate dehydrogenase family protein
Protein accession: YP_003182020
Protein GI: 257791414
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein
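
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, but both reproduce the listed values.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1964340
end_bp = 1965410

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one residue per codon, with the final stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1071 356
```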


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000377649
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.00130858
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGGGCGATA GCTTCAACGC AACGCAACCT GTTCGGTCGG GCGGCGGGGA CACCCCTCCA 
GTCGCTCGGA CAGTCCGCGC TCGGTCGAAC AGTCCGCAGG ACTGTTCGAT TCGCTGCGGA
ACTCGCTTGG TGGAGGGGTG TCCCCGCCGC CCTGTTCCGA CTTCGGTGGA CATGCGCGTG
AATCTTGGCG GGTTGGAGAT GAAGAACCCG GTGACGGTGG CTTCGGGGAC GTTTGCTGCG
GGGCGCGAGT ACGGGGACTT CGTGGACGTG GCGGGCCTCG GGGCCGTCAC GACGAAGGGC
GTGTCGCTGA ACGGCTGGGC GGGCAACGCC AGTCCCCGCA TCGCCGAGAC GCCTTCGGGC
ATGCTCAACT CCATCGGGCT TCAGAACCCC GGCGTGGCGC ACTTGAAGGA ATGCGATCTG
CCGTGGCTGG CCGAGCGCGG CGCGACGGTC ATCGTGAACG TGTCGGGCCA CAGTTTCGAC
GAGTACGTGC AGGTGATCGA GGCGTTGGAA GACGTGCCGG TGGACGCGTA CGAGGTGAAC
ATCTCGTGCC CGAACGTGGA CGCGGGCGGC ATGACCATCG GCACGTGCAC GGACAGCGTC
GAGGCGGTCG TGTCCCGGTG CCGCGCCGCC ACGAAGCGCC CGCTCATCGT GAAACTCACG
CCGAACGTCA CCGACGTGAC CGAGATCGCG CGCGCCGCAG TGTCGGCCGG CGCCGACGCG
CTGTCGCTCA TCAACACGCT TCTGGGCATG GCCATCGATG CGGAGCGCCG CCGACCGCAG
CTCGCGCGCG GCGTGGGCGG ACTGTCGGGC CCGGCCGTCA AGCCTGTGGC GCTGCGCATG
GTGTGGGAGG TTCACCAGGC CGTCGACGTG CCGCTGCTCG GCATGGGCGG CATCTCGTGC
GCGACCGACG CGGTGGAGTT CATGCTGGCC GGGGCCACGG CGGTGGCCGT CGGCACCGCG
AATTTCGTGA ATCCGCACGC CACGGTTGAA ATCATCGACG GAATGGCGCA GTATTGCGAA
AGGCACGGCA TCGAAGACGT GCAGCAACTG ATAGGAGCTT TGGAATGGTG A
 
Protein sequence
MGDSFNATQP VRSGGGDTPP VARTVRARSN SPQDCSIRCG TRLVEGCPRR PVPTSVDMRV 
NLGGLEMKNP VTVASGTFAA GREYGDFVDV AGLGAVTTKG VSLNGWAGNA SPRIAETPSG
MLNSIGLQNP GVAHLKECDL PWLAERGATV IVNVSGHSFD EYVQVIEALE DVPVDAYEVN
ISCPNVDAGG MTIGTCTDSV EAVVSRCRAA TKRPLIVKLT PNVTDVTEIA RAAVSAGADA
LSLINTLLGM AIDAERRRPQ LARGVGGLSG PAVKPVALRM VWEVHQAVDV PLLGMGGISC
ATDAVEFMLA GATAVAVGTA NFVNPHATVE IIDGMAQYCE RHGIEDVQQL IGALEW
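
The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own pipeline: the `translate` helper and `CODON_TABLE` name are hypothetical, and the codon-to-residue assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The input is the first 60-base line of the gene sequence, read as printed.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGCGATA GCTTCAACGC AACGCAACCT GTTCGGTCGG GCGGCGGGGA CACCCCTCCA"
print(translate(first_line))  # MGDSFNATQPVRSGGGDTPP
```

The result matches the first 20 residues of the protein sequence listed above. Passing the full 1071-base gene sequence to the same function should yield the complete 356-residue protein, since the final codon, TGA, is a stop.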