Gene Elen_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1989 
Symbol 
ID8416300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2333068 
End bp2334045 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content71% 
IMG OID645024966 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_003182342 
Protein GI257791736 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID[TIGR03151] putative enoyl-(acyl-carrier-protein) reductase II 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.563999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGA AGACGCGGGT AACGGAACTG CTGGGCATCG AGGTGCCCGT CGTGCAGGGC 
GCGATGGCGC GCATCGCGGA TGCGAGCCTG GCCGGCGCGG TGAGCGAGGC CGGCGGCCTC
GGCATCATCG CATGCGGCGG CGCGCCGCTC GACTGGGTCG AGGAGCAGGT GCGCATCGCC
CGCTCGATTA CCGACAAGCC CATCGGCGCG AACGTCATGC TCATGGATCC GAACGCGGGC
GAGACGGCCG AGCTTCTGGC GAAGCTGCGT GTTGACGTCA TCACGACGGG CGCGGGTTCT
CCCGCGAACT ACATGCAGCT GTGGAAGGAC GCCGGCATCA AGGTGGTGCC CGTGGTGGCC
TCCAGCGCGC TGGCCGCGCG CATGGAGCGC CTCGGAGCCG ACGCCGTGGT GGCCGAGGGC
ACCGAGGCCG GCGGCCATAT CGGCGAGCTG ACCACGATGG CGCTCATCCC CGCAGTATGC
GACGCCGTGT CCATCCCCGT GATCGCCGCA GGCGGCATCG CCGACGGGCG CGGCATGGCC
GCCGCCTTCG CGCTGGGCGC CGAGGGCGTG CAGGCGGGCA CCCGCTTCCT CACGGTGGAC
GAGTGCACCA TCGCCGACGC GTACAAAGAG CGCGTGATCG CCGCCAAGGA CGCCGACACC
ATCGTCACAG GCCGCGGCAG CGGGCATCCC GTGCGCTGCC TCAAGAACAA GTTCGCCCGT
ACCGTGCGCA AGCTCGAAGG CGACGTCGCC GCCAACGGCG ACGAGCTGGA GGCTATGTAC
GTGGGTTCCC TGCGCCGCGC CGTGGAGGGC GACGTGGACA ACGGCACCAT GATGGCGGGC
CAGTCGGCCG CGCTCGTGCA CGAGCGCGCC ACGGCGGCCG AGGCTATCGC TCGGATGATC
GAAGAAGCCG AGGCTCTGGG CGGTCTCGAC TTGGAAGCAC TGGCTGCGCT GAGCGCCCGG
CGCGGGCGTG CGATCTAG
 
Protein sequence
MSMKTRVTEL LGIEVPVVQG AMARIADASL AGAVSEAGGL GIIACGGAPL DWVEEQVRIA 
RSITDKPIGA NVMLMDPNAG ETAELLAKLR VDVITTGAGS PANYMQLWKD AGIKVVPVVA
SSALAARMER LGADAVVAEG TEAGGHIGEL TTMALIPAVC DAVSIPVIAA GGIADGRGMA
AAFALGAEGV QAGTRFLTVD ECTIADAYKE RVIAAKDADT IVTGRGSGHP VRCLKNKFAR
TVRKLEGDVA ANGDELEAMY VGSLRRAVEG DVDNGTMMAG QSAALVHERA TAAEAIARMI
EEAEALGGLD LEALAALSAR RGRAI