Gene Ndas_1868 details


Gene Information

Locus tag: Ndas_1868
Symbol:
ID: 9245718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2280829
End bp: 2282037
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003679802
Protein GI: 297560828
COG category:
COG ID:
TIGRFAM ID:
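
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function name `cds_lengths` is illustrative and not part of the record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and one terminal stop codon is not counted as a residue.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon gives the number of amino acids.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(2280829, 2282037)
print(gene_len, protein_len)  # 1209 402
```

Under these assumptions the result matches the listed gene length of 1209 bp and protein length of 402 aa.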


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.501601
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTACT ACCGCCAGGT CGGCGAGGTG CCCCGCACCC GCCACACCCA GCACCGCACC 
CCCGAGGGCG GGCTGTACTA CGAGGAGCTG ATGGGGGAGG AGGGCTTCTC CGCCGACTCC
TCGCTGCTCT ACCACCGCGC CATCCCCTCG GCCATCGTCG ACGCCGCCGA GTGGGAGGTC
CCCGACCAGT CCCGCACCCG CAACGCCCCG ATGGTGCCGC GCCACCTCAA ACTGCACGAG
CTGTTCTCCG ACCAGGAGTG GAAGGCCGCC GACGTCGTCG AGTCGCGCCG CCTGGTCCTG
GGCAACGACG ACGTGCGCAT CTCCTACGTG GTCGCCGGGG CGCCCTCGGA GCTGTACCGC
AACGGCATCG GCGACGAGTG CGTGTACGTG GAGTCGGGCA CCGCCCGCGT GGAGACCGTC
TTCGGTCTGC TGGAGGTCGG CCAGGGCGAC TACGTCGTCC TGCCGCGCGC CACCACCCAC
CGCTGGGTGC CCACCGGCGA CCAGCCCCTG CGCGCCTACG TGGTGGAGGC CAACAGCCAC
ATCGCCCCGC CCAAGCGCTA CCTGTCCCGC TACGGGCAGT TCCTGGAGCA CGCACCCTAC
TGCGAACGCG ACCTGCGCGG CCCCTCCGAG CCGCTGCTGG CCGAGGGCGA GAACGTCGAC
GTGCTCATCA AGCACCGCGG CGACGGTCCC GGGGGCATCT CCGGCACCCG ATACACCTAC
CCCACCCACC CCTTCGACGT CGTGGGCTGG GACGGGTTCC TGTACCCCTA CGTCTTCAAC
GTCGCCGACT TCCAGCCCAT CACCGGCCGC GTCCACCAGC CGCCGCCCGT GCACCAGGTG
TTCGAGGGCC ACAACTTCGT CATCTGCAAC TTCGTGCCGC GCAAGGTGGA CTACCACCCC
GGCGCCATCC CGGTGCCCTA CTACCACTCC AACGTGGACT CCGACGAGGT CATGTTCTAC
TGCGGCGGCG ACTACGAGGC CCGCAAGGGC TCCGGGATCG GCCAGGGCTC CATCTCCCTG
CACCCCGGCG GCCACTCCCA CGGCCCCCAG CCCGGCGCCT ACGAGCGCAG CATCGGCGCC
GAGGCCTTCG ACGAACTCGC CGTCATGGTC GACACCTTCC GCCCGCTCGA CCTGGGCGAG
GGCGGCCTGG CCAGCGACGA CGGAGTCTAC GCCTGGACCT GGGCCGGGGG ACGGAGGGAG
CGGCGGTGA
 
Protein sequence
MAYYRQVGEV PRTRHTQHRT PEGGLYYEEL MGEEGFSADS SLLYHRAIPS AIVDAAEWEV 
PDQSRTRNAP MVPRHLKLHE LFSDQEWKAA DVVESRRLVL GNDDVRISYV VAGAPSELYR
NGIGDECVYV ESGTARVETV FGLLEVGQGD YVVLPRATTH RWVPTGDQPL RAYVVEANSH
IAPPKRYLSR YGQFLEHAPY CERDLRGPSE PLLAEGENVD VLIKHRGDGP GGISGTRYTY
PTHPFDVVGW DGFLYPYVFN VADFQPITGR VHQPPPVHQV FEGHNFVICN FVPRKVDYHP
GAIPVPYYHS NVDSDEVMFY CGGDYEARKG SGIGQGSISL HPGGHSHGPQ PGAYERSIGA
EAFDELAVMV DTFRPLDLGE GGLASDDGVY AWTWAGGRRE RR
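
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such block-formatted text might be joined into a continuous string and its GC fraction computed. Only the first line of the gene sequence is used as sample input, so the result is not the whole-gene GC content. The names `join_blocks` and `gc_fraction` are illustrative.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record (sample input only).
first_line = join_blocks(
    "ATGGCCTACT ACCGCCAGGT CGGCGAGGTG CCCCGCACCC GCCACACCCA GCACCGCACC"
)
print(len(first_line), round(gc_fraction(first_line), 2))
```

Applying `join_blocks` to all lines of the gene sequence before calling `gc_fraction` would give a value comparable to the GC content field in the record.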