Gene Information |
Locus tag | Avin_18900 |
Symbol | pyrD |
ID | 7760824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1879640 |
End bp | 1880683 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643804788 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_002799077 |
Protein GI | 226944004 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
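The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends with a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length(1879640, 1880683)
print(length)                  # 1044
print(protein_length(length))  # 347
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.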
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.388059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACACCC TGGCCCGCCA GCTGCTGTTC AAACTGTCCC CGGAAACCGC CCACGAACTG ACCATCGATC TGCTCGGGGC CGGTGGCCGT CTGGGTCTCA ACGGCCTGCT GTGCCACCGG CCGGCGAGCC TGCCGGTGCG GGTGATGGGC CTGGACTTCC CCAATCCGGT CGGCCTCGCC GCCGGACTGG ACAAGAACGG CGACGCCATC GACGGCCTCG CCCAACTGGG TTTCGGCTTC GTCGAGATTG GCACCGTGAC GCCCCGGCCG CAGCCGGGCA ACCCCAGGCC GCGGCTGTTC CGTCTGCCGC AGGCCGAGGC GATCGTCAAC CGCATGGGCT TCAACAACCT GGGTGTCGAC CATCTGCTGG CGCGGGTCCA GGCGGCACGC TACAGCGGCG TGCTCGGTAT CAACATCGGC AAGAATTTCG ACACTCCCGT GGAGCGGGCG GTGGACGACT ACCTGATCTG CCTGGACAAG GTCTACCCCC ATGCCAGCTA CGTGACGGTC AACGTCAGCT CGCCGAACAC TCCCGGCCTG CGCAGCCTGC AGTTCGGCGA CTCGCTCAGG CAACTGCTCG AAGCCTTGCG CCAGCGCCAG GAGGAACTGG CCGGTCGCCA CGGCCGGCGC GTGCCGCTGG CGATCAAGAT CGCCCCGGAC ATGAGCGACG AGGAGACCGC GCAGGTCGCC CGGGCGCTGC TGGATACCGG CATGGACGCG GTGATCGCCA CTAACACCAC CCTCGGCCGC GAGGGCGTCG AGGGGCTGGC GCATGCCGGC GAGGCCGGCG GGTTGTCCGG TGCGCCGGTA CGCGAGAAGA GCACCCATGC GGTGCGGGTG CTGGCCGGGG AACTGGGCGG GCGGCTGCCG ATCGTCGCGG TCGGCGGCAT CACCGAAGGG CGCCACGCGG CGGAAAAGAT CGCCGCCGGA GCCAGCCTGG TGCAGATTTA TACCGGCTTC GTCTACAAGG GGCCGGCGCT GATACGCGAA GCGGTGGAGG CCATCGCCGC GCTGCGGGGC GAGCGGCCGG TCGGGACGCA TTGA
|
Protein sequence | MYTLARQLLF KLSPETAHEL TIDLLGAGGR LGLNGLLCHR PASLPVRVMG LDFPNPVGLA AGLDKNGDAI DGLAQLGFGF VEIGTVTPRP QPGNPRPRLF RLPQAEAIVN RMGFNNLGVD HLLARVQAAR YSGVLGINIG KNFDTPVERA VDDYLICLDK VYPHASYVTV NVSSPNTPGL RSLQFGDSLR QLLEALRQRQ EELAGRHGRR VPLAIKIAPD MSDEETAQVA RALLDTGMDA VIATNTTLGR EGVEGLAHAG EAGGLSGAPV REKSTHAVRV LAGELGGRLP IVAVGGITEG RHAAEKIAAG ASLVQIYTGF VYKGPALIRE AVEAIAALRG ERPVGTH
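The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it strips the spaces between the 10-base blocks, computes the G+C fraction, and translates codon by codon until the first stop. It assumes the amino-acid assignments of translation table 11 match the standard table (the two are understood to differ only in permitted start codons, which this sketch does not handle specially). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order; assumed shared by NCBI tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above.
prefix = "ATGTACACCC TGGCCCGC"
print(translate(prefix))  # MYTLAR
```

The six residues printed match the start of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence fields.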