Gene Avin_18900 details

Gene Information

Locus tag: Avin_18900
Symbol: pyrD
ID: 7760824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1879640
End bp: 1880683
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 70%
IMG OID: 643804788
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_002799077
Protein GI: 226944004
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
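
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1879640
END_BP = 1880683


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1044, matches the Gene Length field
    print(protein_length(length))  # 347, matches the Protein Length field
```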


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.388059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACACCC TGGCCCGCCA GCTGCTGTTC AAACTGTCCC CGGAAACCGC CCACGAACTG 
ACCATCGATC TGCTCGGGGC CGGTGGCCGT CTGGGTCTCA ACGGCCTGCT GTGCCACCGG
CCGGCGAGCC TGCCGGTGCG GGTGATGGGC CTGGACTTCC CCAATCCGGT CGGCCTCGCC
GCCGGACTGG ACAAGAACGG CGACGCCATC GACGGCCTCG CCCAACTGGG TTTCGGCTTC
GTCGAGATTG GCACCGTGAC GCCCCGGCCG CAGCCGGGCA ACCCCAGGCC GCGGCTGTTC
CGTCTGCCGC AGGCCGAGGC GATCGTCAAC CGCATGGGCT TCAACAACCT GGGTGTCGAC
CATCTGCTGG CGCGGGTCCA GGCGGCACGC TACAGCGGCG TGCTCGGTAT CAACATCGGC
AAGAATTTCG ACACTCCCGT GGAGCGGGCG GTGGACGACT ACCTGATCTG CCTGGACAAG
GTCTACCCCC ATGCCAGCTA CGTGACGGTC AACGTCAGCT CGCCGAACAC TCCCGGCCTG
CGCAGCCTGC AGTTCGGCGA CTCGCTCAGG CAACTGCTCG AAGCCTTGCG CCAGCGCCAG
GAGGAACTGG CCGGTCGCCA CGGCCGGCGC GTGCCGCTGG CGATCAAGAT CGCCCCGGAC
ATGAGCGACG AGGAGACCGC GCAGGTCGCC CGGGCGCTGC TGGATACCGG CATGGACGCG
GTGATCGCCA CTAACACCAC CCTCGGCCGC GAGGGCGTCG AGGGGCTGGC GCATGCCGGC
GAGGCCGGCG GGTTGTCCGG TGCGCCGGTA CGCGAGAAGA GCACCCATGC GGTGCGGGTG
CTGGCCGGGG AACTGGGCGG GCGGCTGCCG ATCGTCGCGG TCGGCGGCAT CACCGAAGGG
CGCCACGCGG CGGAAAAGAT CGCCGCCGGA GCCAGCCTGG TGCAGATTTA TACCGGCTTC
GTCTACAAGG GGCCGGCGCT GATACGCGAA GCGGTGGAGG CCATCGCCGC GCTGCGGGGC
GAGCGGCCGG TCGGGACGCA TTGA
 
Protein sequence
MYTLARQLLF KLSPETAHEL TIDLLGAGGR LGLNGLLCHR PASLPVRVMG LDFPNPVGLA 
AGLDKNGDAI DGLAQLGFGF VEIGTVTPRP QPGNPRPRLF RLPQAEAIVN RMGFNNLGVD
HLLARVQAAR YSGVLGINIG KNFDTPVERA VDDYLICLDK VYPHASYVTV NVSSPNTPGL
RSLQFGDSLR QLLEALRQRQ EELAGRHGRR VPLAIKIAPD MSDEETAQVA RALLDTGMDA
VIATNTTLGR EGVEGLAHAG EAGGLSGAPV REKSTHAVRV LAGELGGRLP IVAVGGITEG
RHAAEKIAAG ASLVQIYTGF VYKGPALIRE AVEAIAALRG ERPVGTH
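
Both sequences are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before computing anything from them. A minimal sketch of that cleanup follows, using only the first and last lines of the gene sequence as sample input; the helper names `clean_sequence` and `gc_fraction` are hypothetical and chosen for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGTACACCC TGGCCCGCCA GCTGCTGTTC AAACTGTCCC CGGAAACCGC CCACGAACTG "
LAST_LINE = "GAGCGGCCGG TCGGGACGCA TTGA"

if __name__ == "__main__":
    first = clean_sequence(FIRST_LINE)
    last = clean_sequence(LAST_LINE)
    print(len(first))             # 60 bases on a full line
    print(first.startswith("ATG"))  # True: the CDS begins with a start codon
    print(last.endswith("TGA"))     # True: the CDS ends with a stop codon
    print(round(gc_fraction(first), 2))
```

Joining every gene sequence line before calling `clean_sequence` gives the full coding sequence, whose length and GC fraction can then be compared with the Gene Length and GC content fields.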