Gene VC0395_A1098 details


Gene Information

Locus tag: VC0395_A1098
Symbol: pyrD
ID: 5137343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1150183
End bp: 1151193
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 45%
IMG OID: 640532556
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_001217044
Protein GI: 147675557
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
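
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions the record does not state outright: that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record does end in TAA). The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 1150183
end_bp = 1151193
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which adds no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions all three checks hold: the span is 1011 bp, which is 337 codons, and removing the stop codon leaves 336 residues.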


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.730125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTACC GCTTAGCCAG AGCTGGCTTT TTCCAATTGG ATGCCGAAAA GGCACACGAT 
CTGGCCATCT CTAATTTCAA ACGTTTCACT GGCACTCCTT TCGATCTCTT CTATCGTCAA
CAACTTCCTC ATCGTCCAGT TCAATGCATG GGCTTAACCT TTAAAAATCC AGTCGGTTTA
GCAGCAGGGC TCGACAAAAA CGGCGAGTGC ATCGAAGCGT TTGGCGCGAT GGGCTTCGGA
TTTGTTGAAG TAGGCACGGT CACACCAAGA CCACAAGCAG GTAACGACAA ACCACGCCTG
TTTCGTTTAG TGCATGCTGA AGGCATCATC AATCGAATGG GCTTTAACAA TCTGGGTGTT
GATCACTTGG TTGAGAATGT TAAGCGAGCC AAATACGATG GGATCATCGG GATCAACATC
GGTAAAAACA AAGATACTCC GATTGAGAAA GGGGCAGAGG ACTATTTGAT CTGTATGGAT
AAAGTTTATC CTTACGCAGG TTACATCGCC GTAAATATCT CTTCTCCGAA CACACCAGGA
CTTCGTTCTC TACAATACGG TGAAGCGCTG GATGAACTGC TTGCTGCATT GAAAACTCGC
CAAGCTGAAT TAGCAGCGAA ACATGATAAA TATGTCCCGC TTGCACTTAA GATTGCACCA
GATTTAAGTG ACGATGAAAT TCAGCAAATC TGCCAATCAC TTTTGAAAAA CAAAATCGAT
AGTGTCATCG CGACAAACAC CACCTTAGAT CGTTCATTGG TTGAAGGGAT GAAGTTTGCC
AACGAAGCTG GCGGCCTCAG TGGACGACCT TTGCAAAACC GCAGTACAGA AGTTATTAAG
TGTCTGTATA AAGAACTCGG TGAAGAAATT CCGATCATCG GGGTCGGTGG TATCGATTCC
TACATCTCCG CCAAAGAAAA GCTCTTAGCA GGAGCAAAAT TAGTTCAGGT CTATAGCGGC
TTTATTTATC AAGGACCAGG GCTGGTCGCC GATATCGTCA AGAACCTGTA A
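
The sequence above is printed in space-separated groups of ten bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample input is just the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
sample = clean_sequence(
    "ATGCTTTACC GCTTAGCCAG AGCTGGCTTT TTCCAATTGG ATGCCGAAAA GGCACACGAT"
)
print(len(sample), "bases,", round(gc_percent(sample), 1), "% GC")
```

Passing the full 17-line sequence through the same two functions would give values to compare against the Gene Length and GC content fields of this record.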
 
Protein sequence
MLYRLARAGF FQLDAEKAHD LAISNFKRFT GTPFDLFYRQ QLPHRPVQCM GLTFKNPVGL 
AAGLDKNGEC IEAFGAMGFG FVEVGTVTPR PQAGNDKPRL FRLVHAEGII NRMGFNNLGV
DHLVENVKRA KYDGIIGINI GKNKDTPIEK GAEDYLICMD KVYPYAGYIA VNISSPNTPG
LRSLQYGEAL DELLAALKTR QAELAAKHDK YVPLALKIAP DLSDDEIQQI CQSLLKNKID
SVIATNTTLD RSLVEGMKFA NEAGGLSGRP LQNRSTEVIK CLYKELGEEI PIIGVGGIDS
YISAKEKLLA GAKLVQVYSG FIYQGPGLVA DIVKNL
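
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce that with the Python standard library only. It is not necessarily how the database generated the protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may start a protein. Since this gene starts with ATG, alternative start codons are not handled. The function name `translate` and the constants are illustrative, and ambiguous bases such as N would raise a `KeyError`.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence above.
print(translate("ATGCTTTACC GCTTAGCCAG AGCTGGCTTT"))
print(translate("AAGAACCTGT AA"))
```

The two fragments translate to MLYRLARAGF and KNL, which match the first ten and last three residues of the protein sequence listed above.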