Gene HMPREF0424_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0923 
Symbol 
ID8709427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1054306 
End bp1055307 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content46% 
IMG OID646483021 
Productdihydroorotate dehydrogenase 1B 
Protein accessionYP_003374137 
Protein GI283783383 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCTG TAAATAATGA CGAAAATAAT TCTGTAAATA ACGCAAATTC CGCAACAAAT 
TCCGCTGAAA CTTTTGGCTT TGACACTAGC CACGTTTGGA AACATCCAAC TCAAGTTGCA
GGCGTTAAGT GGAAAAACAT GGTTGGTACA GCTTCTGGAA CTTTCCAGCT TGCAGCTTGC
CGACGTTTTT ACGACGTAAG CCAACTCGGC GCAATTTGCA CAAAAGGCGT TTCACCTGTT
CCGTGGGAAG GAAATCCTTC TCCGCGCACT GCAGAATCTC CTTCTGGCAT GGTAAACGCA
GTTGGATTGC AAAATCCTGG CGTCGACCAC TACTTAGTAG ACGAGCTTCC GAAACTAAAG
AAAATGGGAG CGCTTGTTAT TACTAATGTT GCAGGGCACA GTGACGACGA TTATGCGCAA
GTTGTTGAAA AGCTTGCAGA TTCTGCTGCA GACATGCTTG AAATTAACGT AAGCTGCCCA
AACGTAACTC ACGGCGGAAT GAGCGTTGGC ACGGATCCGG TGGCATTACA CCGCTTAATT
AAGCGACTTC GCGCAATGAC AGATAAGCCA ATGATTGTAA AAATGACGCC AAATGTGACG
GATATTGTCT CGATTTGCAA AGCTGCAGTT GATGCTGGAG CAGATGCTTT AAGCATGATT
AATACGCTTG TTGGTTTGCG AATTGATATT CGAACAGGCG AGCCTATTAT TGCAAACCGC
ACAGGCGGTG TTTCCGGTCC TGCAATCTTC CCGATTGGTC TTGGATTTGT GTGGCGAGTT
CGTCAAGCTA TGCCAGATAT TCCAATTATT GGCATTGGTG GCATTGATTC CGGCGAAAAA
GCTTTGGAAT ACTTGTATGC TGGCGCTAAT GCTGTAGAAG TTGGTGCTGC CGCTTTGGTG
GATCCTACTG CTCCTATTCG CATTGCTCGC GAGCTTGATG ATTTGCTTGA TTCTCGTCCA
AAGCTTGCGT CTTTACTTGC CGAAGGAAAG ACTTGGCGCT GA
 
Protein sequence
MNAVNNDENN SVNNANSATN SAETFGFDTS HVWKHPTQVA GVKWKNMVGT ASGTFQLAAC 
RRFYDVSQLG AICTKGVSPV PWEGNPSPRT AESPSGMVNA VGLQNPGVDH YLVDELPKLK
KMGALVITNV AGHSDDDYAQ VVEKLADSAA DMLEINVSCP NVTHGGMSVG TDPVALHRLI
KRLRAMTDKP MIVKMTPNVT DIVSICKAAV DAGADALSMI NTLVGLRIDI RTGEPIIANR
TGGVSGPAIF PIGLGFVWRV RQAMPDIPII GIGGIDSGEK ALEYLYAGAN AVEVGAAALV
DPTAPIRIAR ELDDLLDSRP KLASLLAEGK TWR