Gene Mflv_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3039 
Symbol 
ID4974360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3219766 
End bp3220827 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content71% 
IMG OID640457262 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001134304 
Protein GI145223626 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.892025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGGGG CACTGCGCAA GGCGCTGTTC CTGGTCCCGC CCGAACGCAT CCACGGTCTC 
GTGTTCGCCG GGTTGCGTGC GGCCACGACC CCGGTCCCGC TGCGCCGGAG TCTGTCGCGG
CGCCTCGCCC CCCACGATCC GGTGTTGGCC AGCACCGTGT TCGGGGTGCG TTTCCCGGGT
CCGCTGGGGC TGGCCGCAGG ATTCGACAAG GACGGCCTCG GCGTGCACAC CTGGGGCGCA
CTGGGTTTCG GGTATGCCGA ACTGGGAACC GTGACCGCGC AGGCACAGCC GGGCAATCCT
CCCCCACGGA TGTTCCGGCT GCCCGCCGAC CGGGCCCTGC TCAATCGCAT GGGGTTCAAC
AACCACGGGT CCGCGGCGCT GGCGCTGCAG CTGGCCCGCA GCTCCTCGGA CGTGCCGATC
GGGGTGAACA TCGGCAAGAC GAAGGTCACC GAGCCGCAGG ACGCACCGGC CGACTACGCC
GAGAGCGCCC GTCTGCTCGG GTCGCTGGCC GCCTATCTCG TGGTGAACGT GAGTTCGCCG
AACACCCCGG GTCTGCGCGA TCTGCAGTCG GTGGAGTCGT TGCGTCCGAT CCTGTCGGCG
GTCCTCGCCG AGACCTCGAC CCCGGTGCTG GTGAAGATCG CCCCCGACCT CGCCGACACC
GACATCGACG ACATCGCCGA TCTGGCAGTC GAACTCGGTC TCGCCGGGAT CGTGGCCACC
AACACCACGA TCTCCCGCGA CGGCCTGAAG ACTCCCGGTG CGGCCGACCT CGGCGCCGGG
GGTATCTCCG GCCCGCCGGT GGCCCGCCGC GCGCTGGAGG TGTTGCGCCG CCTGTACGCC
CGGGTGGGCG ACAAGCTGGT GCTCATCAGT GTCGGAGGCA TCGAGACGTC CGACGACGCG
TGGGAACGGA TCACCGCGGG CGCCTCGCTG CTGCAGGGGT ACACCGGGTT CGTTTACGGC
GGCGGCCTGT GGGCCAGGTC GATCAACGAC GGCGTCGCCG CCCGCCTCCG CGAGAACGGT
TTCGGGACCC TCGCGGAGGC GGTCGGCTCG GCGGCGCGCT AG
 
Protein sequence
MYGALRKALF LVPPERIHGL VFAGLRAATT PVPLRRSLSR RLAPHDPVLA STVFGVRFPG 
PLGLAAGFDK DGLGVHTWGA LGFGYAELGT VTAQAQPGNP PPRMFRLPAD RALLNRMGFN
NHGSAALALQ LARSSSDVPI GVNIGKTKVT EPQDAPADYA ESARLLGSLA AYLVVNVSSP
NTPGLRDLQS VESLRPILSA VLAETSTPVL VKIAPDLADT DIDDIADLAV ELGLAGIVAT
NTTISRDGLK TPGAADLGAG GISGPPVARR ALEVLRRLYA RVGDKLVLIS VGGIETSDDA
WERITAGASL LQGYTGFVYG GGLWARSIND GVAARLRENG FGTLAEAVGS AAR