Gene Namu_4274 details

Gene Information

Locus tag: Namu_4274
Symbol:
ID: 8449900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4753908
End bp: 4754975
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 74%
IMG OID: 645043322
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_003203551
Protein GI: 258654395
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
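
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it agrees with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4753908, 4754975)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```

Both results agree with the Gene Length and Protein Length fields of this record.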


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.637191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCG GCGGCGTGGC CTACCGGCGG GTGGCCCGGC CGGTGCTGTT CCGGATGGGC 
AAGGGTGACC CGGAGGTGGT CCACCACCGG ACCCTGTCCG CGCTGGCCCG GGTGTCCCGG
TCGGCCCCGG CGCTGCGCCT GCTGGGCGGC CTGCGCCGTC GACACCCCAG CCCGCGCACC
GTCTTCGGGG TGGACTTCCC GTCCGCCGTC GGGTTGGCGG CGGGCATGGA CAAGGACGGC
GTGGCCCTGA AGGCCTGGCC GGCCCTGGGT TTCGGTCACG TCGAGGTCGG CACCGTCACC
GCGCACCCGC AGCCGGGCAA CCCGCGGCCG CGGCTGTTCC GGCTGCCCGC CTCCGGCGCG
ATCATCAACC GGATGGGGTT CAACAACTCC GGGGCGCAGG CGCTGGCCGC CCGGCTGGCC
ACCACCGGCC GGATCGGCGT GCCGCTGGGC ATCTCGCTGG GCAAGTCCAA GATCACTCCG
GTGGACGAAG CCGTCGGCGA CTACCTGACC TCGCTGCGCG CCGTCTACCC GTTCGCGGAC
TACATCGCGG TCAACGTCTC CAGCCCGAAC ACCCCGGGCC TGCGCACCCT GCAGGATCGG
GCCCCGCTGG ACGAGCTGCT GGCCGCGCTG ACCACCGAGG CGGGCAGCCT GGCCTGGTCG
CTGGGGCAGC GGCGCACGCC GGTCCCGGTG CTGGTCAAGA TCGCCCCCGA CCTGACCGAT
CAGGCCATCG CCGACCTGCT GGAGGTCTGC GTGGACCGCG GCATCGCCGG GCTGATCGCC
ACCAACACCA CCTTGACCCG GCCGGGCCTG GCTGCCGGCG ACGCGGCCAC CGCCGCGGAA
GCCGGTGGGC TGTCCGGGCG GCCGCTGGCC CCCCGATCGC TGGAAGTGGT CCGCTTCGTC
ACCGCCCACT GCGACCTGCC GGTGATCGGC GTCGGCGGCA TCGGCACGGT CGACGACGGG
CTGCGCATGC TCGACGCCGG GGCCAGCCTG CTGCAGCTCT ACACCGGGTT CATCTTCGGC
GGGCCGCCGC TGGTGACCTC GTTGAACAAG GCCATCGCCG CCCGCTGA
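
The gene sequence above is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of one way to strip the layout and compute the GC fraction; the helper names are hypothetical, and only the first displayed line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence (60 bases).
first_line = (
    "ATGAGCGTCG GCGGCGTGGC CTACCGGCGG "
    "GTGGCCCGGC CGGTGCTGTT CCGGATGGGC"
)
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line) * 100), "% GC")
```

Running `gc_content` on the full 1068 bp sequence would give the value to compare against the GC content field in the Gene Information section.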
 
Protein sequence
MSVGGVAYRR VARPVLFRMG KGDPEVVHHR TLSALARVSR SAPALRLLGG LRRRHPSPRT 
VFGVDFPSAV GLAAGMDKDG VALKAWPALG FGHVEVGTVT AHPQPGNPRP RLFRLPASGA
IINRMGFNNS GAQALAARLA TTGRIGVPLG ISLGKSKITP VDEAVGDYLT SLRAVYPFAD
YIAVNVSSPN TPGLRTLQDR APLDELLAAL TTEAGSLAWS LGQRRTPVPV LVKIAPDLTD
QAIADLLEVC VDRGIAGLIA TNTTLTRPGL AAGDAATAAE AGGLSGRPLA PRSLEVVRFV
TAHCDLPVIG VGGIGTVDDG LRMLDAGASL LQLYTGFIFG GPPLVTSLNK AIAAR
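
The protein sequence can be spot-checked against the gene sequence by translating codons. The following is a minimal sketch that translates the first 60 bases of the gene. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The `translate` helper is hypothetical, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = (
    "ATGAGCGTCG GCGGCGTGGC CTACCGGCGG "
    "GTGGCCCGGC CGGTGCTGTT CCGGATGGGC"
)
print(translate(first_line))  # MSVGGVAYRRVARPVLFRMG
```

The output matches the first 20 residues of the protein sequence listed above.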