Gene CNA07120 details


Gene Information

Locus tag: CNA07120
Symbol:
ID: 3253518
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 1943187
End bp: 1944506
Gene Length: 1320 bp
Protein Length: 353 aa
Translation table:
GC content: 50%
IMG OID: 638253034
Product: dihydroorotase, putative
Protein accession: XP_567031
Protein GI: 58259237
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0418] Dihydroorotase
TIGRFAM ID: [TIGR00856] dihydroorotase, homodimeric type
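
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names `span_length` and `coding_length` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the gene span is assumed to run from the start codon through the stop codon. Under those assumptions, the difference between the gene length and the coding length estimates the total intron length of this spliced gene.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_bp = span_length(1943187, 1944506)
cds_bp = coding_length(353)

print(gene_bp)           # 1320, matches the reported Gene Length
print(cds_bp)            # 1062, assuming the stop codon is counted
print(gene_bp - cds_bp)  # 258, non-coding remainder under the stated assumptions
```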


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0497269
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCTACAAG AAATCGCTCT TCCTTCTCCT GCTGAGTAAG TAATGCCTTG TTCCGTATCG 
AGCACCGCCA ATCTTCTCTT CTCAGCTTCC ATGTCCATGT GAGACAAGGA AAGATGTGTG
AGCTCGTCAC TCCTCAAGTT GGTAAGGGAG GTGTTCGCAC CGCGTATGTA ATGGTAAGGG
ATTTGTCGTA GATAGATGAA CAACGACTCA TAGTAATCTG TAACAGCCCA ACCTCGTCCC
ACCTCTCACT TCCACCGATG CTGTTTTATC CTACAAGGCC GAGCTTGAGA AGATTGATCC
CTCTGTTCAG TGGCTCATGA CCCTTTATTT GCATCCTGAC GTAACTCCCG CTGAGATCCG
CAAAGCTGCT AAAGCGGGCA TCAAAGGTCA GTCCTCAAGC TGTTGAGACT AAGCCGATGG
CGTAGGGTGG GTGCGAGGCG TATGGTCACT CGAACAAATG TGCTTATATC CTTTACTACC
ATGCAGGTGT CAAGTCTTAC CCTCGAGGAG TCACTACCAA CTCCAGTTCT GGCATCGAAG
ATTACGAAGT CTACTACCCC GTCTTTAAGG CTATGGAGGA AGAAGACATG GTCTTGAACT
TGCACGGAGA AGTGCCCAGT GATGCTGACA AGGTAAGGAC CACTCAGTTT GGAGAATATG
ATGTACGTAA GCTCACCAGG AGAACAGAAC ATCTCAATCC TCAATGCTGA GATTCATTTC
CTCGAACATC TCCGTAAACT CGCGACCGCT TTCCCCAAGC TTCGTATCGT TCTTGAGCAT
GCTACCACCT CTGCTGCGGT CGAGACTGTT GCTTCCCTCC CTTCCAACGT CGCCTGTACA
ATCACTGCTC ACCATCTGTA TCTTACAATC GATGAGGTCG CTCCCCAACC CCACCACTTC
TGCAAACCTC TTGCTAAGGA GCCCAAGGAT AGAAAGGCCC TCCAAGACGC CATCAAGAGC
GGGAACGAGA AGTTCTTCTT GGGCAGTGAT AGTGCTCCCC ACCCCTTGTC CAGTAAGGCC
CCTGCTTTAA CTGATAAGGG CGCTGTGAGC GCTTGTGCTG CCGGTGTATA CACCAGTCCC
ATTTTGATCC CGCTAGTTGC GACCCTTTTG GAAAGTTTCG GTGCTTTGGA TAAGCTTGAG
GGATTCGTCA GTGGACACGG AAGGAAATTC TATGGGGAGC CTGCCAAGGA GGGACAGGAG
TTAAGGTTGA GGAGGACGAA GGAGGATGAA GGTTTCGTTA AGGGAACATT CAGGGGTGAT
GGTGTGGAAG TCATGCCTTT TTGGGCTGGC AAGAGGCTGG GTTGGGAAAT TGCTCAATGA
 
Protein sequence
MLQEIALPSP ADFHVHVRQG KMCELVTPQV GKGGVRTAYV MPNLVPPLTS TDAVLSYKAE 
LEKIDPSVQW LMTLYLHPDV TPAEIRKAAK AGIKGVKSYP RGVTTNSSSG IEDYEVYYPV
FKAMEEEDMV LNLHGEVPSD ADKNISILNA EIHFLEHLRK LATAFPKLRI VLEHATTSAA
VETVASLPSN VACTITAHHL YLTIDEVAPQ PHHFCKPLAK EPKDRKALQD AIKSGNEKFF
LGSDSAPHPL SSKAPALTDK GAVSACAAGV YTSPILIPLV ATLLESFGAL DKLEGFVSGH
GRKFYGEPAK EGQELRLRRT KEDEGFVKGT FRGDGVEVMP FWAGKRLGWE IAQ
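
The sequence blocks above are printed in groups of ten characters separated by spaces, so they need to be joined before any computation. The following is a minimal sketch of that cleanup, together with a GC fraction calculation that could be compared against the reported GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tooling. Only the first line of the gene sequence is used here for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six groups of ten bases).
first_line = "ATGCTACAAG AAATCGCTCT TCCTTCTCCT GCTGAGTAAG TAATGCCTTG TTCCGTATCG "
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Running `clean_sequence` on the full gene sequence block should give a string whose length equals the reported gene length, which is a quick way to detect a truncated copy.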