Gene CNB04920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB04920 
Symbol 
ID3255782 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1406779 
End bp1408479 
Gene Length1701 bp 
Protein Length435 aa 
Translation table 
GC content51% 
IMG OID638255136 
ProductGTP cyclohydrolase I, putative 
Protein accessionXP_568984 
Protein GI58263148 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0302] GTP cyclohydrolase I 
TIGRFAM ID[TIGR00063] GTP cyclohydrolase I 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTATACACA CACTCCGGCC CACACAGCCG TTATTATGCC TTCCACAACT GACCCTCTCG 
CCGACTCCCG CGCATCCCCT CAGTCTCCCC GTTCTATCCC CACCGCCAAC CTCAACAATC
TCTCTCTCCT TTCCGAATCC TCTACTGGTT CTTGGGAACG CGGCAGGATG GCGGGCAACG
CTCGTTCACC GCCTACAAAT TCTATCCTTG CCGACTCGCT CGCAGCTGGT GTTGCTCCCG
TAGCGGCGGG TGTAAATGTG GTCGAAGGCG GTAACGACTT ATCCCCATCT AGTGGAGCCA
AAAGTTTGAG GTCTGGCGGA GCCTACCCTC AACCTCAACG TCCTTGGCCC GCTGACAAGG
GCCTTTCCCG TCCTAACCCT GCTACTGGTA GGGTATCCTT CCCGTCTGTT CCCCAGTATG
CTCGTCAGGC GCGTGAAGGC TACGGCTTCC GCCCCAGGTC TGGTCAGACT ACCCCTACCA
CCGGTGCGGA AGCTAGTTTG ACTTATCCTT TCCCCAGTAC CAGCTCTTCT AGAGTTCCAA
GGCCCAAAGA GGATGACGAT GATCTTGATC ATAGACCGGA TGGGAAGTCA ATGGACGAAT
TGAGGGGCGA GGTCAGGGAA GAACTGGATA AACAGGGGTT GGTGCAGGCT GCCAAAGGGG
TTGTTGAACC TCACATGGGT ATCTCTGGGA TTGCTGATGA AGAGGGTCTC GGATGGCCTG
GTAAGTATCC TAGCGTCATT TTTATGAAAC CACCGAAAAA TTGACACCCC GATTAGCCAA
GTCCACACAT CTCCGTCTGC ACTCCACCCC CGAAGAGAAG TCCGCCAACA TGGAGCTCCT
CACCTCTGCT CTTCGAACCG TGCTTGAATG CATTGGTGAA GACCCTGACC GGGAAGGATT
GCAACGTACC CCAGAAAGAT ATGCCAAAGC TTTGTTGTGG ATGACTAAGG GTTACGAGGA
GAGGCTTGTA GATGTAATCA ACGATGCAGT CTTTGCGGAA GACCACGATG AGATGGTTAT
CGTGAGGGAT ATTGAGGTAT TTAGTCTTTG CGAGCATCAC TTGGTTCCTT TCACTGGAAA
GGTATGTTAC TGTTGCGTCG GCTTCGTGGG TTATGCTAAT GCTCTGATAG ATTTCTATTG
GCTATATCCC CAGCAAGCTT GTTTTGGGTC TTTCCAAACT CGCTCGTATC GCCGAAACTT
TCTCCCGTCG TCTTCAAGTT CAAGAACGTC TCACAAAGCA AGTTGCTCTT GCTGTTGAAG
AGGCCATCCG CCCTCGGGGT GTCGCTGTTG TGATGGAGGC TTCGTAAGTG TCATTCTTAG
GTGTATACCA TATTCCATAA ACTGATTGGG AAATTATAGG CACATGTGCA TGTCTATGAG
AGGCGTCCAG AAGCCCGGCG CGACCACTGT CACCAGCACC ATGTTGGGAT GTTTCAGGCA
GCAGCAAAAG ACGCGAGAGG AGGTGAGTTT GTTAACTTTC CATCTTCTCA AACTCGAGCT
GACAAATCGT CACAGTTCTT GACCCTCATC CGTACTCCCA GCGCTGCCAG ACACTAAATC
TTATGTCCTG ATCTCGGAAC TTTCTCTCGG CCTCTACTCA TCTCTCCTCT ATTGATTAGA
TCTCAACATC AGCTTCTCAG ATAAGGAAAC TACGCAAATC TGTACATTAC TATACTCCTT
GTTATTCAAC ATACAAATAT T
 
Protein sequence
MPSTTDPLAD SRASPQSPRS IPTANLNNLS LLSESSTGSW ERGRMAGNAR SPPTNSILAD 
SLAAGVAPVA AGVNVVEGGN DLSPSSGAKS LRSGGAYPQP QRPWPADKGL SRPNPATGRV
SFPSVPQYAR QAREGYGFRP RSGQTTPTTG AEASLTYPFP STSSSRVPRP KEDDDDLDHR
PDGKSMDELR GEVREELDKQ GLVQAAKGVV EPHMGISGIA DEEGLGWPAK STHLRLHSTP
EEKSANMELL TSALRTVLEC IGEDPDREGL QRTPERYAKA LLWMTKGYEE RLVDVINDAV
FAEDHDEMVI VRDIEVFSLC EHHLVPFTGK ISIGYIPSKL VLGLSKLARI AETFSRRLQV
QERLTKQVAL AVEEAIRPRG VAVVMEASHM CMSMRGVQKP GATTVTSTML GCFRQQQKTR
EEFLTLIRTP SAARH