Gene CNN00040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00040 
Symbol 
ID3255317 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp10535 
End bp12981 
Gene Length2447 bp 
Protein Length520 aa 
Translation table 
GC content50% 
IMG OID638254419 
Productexpressed protein 
Protein accessionXP_568512 
Protein GI58262204 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCTATTCGGG CCTGGCTATT CGGGCCTGGT CCGCCATCAA GCGGCTATTC CTTATACGGG 
CAGCCATAAT TTTCCCTTCA GTACAGGATG TAGGAAATGA CCCAACTGCG GTGCCGGTGC
GCCGCCCTCT CGAACAGGGG CGATCAGTCG GCTGGCGCGA TAACGACGGA AATCTGGCTT
GGCTAGCTAT ATAAAAGTCA AGAAGCAGAG CCGGCCCTTG TCGACCCCAT GTCATCCACT
TTTCCTCTAC AACCTCCTCC GCATTTTTAT ATTTTACGCC AACAATGTCC GACCCATCGC
AAAGGATTTT AATCGTAGGC GCTGGGGTCT TTGGTCTCAG TACTGCTCTG TTCCTTGCTC
GACGAGGTGT GTTTGGCGTA TCTAAGGCGG ATGAGTCTGC ATGCTGACAT TGTATCACTT
AGGTTACAAA GACGTCACTG TCATTGATAG GATTTCATTG GAAGTCAACC AGTACCGCCC
AGATGCTGGT TGTGACGGGG CTTCCGCCGA TATCAACAAG GTCTTCCGAA CAGGCTACGG
CGTTCGCCAC ATGTGAGCTT TTAATGATCA CATGCCACTG TGGAGCTAAT GGGCCTCAAT
AGGTACGAGA AGCTCGCTTT TCAAGCTCTT CCCGTTTGGC AAGAATGGAA TGCGACTATC
AGGAACTCCT CTCCTGAGGA GTTGCCCACC GGTCTTACTC CTGACGATGA TCTGCTCGTT
CAGTGCGGGG TGATGCGTCT GGGAGAAGGC AAAGAGCTTG ACGATTACCA TGCGCAATGC
TTACAGGGAG CCATTGAGGA GGGACACCGG GACGACGTCT ATCTCATCGT GAGTCCAAAC
GTCTTTTCGT AGTTACGCCT CAGTGTGCTG ACGCGACCTC AGAATGATGC CCATGACTTG
GCTCGAGCAG CTCAGAAGGA CAAAAATTCT CCCGGGATGC ATTGGGCCTC CAAGCTTCAA
CGTTACGGAG TATTAATAGG GGGCAATATC CACGGTATGC TTGATATGGA AGGCGGTTTT
ACTCGAGCCG ATAAGGTACG TTATGTAGCA CGCTTCTTTT GCAAAACATG GTTCTACATT
GGCTGATTTC GTACCCTAGT CTTGTCTTTA TGCCCTGCAC CTTTGCAAGA AGGCCGGAGT
CAAATTCATC CTTGACCAGA AAAACGGTGC ATTCGACAAA TTCATCCATG AAGGTAAGAC
TATACTGGGT GTAGTTACCA AGGATGGCAA AGAGCATCGA GCAGACAAGA CCATTGTCGC
TGCTGGAGGG TGGACCGCTT CTATTGTTCC AGAGGTCAGC AGTCTGCTTG AAACCACCGC
CGGTTCTCTG TGTTTTATCG ATATCCCGGA AAACCGTCCC GACCTTCGCA AGAAATATAG
CCCTGCTGAA TTCTGTGGCT GGAGCTTGAA ACTAAAGGCC GACCCGGGAG ATCATGAGTA
AGTTCGAGTC ATACGTGCTT GCTTCTCCAT AAGCTGAATT TGCTTTAGGG GCTCTTACAG
GTCTATGGGT GGTTTCGTAA GTTGCATGTC CTATGCCATG TTAAAGAGAC TTTGCTAATG
CCCCTTCTAG CCTGCAGACC CCAATGGCCG ACTCAAGTTC GGCTTCCGAG CTACTAAATT
TACCAATTAC GAGACCCTTC CAAACGGGCA GTGAGTGACT GGCCATTCTA TCAATTCCAA
ATGGCGCTTA CGATCATCCT TATCCTACAG GAAAATTTCC GTCCCCCGCA CTGCCTTTAC
CGAGAAAAAT ATAGACAACA TCCCCAAGAA GGCCTTGGAT GCGATCAAGA GCAACATCAT
GGCCTTGTAT CCCGATCTCG CCGAGTTTGG GCTGACTGGT ACGAGGGTGA GTAGTTGGAA
CATGGTAGAT CTGACGATGG CTGACATCCG GCTAGATGTG CTGGTACACA GACTCCATTG
ACTGCCATTT CCTTGCTGAC TATGTGCCTG ATACGAACGA ATCCTTGTTT GTGAGTATAC
ATCTTAAGCT GTTAATGATC GGGCCATTAA CATGCTAGAT AGATCGCTAG CGGAGGCTCA
GGTCATGGAT TCAAGTTCTT GCCCATTTTT GGAGAGGTAA GTGCGATGTC CGACCATTTA
TAGATCCTGC TGATCCCTTT GCCCTTTAGA ATTTCGTCAA TCAACTCGAA AAGAAAGGAG
ATGAGTTTAC CCAGTACTGG AAGTGGCGCA GCGCAATTCC CGGAGAAAGT GCCAATGGCC
TTGAGGAAGG ACCAGATGGT CCCAGAACTT TGGATAAGGC TGCTTTGGCT ACACGGGCGG
ATTGGAAGTT TGATACGGAT AGGATAGTTG ACAAGCCTGC CACGTGCAGG GCGGATTGCA
AGTTTGATAC CGATGGGGTC GTTGACAAGC TTGCGGAGTG CACTGTGTAA AACTGGTACC
TTTGGCGGGA ACATACCAAA TTTGTAGAAT TACATCCTCG ATGAAAA
 
Protein sequence
MSDPSQRILI VGAGVFGLST ALFLARRGYK DVTVIDRISL EVNQYRPDAG CDGASADINK 
VFRTGYGVRH MYEKLAFQAL PVWQEWNATI RNSSPEELPT GLTPDDDLLV QCGVMRLGEG
KELDDYHAQC LQGAIEEGHR DDVYLINDAH DLARAAQKDK NSPGMHWASK LQRYGVLIGG
NIHGMLDMEG GFTRADKSCL YALHLCKKAG VKFILDQKNG AFDKFIHEGK TILGVVTKDG
KEHRADKTIV AAGGWTASIV PEVSSLLETT AGSLCFIDIP ENRPDLRKKY SPAEFCGWSL
KLKADPGDHE GSYRSMGGFP ADPNGRLKFG FRATKFTNYE TLPNGQKISV PRTAFTEKNI
DNIPKKALDA IKSNIMALYP DLAEFGLTGT RMCWYTDSID CHFLADYVPD TNESLFIASG
GSGHGFKFLP IFGENFVNQL EKKGDEFTQY WKWRSAIPGE SANGLEEGPD GPRTLDKAAL
ATRADWKFDT DRIVDKPATC RADCKFDTDG VVDKLAECTV