Gene CNF03550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03550 
Symbol 
ID3258364 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1049174 
End bp1051484 
Gene Length2311 bp 
Protein Length612 aa 
Translation table 
GC content50% 
IMG OID638257473 
Productprotein-vacuolar targeting-related protein, putative 
Protein accessionXP_571433 
Protein GI58268554 
COG category[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5391] Phox homology (PX) domain protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.199341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCGTCCTCG CCACCGGTAC TGCCATGTTC AACGCGCCAC GACCAATGGC CAGTTCTTAT 
AGCTATACTG ACCCTCTGTC AAATTCGGCT GCGGCTGGTG CAGCTTTCGG GGAGCTGGAT
CCCTGGAGCA GCGCCCCCAG CCCGGCGGGG AGTGTAACTC CTGCAAGGGC TACTGCATCC
GCATCTGAAG GCAGGAATAT AGCTGCGAAT GGCAATAAAG AAGAAGGGCT GAATGGGTTA
ATCAGTGAGT GTACAGGTCA TCAGCTTGCG CTTGTAGTCC GGGCGGCGAA CATTGCTGCG
AGCGAAACAC TAACGTTGGC TACTATGGCA GACGATCCGC CTGCGCTGTA CGTCTCTTTG
CTCGACCAGC TAGATACTAG TGGTACTGGG GAGGTATCCT TGGCTGCTGT CCATCGATTA
CTAGGTACTA GCAAGCTGCC TGCTGTGGTC GTTGAAAAGG TGCGTCCATG ATTCGCGCTT
TGCTTTTCGC AACTGCTGCT ACTGTTGTAC GCCAATGCTG ATCATGGACT GCCATAATCT
TGCACTGTGA TGTTTGGTAC GATTCACAGA TCATTCATCT TACATCTCGA GATAAATCAA
CTCTCACTCG ACCCGAATTC TTCTGCGCTC TGGCCCTTGT CTCACTCGCC CAATCGTCCC
CCGATCCCAA TGATATCTCA ATTGAAAAAC TCTCCTTTTC CCTTTCCAAC CTTCCTTTAC
CCAAGCTCAA ACCCTCTGAT CCTCCGTCAG TCTCATCTGG TGTGGCTGCT AGTACTGCTG
CGGCCACTGG GTTTAACGCC TGGGACGGGA CTATCAATAA AGGCACGACG TATAGTGCGA
ATAACAGCAC TTTCAGATCC ACTGATCCTA TGGTAGATAA CGCCGAGGAC AGATGGTGGA
AGGATCAAGA ACGGATAGTG GTGACTCTCA TACCTGAGAA GGAAGGGTGG TTTTTGCAAA
AATATCGGAT AGAAAGTGAT GTGGGTCCCA CTTGATTCTT TGAAAATTGT ATACTGATAT
AGAGGGCAGA AAAGAGGAGA AGGACCTGTG GCAAGGAGGT ACAGTGATTT CGTGTGGTTG
ATGGATGTTC TGGAGAAGCG ATATGTACGT CTTTTCCCTA CTTAACTGAT CGTGAGCTGA
CGGTCATCCC ATAGCCCTTC CGTATATTGC CCCCTCTTCC GCCGAAGCGC ATAAATCGTG
AGTCCATAAG CGCTTTACTT GATAAGAACT GTGTTAATTA TTTCATAGCG TCTTCTGCTT
TCCTTGAAGC TCGTCGCCTA GCTTTGATTC GCCTCTTATC TTTTCTCACT GCTCACCCCG
TCCTCCGCAC CGACGCATGC CTCAACATTT TCCTCACCTC GTCCTCATTC GAATCGTGGC
GCAAGCGCAC CCCTGTCTCC ACAGACGAAG AATCACTTTC CAAAAAGTTG ACCACGGCCC
AAGAGATGTC AATCCCTTCC GATCTCGAGC TCAAGCTTGA CAATTTGAGA GAACGATTAC
CGGCAATGTT GGGGCATTAT ACTCGATTAG TGGTGATGGC CGAAAGGAGC TTGGTGAGGT
TGCAGGTGCA AGCCGCTGAA GCCGCCAGAA TGGCGATGAG CACGCAAAGT ATTGGAGAGT
TGGTGCCCAG ATGTTGTTGG AGGAGTGTGC AAGGTGACGA CGGTGAGAGT GGGAGAGGAG
TAGCGAGGGA ATGCGGACTA TGTGAAGGAG TAGGAAGGGG GTGGGGAGAT GTTGGTGACG
GATGGGTCAG TGTTGGTGAA GAGCTCGAAA AAGGAGTGAG TAGTCCTCAA GAATTCAGTT
GCAACTTGTT ATCGCTAACG CAAATTCAAG GTGCAATTAT TACAAAAACA CATTGAATCT
CTCAAGTCAC AACGTGATCT CTATTCGTCA TTCCATGCCC TTTTCTACCG ACATAACAAA
CTGTCACTGG ACAATGTCGA TGTTCTTCGA AAAAGGGTAG ACTCCCGGTT CAGTAAGATC
GAGTCTCTCA AATCCGCAAA AAAGCCTGGG TGGGAGGGTG AAGTCGATAA GCTTGCCAGC
CAAAGCGACA GAGACACTGC AGAAATCCAA CGTCTATTAG CCAGAAGAGT GTTCGTGAGA
GCTTGTATGT GGCACGAATT GAGTGTGGTG TTCCATTCGA TGCAGGCTGC CCAGGGCACC
ATGGGTTGGA AGGACTTCGT GAAGGATCAG AAAGAAAGGA CAAAGAGATT GAACGGTGTT
TGGCAGGGGT TGGAAGAGAC TTTGGAGAGT ATGCCGTTGG AATAAGGAAT GTTTGTAGAT
GTGATGACCT TCTATGCCAT GCTGTGATAG T
 
Protein sequence
MFNAPRPMAS SYSYTDPLSN SAAAGAAFGE LDPWSSAPSP AGSVTPARAT ASASEGRNIA 
ANGNKEEGLN GLINDPPALY VSLLDQLDTS GTGEVSLAAV HRLLGTSKLP AVVVEKIIHL
TSRDKSTLTR PEFFCALALV SLAQSSPDPN DISIEKLSFS LSNLPLPKLK PSDPPSVSSG
VAASTAAATG FNAWDGTINK GTTYSANNST FRSTDPMVDN AEDRWWKDQE RIVVTLIPEK
EGWFLQKYRI ESDKRGEGPV ARRYSDFVWL MDVLEKRYPF RILPPLPPKR INPSSAFLEA
RRLALIRLLS FLTAHPVLRT DACLNIFLTS SSFESWRKRT PVSTDEESLS KKLTTAQEMS
IPSDLELKLD NLRERLPAML GHYTRLVVMA ERSLVRLQVQ AAEAARMAMS TQSIGELVPR
CCWRSVQGDD GESGRGVARE CGLCEGVGRG WGDVGDGWVS VGEELEKGVQ LLQKHIESLK
SQRDLYSSFH ALFYRHNKLS LDNVDVLRKR VDSRFSKIES LKSAKKPGWE GEVDKLASQS
DRDTAEIQRL LARRVFVRAC MWHELSVVFH SMQAAQGTMG WKDFVKDQKE RTKRLNGVWQ
GLEETLESMP LE