Gene CNE03710 details

Gene Information

Locus tag: CNE03710
Symbol:
ID: 3257686
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006687
Strand:
Start bp: 1053781
End bp: 1056589
Gene Length: 2809 bp
Protein Length: 750 aa
Translation table:
GC content: 52%
IMG OID: 638256954
Product: vacuolar protein sorting-associated protein vps27, putative
Protein accession: XP_570941
Protein GI: 58267570
COG category:
COG ID:
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
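
The reported gene length follows from the Start bp and End bp values above if the coordinates are read as 1-based and inclusive. That reading is an assumption here, although it is the only one consistent with the listed 2809 bp. A minimal sketch of the arithmetic, with `feature_length` as a hypothetical helper name:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed for CNE03710.
gene_length = feature_length(1053781, 1056589)
print(gene_length)  # 2809
```

Because the gene is listed as spliced, this span is longer than three times the 750 aa protein length.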


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCCGTCTGC TGCCAGCCTC TTGCTTCCAT ACGGCACTCC ACCCCGCCCA TATCCGCTGT 
CTCCATTGCC CGCCACCGCC CTTCTCCTAG TCCCAACAAT TTGCTTCCGC GCCGAGAGTC
CTTTGTCGCC GAGCACAAAC CCAACCGCCA TTCCTCGCTG TAATCCAACG TCGCAACCGC
CCCTCCTACA TTAATCATCC ATCATGTCAT GGCTATGGGG AAGCACTACA AACCCGCAGT
TCGAAGAGCT TGCTGGTTCG TACTTTACTG TTATGTTGTT ATCTATCGCT CACAGCCCTA
TCGCAGAGAA GGCGTGTTCT CCTCTCAACC TCCCGTACCC ACAATCAGAA GACATCGCGA
CCGCCTTGGA GGTTGCAGAT ATGATCCGCT CAAAGGCTAT ACAGCCTAAA ATGGCAATGC
AAAGCTTGAA GAAGAGGATA GCCAGTAAGA ACGGGAGAGT ACAGATGTAT GCCATCGGAG
TAAGCGATAT TTATTCGTGC ACTCGCTGAC GGATTGAAGC TAACCGACCG TGTGCAGCTT
ACAGACACCT GTATCAAGAA TGGAGGAGAT CATTTCTTGC TAGAAGTAGC TAGCAAAGAA
TTTGTGGATG AGTTGTCAAA CCTCATTAAA GCAACAGTAA GCAACAGTTA CTTTTGTCCC
ATTTGTTCAT TACTAAGCTC TATCATAGAC GACTAGCCCG GAAGTCAAGC AGATGCTTAT
CAAATACTTT CAACAATGGG CCCTTGCTTT CAAATCCAAG TCAGAGCTAT CCTTCTTTGT
GGAAGTCTAT AATGAGCTCA GGGCTTCTGG CAAGTCATCT GGTTGCCTTA TAACATGTTC
CGCCCTGACA TCCGTATAGG AATCACTTTC CCACCGCCAC CCGCTCCTGT CCCTTCTCAT
CTCTTGACAA CAACCACTGC TCCTGCATGG GTCGACTCTG ATGCGTGCAT GCGCTGCCGC
TCTGCCTTCA CATTTACTAA CCGTAAACAC CACTGTCGCA ACTGTGGTCT CGTATTCGAC
CAAGCGTGCT CGAGCCACAG CATGCCTTTA CCCAAATATG GAATCACGGA AGAGGTCCGA
GTGTGCGACG GCTGCTGGGC TAAAGCTGGG AGGAACAAGG CTGATGCTCC TGCCCCCGCT
GTCCCCGGGC GTACGCCAAG GTCTAGAGCG GATCTTGATG CTGATCTTCA ACGAGCCATC
GAACTCTCTC TCGCGGAATC TCAACATAGT CAGAACCGCC ACCACAGCCA TTTCACGCCT
TCCGAGCCTC CTCTGGCGCA TGGCACTGTT GAAGATGAGG ATGAGCAAAT GCGCCTTGCG
ATTGAGGCTT CTCTTCGTGA TATGGAGGCT CGTCCATCAG CTCCAGCGGG CCTCGGTGAA
GCGCCAGAAC CAGAATACAG ACCTCTGCCA ACATTCGACC TTTCCCCAAG AGAGAATGAG
ACGATCTTGA CGTTTAGCAA CACAATGGAT CAGATGGCGG CGTACGGTGA GCGAGATTTG
AGAAGGTTTC CACATGCACA TGTGTTGGCG GAGCAGGCTA ATACAGTGGG CGGGAGGTTG
AGGAGGAATG TGGAGGAAAA AAGCACAAAA CAACGTAAGC GCCTGTTATG CCTCAAGAGA
GTTGCGTTAA CAATACCACA GAAATGTTGA TGGAAATGCA AGATAAACTG TCCCAAGCTG
TCAACCTCTA CGGTCAAATT CTGGATGGAC AACAGGCTTA TGCTGCTAAA CGGGCGCACG
AAGAGCAAGC GAGGAGGTAT CAACAACAGC AGAGCTACTA CACCCAACAG TACCAGCCGC
AACCGCAGCT GTATGGCCAA TACCCTCCTA ACGGCTACCA AGCATTCGTG CCTCCTCAAC
AAGCCTACCA ACCCCCTCAG CCTCAGCCCG AAGCCCAAGC TCAACACGCA CCCTCCCTTT
ACCCTACGAT GCCTTATACT ACTCCCAATT TTACTTCTCC TCCCCAGGAA CGGGTTTACC
CTCAACAATC CCATTCATCG CCTTACAGCC AATGGTCTCC CGCGCCATCG CATGTTCAGC
CGGGATTAGC AAGACAAGCG TCGGTCGTTG TCCCGCCCGT TTCTTCACCC GTTCCCGCTG
GTGTGCAAAG ACAGGCTTCT ATGACTTATG GTGCACCTAT ACCTGTAGCC GAGCAATCTC
AACGACAACA GCAGCAATAT GCCTCTGCTC CTCCGTTCGC ATCTGGAGCG GCACCCGTCG
ACATACCTTC CGCTCCTCCA CCTGTTAATC TCTCCACCCA CCCCAACTCG CCCCAGCGAC
ACTCTTACAT CCCTTCCCAT CCCCAAACCC AAACCCAAAC CCAGTATGAA TCTCAACCAC
AAGAAATCCC GTCACAGCAA GATATGCAGT ATGGCGCGTC GGCTCCGCCG CCAGACTCTT
TAGGTTCGTA TGTGTCGGAA GGAACGGTAG GAAGCGCCAA GTCGGGGCTT GAACAGGAGC
ATGCGGCTTC ACAGATTCAA CCCCAACCTC AGCCCCAGGC TTCCGCACAA ACTCAAACAC
AGTCTCAATC CCAATTGCAG GCTCAACCCC AGCAAAACCA ATACGCTGCT CAAACACAGC
TCCCCGCGGG AATGTATAAC GCTGCCTCTT TCCCTCAACC GTTACCCCCA ACCATCTTCC
CAGATGCGCC CGTAGAAGCA CCCAAAGGTT TGGAAAAGGA GGAGAAAGAA GAAGCTTTGT
TGATTGAGCT TTGATCTAAT GCAAGAGGAG CGTGATGTTG TCTTCAGAGG TTGTATTATA
TTTTTATTTT TTTCCGTGGC CCTAGCAATG CATTACTCCG TGTGTAGGT
 
Protein sequence
MSWLWGSTTN PQFEELAEKA CSPLNLPYPQ SEDIATALEV ADMIRSKAIQ PKMAMQSLKK 
RIASKNGRVQ MYAIGLTDTC IKNGGDHFLL EVASKEFVDE LSNLIKATTT SPEVKQMLIK
YFQQWALAFK SKSELSFFVE VYNELRASGI TFPPPPAPVP SHLLTTTTAP AWVDSDACMR
CRSAFTFTNR KHHCRNCGLV FDQACSSHSM PLPKYGITEE VRVCDGCWAK AGRNKADAPA
PAVPGRTPRS RADLDADLQR AIELSLAESQ HSQNRHHSHF TPSEPPLAHG TVEDEDEQMR
LAIEASLRDM EARPSAPAGL GEAPEPEYRP LPTFDLSPRE NETILTFSNT MDQMAAYGER
DLRRFPHAHV LAEQANTVGG RLRRNVEEKS TKQQMLMEMQ DKLSQAVNLY GQILDGQQAY
AAKRAHEEQA RRYQQQQSYY TQQYQPQPQL YGQYPPNGYQ AFVPPQQAYQ PPQPQPEAQA
QHAPSLYPTM PYTTPNFTSP PQERVYPQQS HSSPYSQWSP APSHVQPGLA RQASVVVPPV
SSPVPAGVQR QASMTYGAPI PVAEQSQRQQ QQYASAPPFA SGAAPVDIPS APPPVNLSTH
PNSPQRHSYI PSHPQTQTQT QYESQPQEIP SQQDMQYGAS APPPDSLGSY VSEGTVGSAK
SGLEQEHAAS QIQPQPQPQA SAQTQTQSQS QLQAQPQQNQ YAAQTQLPAG MYNAASFPQP
LPPTIFPDAP VEAPKGLEKE EKEEALLIEL
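
Both sequences above are printed in space-separated groups of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first two lines of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two lines of the gene sequence, used only as a short sample.
sample_block = """
CGCCGTCTGC TGCCAGCCTC TTGCTTCCAT ACGGCACTCC ACCCCGCCCA TATCCGCTGT
CTCCATTGCC CGCCACCGCC CTTCTCCTAG TCCCAACAAT TTGCTTCCGC GCCGAGAGTC
"""

sample = clean_sequence(sample_block)
print(len(sample))                          # number of bases in the sample
print(round(100 * gc_fraction(sample)))     # GC percentage of the sample only
```

Running the same two functions on the complete gene sequence gives values that can be compared with the Gene Length and GC content fields in the record.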