Gene CNF00620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF00620 
Symbol 
ID3258303 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp206401 
End bp209930 
Gene Length3530 bp 
Protein Length655 aa 
Translation table 
GC content47% 
IMG OID638257186 
Productexpressed protein 
Protein accessionXP_571240 
Protein GI58268168 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.170983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTCTTTCCG ACTTCAACAA AGCATTTTCC CACGCTGAGC GAGTCTAGCA TTCGCCTTTG 
GCAGGCAGAA TACGCACAGG ATGTCACATC CAGAGAATAA TGGAGTTGGA AGTTCCGGCC
TTCCCAATGG GGCAGACCCG TCTCGTCCAG CTCCGAAACG CAAGTCCAGG TCAACCAGAG
GCCTACTCAA GTAGGTTTAT GGAAATAAGA AGATTTCCAT CTATCATTTA ACAAATAGGC
GGCTTACGGT TTCTGGGCCG ACTCAGCAAA GTTGTCGCAC TTGAAGATCA GATGCAGCAC
GTCCAATCCA ACCTTGAGAA TGTTGTATCT CTGCTGTTAC GTCTGGCTCC GACGGCTCTA
CCTGCCGGCT CTCAGCCATC GACCAATTCG ATTAGGCATG ACAGTATGTC TAATTAAGTG
CTAAGCGTGG TCCCACTTGA TGACTGATCC TGATTTATAG TCGCGCAACC GGGTTCTACT
GGCTCTCATA ACCGTGTCCA GACACCTTTA TCCGGGTTCT GCGAAAAATA TGAGCAGAGA
CCTGTATCGT CCCTTGAACA ATGGGTACAG CAAATAGCTG GAGATGTTGC ATCAACTACA
GCCGATCTTC AACAGCAATC TGTTCCTCGA GACCGAACAG CTGGTGGGAG CGTTGCAGAA
GGATCGGAAA TGTCCTCAGA AGACGAGAGA GACCACACAG TGGAGGAAGA AACGAAAGAG
GGGCTCTCTT TTATGCGGGA TATGTTACGA AGCGAGGAAA AGAAGAGGCT ACATGCGGAT
GGTCACTGGG ACTCACCGGG GGAACCAGGT TTATCGAGCC ATCGAGATCA AGAGCACATA
GCAGCCGATG ACCGAGGGCA CAGGGCTGAA CCAGACGGTC GCTTGAGAAA TGCACGCCTC
AAACGGAAGC GAGGCGCAAA TGATTGGGAA CCTCCCGAAT GGTTACCGGC CGGAATGCGT
GATCCGATCG ATCTTGGCTT CTGCAGCGAA GCACAAGGGA GAAATCTTTT TGATATGTGA
GCTTGGTCTT CTTTTCGCAA CCTATTGTCG TCTTGGTCCT TTAGAAGTAT AGCAGCATGC
TTACATGCCA GTACAGGTTC TTCCAAGGGG CACATTCCTT CATGCCTGTC TTTGATCCCT
CTAGAGACTC TTGGGAAAGG TATGCACCGA TGACACTTTT CGTCTGAGCA GTGACGCTGA
AGCAAAATTG TATGATCCTT TCAGTCTGCG CCATCGGTCG CCATTCTGTA TATCCGCGAT
TCTTTTCGTG GGACAGAAAA TCAAAGATGC TGGTCATGAG CCTAGCAACT TGCAACGGCT
ACTGAGGGAG CATGCGGAGA GTATAGGTAA ATCACGATTT CATCAACGCG TCGTATAGAA
GATGGGCCCT CTTTTATCTG ACGCGTCTTT GCAGGAAAGA CCACATTGTT CTGTCCCATA
GCTGATACGG AAGCTCTCCA AGCCATGAGT GAGTCGCATG GCATATTCAA AACGTTGATC
AACCATGACT GATACTGACC CGGTTTGTCA AAAGTCATTT TGGCCTCGTG GGGAGATACT
GGCTGGCGTC CTGGTAGTCA TGCAGTTAGT ATGGCGTGAG TGACATTCAT TTCCTTATAC
ATCTCTTCCA GCATCTCACT TGGTTTGGGC GTGTTTAACA GAATTGATAT GGAGCTTTAT
AAATGTCTGC CAAGACTTGC TGAGAGATTG CGTACCCCTT CCGCGTTCAC GAGCGATCGA
GATGAAGATA CTGAACGGCG GCTGGTGATA GGATCGAGGC TATGGCTCTT GGTTTGTAAA
ATGGCCATCG AGTGAGTTTT GACACGTTCC AGAAGTAAAC TTATGAGGAA CTCATGTCTG
CCAACTTTTA GGATGGCGTA TAATTATGGT CGTCCAATGC TCATCGACGA GACGTTGATG
ATTCCTCATT TGCACGCGCT GTTGAAGCAT CCTGCGCATT TGCCGACCGA TGGTCGAATT
GTTGCTTTTT GCGAATTACT TTATTTGAGG TGTGAGTCTC CATCTCCGAC GATGCTAGAT
AAAGATCATG ATTAACCATT TTATGGATGA CCAGTAACAT TGCACAGAAA TGCCATAAGT
GGTAATCACC AAGAATTGGA TCAACTTTTG CAAGTCTACA ATGACGAAGC GCTTGGCTGG
GAAGAGAGAT GGAGAAATTA CTATAGTTGG TAAAATCCGA TGTTGGAAGT CGATTGTACT
GATGGCAATG CGCACAAGTC CAACAAGGAG TTTCCTTGGA TAATATTTTG GTAACGGATC
GTAAGTACTT TGATATCTCT GAGCTTTGAT TGACGTTCTG AACGATGCAT ATCCCTCTAG
TGACAACGCA GAGGTGTTTC GGCTCGTGAG TCGTTTTTCT CAGCGAGTTT ACACCAACAG
AGTTGGGATT GATCGCAGAA ACTTGGCCTT GTCTTATAGT GTTCTGGCAA ACTCGTGTCT
GTTGCGAGAT ATTCGAAGTC CACATGACAT CGAAAGACTC TCCCCTCATC GGAGACGTTT
GCTTTTAGCA AGTCTTGACG ATGCACGATT GATAGCCAGC AGGATCATTA CGACAGAAGT
GAGTAGTCTT AGCCGTTGTA TCCCATTGGG GCTGTAGGCG CACATACAGT TGATTATTCG
CTGACTTGCG AGATTTAATC AGAAAGACAA ACTTTTGCAC GCTAATCATT ATTCACATGT
TGCCCTGGCT AGCGTAACGA GGATCTATAT ACGTCTAGCT ACCCTTTTCC CAGGATCAGT
CGATCTAAGG AAAGTAGCAA AAGACCTGTC TCAATTGACC GAGGTACTGG CACGATGTAA
GTCTTTTGAT TACTCTTCCA GTCGTCATGA TTGGAAAGCT TTGGTCTGGC GCTGCGCCGT
TTGCTGAATT TTTTTTAGTC CCGGGTTTCC ACTTTGCTCA ACAACTGCGC TATGCCATCA
GCAAAGCGAG GCAAAAGAAG ATCTTACCTC CGGAAACGAG ACCGACTTCA CCAAGATCAC
TTTTCAACCA AAGTGTCGAG TGTACACGTC AAAATCAACC GAAACCTAGA TCAAGCTCTG
TGCCTTCAAT GTCGTGGTTC AATCACTTCC CGCCTTCAGA ACCCCCACCA CACCAGCACC
ACGGCGAGAG CGCGCCCGAG CGACCAGGTA GCACCGGTAA TACGATGACC AGTGGCAGTC
CGGTGGAATT CGATCCATTC CTAGCTGAAC ATGTGTTGAA CGAAACTTTC AATCAAGTTC
GGCCTCTTGA TGGTATTTCG GAGTGGAATC CTCAACCTGG TGAAACAGGG AATTGGAATC
AATCCTCGCA GGGAGGAAAC CAGGCTGAAT GGTTACCGAG TCAACCTTCC ATTGGGACTG
CCATACCAGA TGCCCACAAT AGTTTCTTGT CATGGCTGGA ATTCCCTGTT CTTGGTAAGT
AGCGTCTTGC GTCCACAAGG GTTATTTAAA TCACGATGAA GCTGACACGA GAACCATACC
TTAGATTTCA GTGATTTTGG TACCCAGTAA ATATATCATG TTATATTTTA
 
Protein sequence
MSHPENNGVG SSGLPNGADP SRPAPKRKSR STRGLLNKVV ALEDQMQHVQ SNLENVVSLL 
LRLAPTALPA GSQPSTNSIR HDIAQPGSTG SHNRVQTPLS GFCEKYEQRP VSSLEQWVQQ
IAGDVASTTA DLQQQSVPRD RTAGGSVAEG SEMSSEDERD HTVEEETKEG LSFMRDMLRS
EEKKRLHADG HWDSPGEPGL SSHRDQEHIA ADDRGHRAEP DGRLRNARLK RKRGANDWEP
PEWLPAGMRD PIDLGFCSEA QGRNLFDMFF QGAHSFMPVF DPSRDSWESL RHRSPFCISA
ILFVGQKIKD AGHEPSNLQR LLREHAESIG KTTLFCPIAD TEALQAMIIL ASWGDTGWRP
GSHAVIGIDR RNLALSYSVL ANSCLLRDIR SPHDIERLSP HRRRLLLASL DDARLIASRI
ITTEKDKLLH ANHYSHVALA SVTRIYIRLA TLFPGSVDLR KVAKDLSQLT EVLARFPGFH
FAQQLRYAIS KARQKKILPP ETRPTSPRSL FNQSVECTRQ NQPKPRSSSV PSMSWFNHFP
PSEPPPHQHH GESAPERPGS TGNTMTSGSP VEFDPFLAEH VLNETFNQVR PLDGISEWNP
QPGETGNWNQ SSQGGNQAEW LPSQPSIGTA IPDAHNSFLS WLEFPVLDFS DFGTQ