Gene CNH02700 details

Gene Information

Locus tag: CNH02700
Symbol:
ID: 3258982
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 362341
End bp: 365431
Gene Length: 3091 bp
Protein Length: 796 aa
Translation table:
GC content: 48%
IMG OID: 638258215
Product: conserved hypothetical protein
Protein accession: XP_572443
Protein GI: 58270574
COG category:
COG ID:
TIGRFAM ID:
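
The gene length in this record can be cross-checked against its start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values listed here but is not stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting the stop codon."""
    return protein_length_aa * 3 + (3 if include_stop else 0)


# Values taken from the record above.
length = gene_length(362341, 365431)   # 3091
coding = coding_bases(796)             # 2391
print(length, coding, length - coding)  # 3091 2391 700
```

Under these assumptions, roughly 700 bp of the 3091 bp span do not encode protein, which fits a record marked as spliced; this arithmetic alone does not say how that remainder divides between introns and untranslated flanking sequence.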


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.738564
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATACCGTCCA TGTCCACAAC AATACAGCAA GGCGACACAG ACACAGGTGC AGGGAACCAT 
CTCCCAGCAT GGCTCCTCGC CATGTCTGGA ATATTCACAG CAGTTGGTCC GTAACTTCCT
CCTCGTCGTT CCCAGCAGGA CATCTTGTCA CTCACCCGCA ACGTCGCTGT AGCTACTGCG
ATATCTATGA TGTCCATCGT TCTCCAATTG AAGAATTATC GAAAGCCGAC ATTGCAGAGG
GCAGTAGTGA GAATCATGGT GATGTTCGTC AGACCCAAAA TATTCATGAA GTTCGCTAAC
AAATTCCGTC TGTCACAGGG TTCCCCTCTA TGCCATATCT TCTCTCATAG CTTTGTTCTC
ACTTGAGGCT GCATTCTTCA TTGACGCCAT TCGTGATCTA TACGAGGTAT ATGCTTTCTC
TATCTGTGGC GACTAGAGAT AGCTTACACA GCTTTAGGCG TTTGTGATTT ACACCTTTCT
TCAACTCCTC ATTACGTATC TTGGTGGTGA ACGCTCTCTC CTTATAATAC TCCATGGACG
CCCACCAATA CCTCATCCAT TTCCTGTAAA CATCTTTCTT CAGCCAATGG ATGTTAGTGA
TCCATGGGTA CTTTTGAACC TGAAACGGGG TGTGCTTCGT AAGTCATATT CATTGTCACA
ATTACAGTCG CAATCAATCG CCCATCGATT TAGGCTGGAC TAGCAAACTA AACAACAAGC
TTATACATAT AACGCTTTGT AGAATATGTG CAAGTGAAGC CTTTGTTAGT GCTTGCCACT
GTAGCTCTCA AAGCCACTGG AACCTACCAA GAAGGCAGAT TCGCCGCTGA TTCAGGATAC
ACGTATGTCA GTATTGCATA CAATACCAGT ATCTGTCTGA GCCTCTAGTG AGTTTTTCCT
TTTAAGCCTA ATGCTCCAAA AGCCAAAATT AGGGTGGATG GCGGGGGGAT TAGAAATCAG
TTAGGCTGAC CACAGTGGCC GGCACAATAG TTGTTTAGCT ATGTTCTGGG TTGCGGTAAA
CAAAGACTTG AAACCCTTCA GACCAGTTCG TAAGTAGTGT TTATTTCCCC CCAAGGCTTC
ATGACCCTTA GAATTAAGTC TCTTCTGTAG CCAAATTCCT CTGTGTCAAA GGAATTCTGT
TCTTTTCTTT TTGGCAAAGT ATCGGTATAT CCTTACTCGT AGCCATGGGC GCCATCAGAA
AAGGTTAGTC CCAGGCTAAG ATATGAGCTC AATGATCGGC CAAGCTAATT TTGATGAGGA
TAGTTGGGCC GTATACGGAT CCTGAACACA TGTCTCTTGC TCTTGTGGAC TCTTTGATCT
GTTTCGAGAT GCCAATCTTC GCTATCGCTC ATGTAAGTCC TTTCAAGCCA CCATTCCCTT
TTTATTCCCT CCCTAATCTT GATGTTCAAA ACGCATCCAG CAATACGCAT TCCAAGCCAG
CGATTATATT GACCACAACC TCGTCTATGC GGCCCGTCTT CCATTCATCT ATGCTTTCCG
CGATGCCTTT GGGTTCAAAG ACGTCTGGCA AGACACGATC GACACATTCA AAGGCCGCGG
CGTATCATAC CAAGCCTACG AACCTGCTGA GGGGGGTCTG CATTATGGCG TTGGTCGACA
AAAGAGGATA AGAGCGGGAT TGAGGTACGC GAAAGGCGGG AAGATGAAGT ATTGGATGCC
AAAGCCAGGG GATGAGGCGA GGATGAAGGG ACAGAGTGGG CTGATTACGA GTATGAAACG
GAGAGTGGAT GAGAGGTTGG CAGAGAGAGA AGGGTATGCA CCATTACTCC CTCAACAGGC
GGCGAGGGTT GTGCATCTTG ATCCCGGGCG CTATACCACC TATACAGGGG GGGAACGGAT
GTTCGACAGT GATTCTTCGG ACGATTCGGA TGCCCCCAGC CTCACTTTCC ACTCTGCAGA
CGAGATGGAA GATGCAATGT ACGACCGAGC AAGGAGGATA GGGTATGCGG GGTTCCCGAA
TGTGGATGTG AGTAAAGAGG AAGCGAGAAG GAAAAGGAGG GAGGAGGAGG AAGGCATTTT
GAAGGGGAGA TGGACGAGGA GTGGGTTGAG GGTAAGAGAT GGGACTGGAC TTGGCGGTGG
GGAGGGGCAG GGGGAAGGGA GAATGAGGGC GGGTTCAGCA GCCAAGGGTA AGAGTAATGG
TAAAAATAAA GATAAAGGGA AGGGGAAAGG GCAAAGCAGG AAAGTGTACG GAACATGTAA
GTCTGACATT ATTGTCATTT GCTGGGTCGA AAAGGGTTGA CAAAGGCGGT ACAGGGGCAG
ATCACCCGCC ATCCGTACAA CGCTTCAACT CTTCATCATC ATCAACTCGC AACGGAAACG
GAAATGGACA TCCGGCAGAC TTTGATGAGG CTGATGATGA AGGCGTCGTG CGGCAGGAGG
AGGAGGGAAG GGCTATTAAA AAAAAGAGAC AGAGACAAAG TCCACCTGTT TGGGTTACAC
GCACGGGCGC GACAACGGGT GTAGGCTCCT CTTCGTCCCC TTTTTCCATT GGGGATGATG
ATGATGATGA CGATGAGGAC GGAGAGGGTG TCAAGAACAG AGAGACTCAA GGCGACAGAT
CTATGAGCTT CAATCACCCT GATAACTCCA GCCACAACTC TGACTCAGAT TCTGAACATT
CAGTTAAGCC GTCAACCACA AAAAAATTAC CCGCCGACGC GGTCGATCTG GTCAAGGAAG
ATTACGAAGC CGTTGAAGCC GCTCGGGAAC GCGAACGTAG ACGAGGAGAA CCGCAGACGA
ATGCCCCTGC TCATGTGTAT CGCAAAACCA TCATTGACGA GATACCAGAG ACGGATACGA
ATGCAGGTGC AATGAAGAGG AGGAGAGGAA TAGGAGGAAG AATTGAGGAA ATACAGGAAG
TGTATGCGCA TGATCCTCAT GTGATGACAA AGGATGAGAT ACAGACAGGC GTTAAGGAAG
TGGTGGATCA CGTGGAGACA AGTTTGACTG TCGAGCCTCC AAAGCACGCG ATGAGTCTGG
ATTTAGAAGA TAACCCGTGG TCGTGAGGTT GCTTGGGCTT GTGTGATGGG GTGAATAGAC
AAGATCCTTA GATGTCCATT TATTGTTTCT T
 
Protein sequence
MSTTIQQGDT DTGAGNHLPA WLLAMSGIFT AVATAISMMS IVLQLKNYRK PTLQRAVVRI 
MVMVPLYAIS SLIALFSLEA AFFIDAIRDL YEAFVIYTFL QLLITYLGGE RSLLIILHGR
PPIPHPFPVN IFLQPMDVSD PWVLLNLKRG VLQYVQVKPL LVLATVALKA TGTYQEGRFA
ADSGYTYVSI AYNTSICLSL YCLAMFWVAV NKDLKPFRPV PKFLCVKGIL FFSFWQSIGI
SLLVAMGAIR KVGPYTDPEH MSLALVDSLI CFEMPIFAIA HQYAFQASDY IDHNLVYAAR
LPFIYAFRDA FGFKDVWQDT IDTFKGRGVS YQAYEPAEGG LHYGVGRQKR IRAGLRYAKG
GKMKYWMPKP GDEARMKGQS GLITSMKRRV DERLAEREGY APLLPQQAAR VVHLDPGRYT
TYTGGERMFD SDSSDDSDAP SLTFHSADEM EDAMYDRARR IGYAGFPNVD VSKEEARRKR
REEEEGILKG RWTRSGLRVR DGTGLGGGEG QGEGRMRAGS AAKGKSNGKN KDKGKGKGQS
RKVYGTWADH PPSVQRFNSS SSSTRNGNGN GHPADFDEAD DEGVVRQEEE GRAIKKKRQR
QSPPVWVTRT GATTGVGSSS SPFSIGDDDD DDDEDGEGVK NRETQGDRSM SFNHPDNSSH
NSDSDSEHSV KPSTTKKLPA DAVDLVKEDY EAVEAARERE RRRGEPQTNA PAHVYRKTII
DEIPETDTNA GAMKRRRGIG GRIEEIQEVY AHDPHVMTKD EIQTGVKEVV DHVETSLTVE
PPKHAMSLDL EDNPWS