Gene CNH02340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH02340 
Symbol 
ID3259322 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp473075 
End bp476060 
Gene Length2986 bp 
Protein Length809 aa 
Translation table 
GC content58% 
IMG OID638258251 
Producthypothetical protein 
Protein accessionXP_572409 
Protein GI58270506 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.501661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCTC CAGCTCCCCA AGGCGACCAG CGGGATCTCC CCCCTCCATG GTAAGTTGTC 
CTCCAACAGC GCAGCTCACT CCCCAAGGAT CAGACAGTTC GATCCCAGCT ACCAGACATA
CTTTTACGTC AACCCGACCA CAAACCCGCC CACCACTTCC TGGACCCACC CCGGCCTTGC
AGAGGGACAA GTCCATCCGG AGCAGGCACA GGCCATCCAC GAGGCTGGGC AGACAGGGGG
CGACAACCAG GGAGAGGCTG CAAAGTTCCT GAACTCGGGA AGCGTGGCCG ATCCTGCGAC
CGGGAGCTAC AACCAGCCAG GCGAGATCCA GGGTGCTGGC GGGCAGACTC CCGAGGCTGG
CGAGCGGGGA TTGGGCAGCA TGGTGAGCGG CCTGATGGGC AAGACGAACA ACAACAACGT
ATGTGTCTAC GATCCATGGA CGGGTGCGCA GAGCTCATGT ACGTCTCCGC AGCAATATGG
CTACAACCAG CAGCAGTACC CGCAGCAGTA CCCGCAACAG CAGCAGCAGC AATCTGGCGG
TAGCAAGTTT GGGTTCGGGA CGGGGATGGT CGCGGGCGGC GCTGCGTTGC TCGCCGGAAA
GCTGATCTCC AACGTCGTCG GCGGGGTAAG TCGGGCGGTG TGATTGACGC TGGAGTCGGA
GCTGACGAAT GCGACGTTGA CGTGCAGCGC CACAACTCAT CGGGCGGCGG CATGTTCGGC
GGGGGAGGAG GCTACAGCCA CAACATGGGT CCCCCTCCCT TCATGGGCGG TGGGCACCAT
GGCGGGCACG GCGGGCACCA CGGCGGCGGC GGGATGTTTG GCGGCGGCGG CGGCGGCCCC
GGTGGCTTTG GCGGTGGTCC CGGGCGATGG TGAGCTGTAG CATAGTATGG TAGATGCATA
TGCATGTAGC GAATTCGTAG ATACCTGGCG CATGTATCCC GTCGTCGCTC CCGCTGTCTC
CGACGCCATC ATCCACGCCG CCGTCGACGT CTGTTCCGCG TTCCGCATCA TCTCTGTTTC
CATCTACTCT TTATACTGCC ATAGACTCAT TCATTCATTT GTCCATACAT TCACTCCCAC
TCAGCAGCCA CCACCCACCC TCGGCCGCCA TGACAGAATC CGCAGGCCAC TCCAGTCCAC
CCATGTCCCC CGGCACCCAG CCTGCCGCCC AGCCGCCCGT CACCCTCAAA CTCAAACACT
CGATGGCCGA CCGCGCCAAC TACTCTTCGT CTGAAGAGGA AGAGGAAGGA GAAGAAGAAC
AACTGGCAGA AGACAGACTT TCTACATCTC CACCACCTAA AAGGAAAAAG CTCTCTTCCA
CACCTACCAA TTCATCCGCA GCGTCAAAGG GAAAACAGAG CATAAAACTT ACTCTTGGAC
CTCAACATGC CCATCTACAA CAGCCTTCCT CGTCCTCATC GGCTTCTGCG GGCTCTGCGG
CGAATCAGGG CAAGAAAAGT TATGACTGGC TACAGCCCTC TGCAGCTGGT GCTTCGCACA
GCGGACCTCC CGAACGTGAG CGTGAACGCG AGCGGAGCGC ACTCTCACCT GGTGACCTCC
CCAGCGCACT CTCACCGGGA GACCTTCCCA TCCCATCTGT CTCTGTCGCC TCTGCCGCCT
CGGGCTCGTC GACGAAAAGC CTAGGCATGT CGCCCGCTGA AGAAGCGATT GGTGGTCTCC
TCGATGAGTC TGTTGATGAC AACACCAGCA CCAACGACAA CGGTGATCCG TCTACTCTGA
AAAAGGAAAA TCAAAGTGCA CCGAAAGCGA AGCGGAGCCA TCATAAGAAG AAGGCATCTG
ATGCACCCCC CGGGCCTGGA AGGAATTGGA AGAAGGGCAT GAAAAAGTGG GTCTATACAT
CATCCCCCCT CTGTTTCTCC CTCCCCAATC CCCTCGGTCT GCCATCGTGC AAAGGAAAAT
ATTCAAGAGC TGACACCATT ATCCTGTTGG TATAGGGCCG CACCAGGAGC ACCAGGGGTC
AAGCTTGAGA ATGAGGGCAC CCCGGCAAGT ACGCCTGCGT TTTCAGCCAT CAGCCGTGAA
ACCTCGCCGG ATCCTCTCGG TGAGTTTTCT CCTCTTCAAA ACGCACGCCA AAAGTTGCAT
TTTACTAAAC GAAAATGTCG CAAAAGGCCT TCCCTCCCCA CGCCTCGCCC CCGACACCCA
AAGTATGACT ATAACTCCGG CTCCCGTTCC ACTCCCTTCA GCTTCCTGCC CCCCGTCCCC
GCCCTTCATC CCGGCCGACC CCACCACCCT CGGCTTCCCC GTTTTCTCCC ACCCCATCGT
CCCTCCCAAA ATCCATCTCG GCACATTCCC AAAAGTCACT TCCTTTTTCG CACCCATCAA
CGGAGGCGAT TCCGGGCCCT TTCCGAGAAA AGAAAAAGTT AGGAGCTGGA CGTTTCAGGA
AAAGGGGATT GTAGGTGTTG GCGGGGGTGT GATGAAGTAT AAATCTTGGG CAAGAGGTCT
GTTTCCTCAT TTTATCTTCC GCTGTCGATT GACCACACGT TTACTAATTT GACCATATGC
AGGCCCAACA TCTGAACTCG AACGAGCACT TCAAGAAGAA AAAGACGCGC AGACGCCACA
ACGGCAACCG AAAGCAGCCA AAGGCACCAA CGCAACATCC ACCCCTGCAC CTCAGACCGG
TGCTGACCCC ACTGCATCTG CATCTGCATC CTCCACCCCC GCCCCTCCCA ACGCTGCAAA
CGCAGCAGAT ATCACCACCG TCAATGACGA CCGCCCGAAC GTGAGTAGGG CAGACTCGTT
TGATATGAGT ATGAATGCCA GTCCTGGTCC TCCGGGCGAT GATGAGAGTG AGAATGGAAG
CGAGATTGCG GGTCCGACGA GTACACCCCC TGCAGGTGGG AAGAAGAAGA TGGGAAGCGC
TCCTGCGAAG AAGAAGGGGA AGACGCCAAA GTCGAAACTG GCACAGGAAA TTGTCATCAG
GGAGGATAAT GAAGGCGCAC CGGTTGAACA GGCGATTGCA GAGTAA
 
Protein sequence
MASPAPQGDQ RDLPPPWIRQ FDPSYQTYFY VNPTTNPPTT SWTHPGLAEG QVHPEQAQAI 
HEAGQTGGDN QGEAAKFLNS GSVADPATGS YNQPGEIQGA GGQTPEAGER GLGSMVSGLM
GKTNNNNVCV YDPWTGAQSS CTSPQQYGYN QQQYPQQYPQ QQQQQSGGSK FGFGTGMVAG
GAALLAGKLI SNVVGGRHNS SGGGMFGGGG GYSHNMGPPP FMGGGHHGGH GGHHGGGGMF
GGGGGGPGGF GGGPGRCSHH PPSAAMTESA GHSSPPMSPG TQPAAQPPVT LKLKHSMADR
ANYSSSEEEE EGEEEQLAED RLSTSPPPKR KKLSSTPTNS SAASKGKQSI KLTLGPQHAH
LQQPSSSSSA SAGSAANQGK KSYDWLQPSA AGASHSGPPE RERERERSAL SPGDLPSALS
PGDLPIPSVS VASAASGSST KSLGMSPAEE AIGGLLDESV DDNTSTNDNG DPSTLKKENQ
SAPKAKRSHH KKKASDAPPG PGRNWKKGMK KAAPGAPGVK LENEGTPAST PAFSAISRET
SPDPLGLPSP RLAPDTQSMT ITPAPVPLPS ASCPPSPPFI PADPTTLGFP VFSHPIVPPK
IHLGTFPKVT SFFAPINGGD SGPFPRKEKV RSWTFQEKGI VGVGGGVMKY KSWARGPTSE
LERALQEEKD AQTPQRQPKA AKGTNATSTP APQTGADPTA SASASSTPAP PNAANAADIT
TVNDDRPNVS RADSFDMSMN ASPGPPGDDE SENGSEIAGP TSTPPAGGKK KMGSAPAKKK
GKTPKSKLAQ EIVIREDNEG APVEQAIAE