Gene CNB03820 details


Gene Information

Locus tag: CNB03820
Symbol:
ID: 3256077
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 1134122
End bp: 1137296
Gene Length: 3175 bp
Protein Length: 737 aa
Translation table:
GC content: 49%
IMG OID: 638255029
Product: expressed protein
Protein accession: XP_569028
Protein GI: 58263236
COG category:
COG ID:
TIGRFAM ID:
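
The Gene Length value agrees with the Start bp and End bp fields if both coordinates are read as 1-based and inclusive. The sketch below shows that arithmetic. The function name `gene_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record, not something the page states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # Start bp and End bp as listed in this record.
    print(gene_length(1134122, 1137296))  # 3175
```

The same record lists a protein of 737 aa, which needs 737 x 3 = 2211 coding bases (2214 with a stop codon). That is fewer than 3175 bp, which is consistent with the "Is gene spliced: Yes" field.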


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAAAGTCCT AGTTTTGACA CATACAAGTC GCCAAAGGAC GCGTGGCAGA AGGAAGAAGA 
GTGTGGCGCG GTCAACAACA GATCAGAGGT GAGCATCGTC TTAGAATTTA CGACACGCCG
TTCTTAGATG TTACGTTTTC TTTTTTTACT AGCCAATCTT CTTCATCCTC ATCTGTTCCA
TGGCATTCCG GTAGAGGCGG CTTCCTACCA TAGCACAAAG CAGGCTCTCA TATAGGGTGC
GGACGCCTGA AGGTCACGGC GAGGAAGTCG CAGCAAAGTT GGTTGATTCC TCAAAAGAAA
AGCATGATCA AGAAGGAAAG CCTGACGTGA TTAGTACAGA GCAAAAAAGC ACCCAATACT
GCAAAGAATC CAATAATCCC GAAAGTCCTT TTGTCTGTCC CGATTATGCT TCTGGAAAGC
GATTACTCAG AGATGGGAAG GGAAGGACAC CGTTCTCCCG AAAAACATCG ATCTGAAGAC
AGGCGTCCCT CGGGCGAGGA GCATCTAGTC TCTAGGTATA ATAATCCTCC TATTTCTGAA
CAAACAAGGA ATTCCCCTTC CCGAGAAGCC AGTACTACTG GCCACGAAGC CCAACCTCTG
GCCCCTTCAT CAGCTGAGCA AAATCTGCCC ACCCATTATG TTGTGCGCCC CATACAGACA
CCCGACGCTC ATCAGCAATC TGGTTCTTCT GATAGAATTG AAACGGCAAC AGAAGTGAGA
CAATCCGAAC CCTCCGCTCA TCAGCAGCCC ATAGAGGGAT TGCTTTTACT TCGTCAACAA
CTTCCTATAA GCCCTGCCAG GAAACGATCC AGAAGCCCTC CCCCTTTTGG CCGTATTGAT
AAATTCGGGC GATCGCGCAG CAGCTCTCCA AGTGGATCAA AATCCGGTGA AGATAAGGCA
AGGAGTCGTC CCAATACAGG TGATGAATTG AAAAGCAGTA TGCGCGGGAG GGAAATGCAG
TCTAGATCAC CACACAATGT AACAGACTTT TTGACTGAAC CTCCTTCTTC AGCACTTCGC
TCTACTGTTC ACAGCTCATT ATCTTTTTTC GAGAGAGAGC TTGTCGACGT GAAATCACCA
TGTTTACCAG CCATATCTGA CTGGAAGCCG ACGGAGACAC CTCCAAGTGG GCTAATAAAG
CCCACGCTTT CTTCTTTGGT GTCCGATTTT CGTAGTCTGT AAGTGACGAA TATTGGACGT
ATTTGATTGT TAAGGATGAC TTTCTAACAT CACATGGTGT CAGCCCACCA CCTTTTCCCA
GCGCCCCTTT ATCATCAGCC CGTTCATTCT TTGGTCAGTC ACGAGCCGGT CCCTCCTCTC
AATCACAAGA ATCGTCTAAT CCACGGCGAA GGAGTGAAGA CGTTCATCAC CATCTTCCTA
GCCTTGAACT GCCCTTGCAA CCTCTTGTTG AAGAGCAGAC GCTTCAACCG CCTAGACCTC
CGCTCGATAG ACAGCAACAT CTTGATATAC CAAGCCCTTC TCTATCTGAT CCTGCAAAGG
GTCCTGGAGA ACAGCCTCCC TCTGCCGTCA AAGGGAGTCG TGCAGGTAGG CCACATGATA
GTTTCAATAC TATCGGGAGT AACGATGGGC TGACCGTGTA CCGACAGAAA GGACATCTTC
TACCAAAGGG ATGAGAGAAA AGGGTGGTAG ATTTGAACAA GGTGGGAGTG GCTCAGGGAA
GAAGAAGGCG AAGCATCAAT CGGCTGAGTT GCTTTCTCCA AAGGGTAGTG ATACTAAAAA
TGGTGAGCGG CTTGTCTTTG AGAAAGTCAG ACGAATCAGA GGATGACTGA TGAAACTGGA
TTGTAAAGTG GCAGATGCTC CAGGGCCGAG TGTTAATGGT AAGTTGATGA ATGAGCGCCA
GGTCACGCCT TGTGGTTCAC AATGTCCGTA CTGATGAACC TCTTTTAGCG AGGCTTTCCA
CCGAGAGCAC CGGTCATGAA AGTGCATCGG GAAGGGCCTT CGATATACAT TGGACGCCTG
ATAACGTGAA AGCATATTGG ATGGGTTATG ATTCTGCTAT GCGGGACGTT AAATTTGGGA
GGGGTGATGT TAGAGTAAAT ATGCCTAAAG AAGGTATGAC CTGGAAAAAA AAACATCAAC
ATGAAATGAC TGATGTGAAA TAGGACGTGC AGTGGCCATG GAGGTCTCAG TTGATGAACA
GCATCAGCAG CATTCCAAGA CGAGGGCTGG AGCTACAGCT GCAGAATCAC CTCAGAATCT
TGGCTTCAGA GCTGCTTACA GACCTCCTCC TTCGCCTCAA ACTGTCACAC AACCACAATT
GCAGACTCAG AGGTTGGCTC CGCGAATGAG ACCACATGGT GAAAACCAAC CTGAACCACT
TTCGCAAGGC ACTACAATTT CCCCTAGTAT GAGAACTCTG AGTCACCCGA TTTTTGCGCC
CTTTGGTTGG GAACCTTCCA ATCCTGTTGT CCATCCGCAG CAGAGGAGGT CAACGTATCC
GGTACACATG CCTAATTTCG TTGCCCCTAT TCATCCTGCT TACCACCAGA TTGTTGCGGA
CCCGCCAAAA AAGCAGGTGC AGGTACCTAT TGGAACACGA TTCCATCCGC CACAAATGCA
TCCTCCTCCA ATTTATTCGC TCTCCTCATC GGAATCTTCC ACTAATTATC GGGAAGGGTC
TGGTCTCGAG CCTGCTGGCT CTCATCGTCA GAGGAAGCGA CAGCTGATCT CTTGCTATCC
TTGCAGAAAG CGCAAGCTTC GATGTGACGG CCGGCGACCA GTCTGTGAAC AATGTGAGAG
AAGGAAAGTC GCCGACCAGT GTGGATATGC TGAAAGTATT AAACGACGGA GGAGGACCAA
GAATGCTGAG GACGATGATA TCGAGATGAG AGATGAAGGG GATGACGAGA TCGAAGAAGG
AAAGGAGGAG GAGATACAAG CCGGGCCTAG CAGAAGGGAG AACTTGGATC GCGAAGAGAG
GGACCAAGTG TAAGGATATA GAGGAAGGGG GATGGAGGAA GACAAAGAGG AAGACGAGAT
GCAGACGACG AAATAGTTCT TACGAAGACA GCGACACCGC TACACCAAGC CCTTGAAGGA
TATGAGTCCT ACTATCCCAG GAATGACCCA TTGTCATCTG ATCTGAGACC TATAAGGATT
ATCGACCTCG TGCTTGTTAA ACTGGAGTTT CCTCAGCACC GGCGTATTAA CAGAC
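
The GC content field can be checked against a sequence laid out in space-separated blocks like the one above. The sketch below is one way to do that. The helper names `clean_sequence` and `gc_content` are hypothetical, and how the database itself computes or rounds the percentage is not stated on this page.

```python
def clean_sequence(seq_text: str) -> str:
    """Drop spaces, line breaks, and any non-ACGT characters; return uppercase bases."""
    return "".join(c for c in seq_text.upper() if c in "ACGT")


def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    bases = clean_sequence(seq_text)
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for c in bases if c in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Short illustrative input only, not the gene sequence from this record.
    example = "GGCCAATTGG CCAATTGGCC"
    print(len(clean_sequence(example)), gc_content(example))
```

To check this record, paste the full Gene sequence text into `gc_content`. The cleaned string should contain 3175 bases if the listing is complete, matching the Gene Length field.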
 
Protein sequence
MLLESDYSEM GREGHRSPEK HRSEDRRPSG EEHLVSRYNN PPISEQTRNS PSREASTTGH 
EAQPLAPSSA EQNLPTHYVV RPIQTPDAHQ QSGSSDRIET ATEVRQSEPS AHQQPIEGLL
LLRQQLPISP ARKRSRSPPP FGRIDKFGRS RSSSPSGSKS GEDKARSRPN TGDELKSSMR
GREMQSRSPH NVTDFLTEPP SSALRSTVHS SLSFFERELV DVKSPCLPAI SDWKPTETPP
SGLIKPTLSS LVSDFRSLPP PFPSAPLSSA RSFFGQSRAG PSSQSQESSN PRRRSEDVHH
HLPSLELPLQ PLVEEQTLQP PRPPLDRQQH LDIPSPSLSD PAKGPGEQPP SAVKGSRAGM
REKGGRFEQG GSGSGKKKAK HQSAELLSPK GSDTKNVADA PGPSVNARLS TESTGHESAS
GRAFDIHWTP DNVKAYWMGY DSAMRDVKFG RGDVRVNMPK EGRAVAMEVS VDEQHQQHSK
TRAGATAAES PQNLGFRAAY RPPPSPQTVT QPQLQTQRLA PRMRPHGENQ PEPLSQGTTI
SPSMRTLSHP IFAPFGWEPS NPVVHPQQRR STYPVHMPNF VAPIHPAYHQ IVADPPKKQV
QVPIGTRFHP PQMHPPPIYS LSSSESSTNY REGSGLEPAG SHRQRKRQLI SCYPCRKRKL
RCDGRRPVCE QCERRKVADQ CGYAESIKRR RRTKNAEDDD IEMRDEGDDE IEEGKEEEIQ
AGPSRRENLD REERDQV