Gene CNB00290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB00290 
Symbol 
ID3255986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp79756 
End bp81237 
Gene Length1482 bp 
Protein Length467 aa 
Translation table 
GC content51% 
IMG OID638254680 
Productexpressed protein 
Protein accessionXP_568770 
Protein GI58262720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000223097 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATGCG TTATCATTTG CCATCTCCTC GCTCGACCCC CCAACTTTAA TCATTCATAC 
GTACATCTTG CAAAAGTCCT CGTACGAATC AGATCACCAA TCATGACTTT AGCCAGTACC
CAGGTCGGCC AGACCCCCGA TATTCTGCAA CTTATCTGCT CTTATTTGCC CCCCTCGACA
CTGTTCTCCT GCCTCCTGAC GTCTTCCACA TTCTTCCATT CTTCCGCACC AGAGCTGTAT
CGTTCACTTC ATATCAAGCA TGCAAGAGAC TGTTTTGTGG GCGCTACAAG ACAGTCGAAC
GGCTTCTACA CGTCATCGTC CGCCTCGCCC ACCTCCTCCT CTTCTTCTTC CACAACAACT
CGACCACGAC CCATCTCCCC GTACAGCAAA GACGCTGTAC TGTCCTTTGT ACGAAAAGTA
CATATTCACA CGCACGGGAG GAATGAATGC CCCTTTGTGC TCCACTACAT TAACCCGCTA
CCGCACTTGG AATATGTTCA TTTAGCAGGG GGAGTTTGGC CTGCTGAGCT CGCAGCGGGA
GATATATGCG ATCCCGAGAC ATGCCAGTTT CTCGCCAAGG TTTGCACCCG TGCGAAGAAG
GCGATGATAA GACAAGCAAA CCTCGGCCCT CTCAAATTAT TAACCAGTTT AGAGCACGTC
ACCGTCAAGA TACGACCGTG CCAACTTCCG CTCTACATCT TCCACGATAC AGCTTACCAA
TGGTCGTACC CCGTGCCTAT ATCTAGCGCC AGGATCGTGG ATTTGGTCTT TTGGGACGAA
CGGCATTCAT TCCGTATTGA ATGGCGTAAC GTCCCAGGTG GCGGAATGAG GTCGTCTTTG
ATGGGCTTGA GGTACCAGCT ACCATCTCCA TACCTCGGAG GCGAATCAAT CATGCTCAAG
GGCTGCACAT ATTGCGACGA GCGGGGTTGT ATCAGGCATA CTCCACATGC CGCCGTGCAG
CTTCCGGCCC TGATGGCGAC ATTGGGTTCG CTTATGAATG TGAAGACGAT CAGGGTATGG
AATGTGGACC ACACTGCCCG GAGACAATGG AAGCAGGGTA TGGTCACTGT AGAAACAGTG
AAAAGACGGA TGGAAGAGGC ATGGCTGAAA GCCCGATCGG AGGATTTAGA TTGTGGGGGC
CACGACGAGC AGGAACTTGA CCAAGGTCAA CAAACAGGCG TTTCATTCCA CTCTGCCAGC
GAGTATTTTA CCTCGGCCCA ATTTGACATG GAGACCGATC AAGAAGAGGC CAAGTACTGG
CAAAACCTCG TAGAACCCAG TCAAGCGGTA CTACGTCTAC GCCGACAAGT CCAAGATATG
TTTGGGGGAG ATGACGAGGC GTTCGACCGG TACTCTGAAA GGGAGTTGGA GGGGTTAGTG
GAAAACGAGA GATGGAGGCG ATGAATTAGC GGAGCAGCAT TCTTGGCTAC GTTACTGGGC
GACTATGTAC TTCTTGAATA TTAGACTACT CAAGCACTAG GA
 
Protein sequence
MSCVIICHLL ARPPNFNHSY VHLAKVLVRI RSPIMTLAST QVGQTPDILQ LICSYLPPST 
LFSCLLTSST FFHSSAPELY RSLHIKHARD CFVGATRQSN GFYTSSSASP TSSSSSSTTT
RPRPISPYSK DAVLSFVRKV HIHTHGRNEC PFVLHYINPL PHLEYVHLAG GVWPAELAAG
DICDPETCQF LAKVCTRAKK AMIRQANLGP LKLLTSLEHV TVKIRPCQLP LYIFHDTAYQ
WSYPVPISSA RIVDLVFWDE RHSFRIEWRN VPGGGMRSSL MGLRYQLPSP YLGGESIMLK
GCTYCDERGC IRHTPHAAVQ LPALMATLGS LMNVKTIRVW NVDHTARRQW KQGMVTVETV
KRRMEEAWLK ARSEDLDCGG HDEQELDQGQ QTGVSFHSAS EYFTSAQFDM ETDQEEAKYW
QNLVEPSQAV LRLRRQVQDM FGGDDEAFDR YSERELEGLV ENERWRR