Gene CNA02820 details

Gene Information

Locus tag: CNA02820
Symbol:
ID: 3253523
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 731534
End bp: 733767
Gene Length: 2234 bp
Protein Length: 622 aa
Translation table:
GC content: 53%
IMG OID: 638252613
Product: hypothetical protein
Protein accession: XP_566687
Protein GI: 58258549
COG category:
COG ID:
TIGRFAM ID:
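
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the listed Gene Length matches that convention) and that the protein is followed by a single stop codon. The helper name span_length is introduced here for illustration only.

```python
# Values copied from the Gene Information fields above.
start_bp = 731534
end_bp = 733767
protein_length_aa = 622


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Bases needed to encode the protein: one codon per residue plus one
# stop codon (the single stop codon is an assumption).
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that are not accounted for by coding sequence.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # prints: 2234 1869 365
```

The 365 bp not accounted for by coding sequence is consistent with the "Is gene spliced: Yes" field, since introns and any untranslated bases in the span would make up the difference.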


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0533898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCCATGTCC CTCTACGGCG GCATCAAATT CTCTGCGACG AACCCGGAGG ATGAGCCAGA 
GCAGCGGGAG ACCCCCCGCC CGCCAGACGC AGACGCTGGC TCCCTCCCCC CCGCACAGTC
CAAGCAGAGA CAGCCCCCCG CATTCTCCTC CGCCCTCAAG TTCGCCCCCC GCATCAACAA
GAAGCCGCTG AAGCCGCCGG TCCCAACCGC ACACGTCTCC GCGCTCCCCC TCAATCCAAG
CTCCGCAGAT ATCGTTCGCT CTGCAGAACC AGTCTTGCAC TCTCGTTCCC CGGCAGCTCA
AGATGAAGGT GATGCGCAGC TGGTATTCGG GCCAGACGGT CTGCCTCTTG CCAAGGCGCC
GGCTATGACG ATAGGGACCA ACAAAGGAGG GTACAAGGAC AGATTGGGCA ATGATTTCTC
CGGTGAGAAA AAGAAGAAGA AGAAGAAGAA AGTGGGTATT GTCGGACTAC GTGGTAAAGC
ACCTCGCTGA TCCCTTCAAA GAGAAAAAAT CCACAACCTT TTTTCCCCAC GTTTGACCCT
GAAGAGATAT ATGATCCCAA CCGGCCCAAC GACCTCGGTG AATATCAACA GTATCGGAAA
CGTGCCAAGG AAGAGAGGAG GAGGAAGCTC ATGGAGGCTA AGAGACGAAG AGCTGAAGGC
TTGAGCAGTG ATGAGAGCAG TTATTATACA GACAGCGAAG AGGACATCGC TCCAAGAAGG
GACGGTAAGT GGATCCATCA ATTGGCCGGT CAATTACTGA CTGTATACCG TAAGCTCCCA
AGATGTTTGC TCCTCCAAAG ATGTATTCTC CTTCGGCCTC AAAGTCTACT GTTTCTGAAG
AACCCGAGAC TGCCGTTAAT CGTCTGCCCG AATCGCAACC CGCATTTGGC CGTGATCAGG
ACGATAGGCG GATGGACGTG TCACGCCTAT CGCCGTCGGG GGATGATGCT TATGCTAGAC
GGGTCGCCAT GACGCAGCAA GCTCCCTTGC ATCTACATCA TCCTTCACAA TCAGGTGACG
ATGCATATGC AAGGAGAGCA GCCATATCTC AGAACCCCCC ACCTACAACC TCATTCATCC
CGAGTTCTAC AGCCTCTTCA TCCGCTGCAC CAGATGCATA CCCCTCTTCT CATTTGACTC
AACCACCTAC AACCACCCCA CCACCTTTGC CCGTGATACC AAACGCGCAA GCCGAACTTC
CACAGGTTCA GGCAGCCGCA TCAGCTAGTC AAGATTTCCA GGCTATGTTG GAGGAGAGAA
AAAAGGCTGC CGAGGCTATT GCTGCCAAGT TCAAAGCTCT TGCCGGAGCG GCGCAACCCC
CATCATCACC TGCTCCCGCC TCGGCATCCT CCTCTGCAGC GCAACTTCAG GATGTGTGAG
TCGGAAAGGT GCAATATGGT GTCTGATCGA CCTGCTGACA TAACATAGTG GTGGTGGGAC
ATTTGCAGAG AAAAGTAGGT CATCTTTCAT TTCAGCCTCG TTCGTTCATA TTTAACTAAT
CCCCTTATTT CAAAGTGATG CGTAAATGGG GCCACGTCGA AGGCACAGGC TTAGGAGCTC
GTGGTGAGGG TATTGTCCAC GCTTTAACTA CGGAACATGT TGCTCCGGTT GCGAACCTCT
CGCAACCCCT ATCAAAGCGG GCCCTTGCCA AACAAAAAGC AGCAGCAGCT AATGCCAAAT
CTCGCAAATG GGTTCAAGCA CCTTCTGCCC GTGGCAGGAT CGTCAACGAC AATAAGGATG
AACGTGCAAA GGAAGAAAAG GAGAGAAAGG GTGAGGAAGG CAGGGTGATT TGCCTAAGGG
GCCTTGTGGG TTCGGTAGAG GAAATTGACG AGGAATTGGT AAATGAGATA GGAGAAGAGT
GTTCAAACTA CGGGATCGTA GAGAGGGTAG TGCTTCACCT GGTAGAACCA CCTCCGCCAG
AACCGGAAGA GTGTTTGCGC GTTTTTGTGG TGTTTTCCGG GATGGCGGGG GCATGGAGGG
CAATCAAGGA ACTGGATGGG AGATTTTTCG GGGGAAGAAA TATTGTGAGT ATACACTGTC
TAATGGCTTT GGGATATTAT TGAAACAAGA ATCAGAAAGC CACATATTTC GACGAGACCA
GATTCGACAA GGGCGATAGG GATGGCCCAG TGCTTTAGCT TTTTAGGAGA TGTGTCCGGT
GTATGGAATA CGATTTTACA TGTGCACGTG CTCTCAGACT GGACGTCTTA CCCACCAACA
ATTTAGGTAT ACAG
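
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any length or GC calculation. The following is a minimal sketch of that cleaning step, not the method the database itself uses. The function names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = (
    "CGCCATGTCC CTCTACGGCG GCATCAAATT CTCTGCGACG AACCCGGAGG ATGAGCCAGA"
)

seq = clean_sequence(first_line)
print(len(seq))                      # number of bases in the sample line
print(round(gc_fraction(seq), 2))    # GC fraction of the sample line only
```

Pasting the full gene sequence into clean_sequence in the same way would let the result be compared against the Gene Length and GC content fields of the record.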
 
Protein sequence
MSLYGGIKFS ATNPEDEPEQ RETPRPPDAD AGSLPPAQSK QRQPPAFSSA LKFAPRINKK 
PLKPPVPTAH VSALPLNPSS ADIVRSAEPV LHSRSPAAQD EGDAQLVFGP DGLPLAKAPA
MTIGTNKGGY KDRLGNDFSG EKKKKKKKKR KNPQPFFPTF DPEEIYDPNR PNDLGEYQQY
RKRAKEERRR KLMEAKRRRA EGLSSDESSY YTDSEEDIAP RRDAPKMFAP PKMYSPSASK
STVSEEPETA VNRLPESQPA FGRDQDDRRM DVSRLSPSGD DAYARRVAMT QQAPLHLHHP
SQSGDDAYAR RAAISQNPPP TTSFIPSSTA SSSAAPDAYP SSHLTQPPTT TPPPLPVIPN
AQAELPQVQA AASASQDFQA MLEERKKAAE AIAAKFKALA GAAQPPSSPA PASASSSAAQ
LQDVGGGTFA EKMMRKWGHV EGTGLGARGE GIVHALTTEH VAPVANLSQP LSKRALAKQK
AAAANAKSRK WVQAPSARGR IVNDNKDERA KEEKERKGEE GRVICLRGLV GSVEEIDEEL
VNEIGEECSN YGIVERVVLH LVEPPPPEPE ECLRVFVVFS GMAGAWRAIK ELDGRFFGGR
NIKATYFDET RFDKGDRDGP VL