Gene CNB03380 details


Gene Information

Locus tag: CNB03380
Symbol:
ID: 3256057
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 1017221
End bp: 1019091
Gene Length: 1871 bp
Protein Length: 408 aa
Translation table:
GC content: 45%
IMG OID: 638254982
Product: expressed protein
Protein accession: XP_569023
Protein GI: 58263226
COG category:
COG ID:
TIGRFAM ID:
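
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported gene length). The helper names `gene_length` and `coding_nt_with_stop` are hypothetical and not part of any database API. A 408 aa protein needs 1227 coding nucleotides including the stop codon, fewer than the 1871 bp gene span, which is consistent with the gene being flagged as spliced; the remainder would be introns and any untranslated sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nt_with_stop(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein of this length plus one stop codon."""
    return 3 * protein_aa + 3


# Values copied from the record above.
print(gene_length(1017221, 1019091))  # 1871
print(coding_nt_with_stop(408))       # 1227
```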


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.203018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTATTATTA CTGAAACATA TTGACCATCT GATCAGCTAA ATGTGAAATA TGGAGCAGGA 
GGGGAAAAAA GACAAGGCTG AAGAATATGA TGGTTAGTTG TGTTCGTAAT GGTTATTTAC
TACTCATTCG GATTTTGTCT GAAGTATGCG TGGAATCTAG GCACTCAAAT CTCAAACAAA
GATCGACGAT GATTTAACGT TCTGTGTTGA TCCGATGTTT ATCGGGCCAA GAAGATTGCG
GGACTTCACC TGCTTTGAAG ATCTAGAGCT CTGTGTTTAG CGTGCGTGCT CTTATAAACC
TTCTCGCATA GCCCAGCACC CATCAATTTC CAAGTTTAAA CAAACACAAC GGCACTTCCT
ATTTGTCTTT TAAAACATAT TCTGTCCTAT CGTCAGTGCA CGTTATCCCA CAGCCACTAT
GCCGGATCCT AATTGTCACT CTCCCAATGA CACAACACCT CTCCCGACTT CTTCAGTATC
GTTCAGCGAT ACGAGTGATT ATTCCCCAAC GAGCTCTGTT ACTTCATCCG AAATGAGGCT
GTCAGATCAG TTACCTTCTC CTAAATTTGC CACGTCTCGC GGCTTCCCAT TACTACCACC
AAGTCACCCC TTGCGCGCTC AACTATGTAC CCAGGTTGAA GGAGAAATGA GCCCACCTAG
AACACCTAAG GGACATCCTA TAGATCAACT GCCAAATTCT TACTTTCCGT TCCCCCCCTC
AAAAGCAATA AAAGTAAGTT CCCTTTAGTT TCTTTGTTTG AATCACCTTT CCACAATCAA
CGGGGAGACG TCATGCTAAT ATAGATTTAC TATTAGCACA GGAGTCAAGG TATCGCCAAT
CCATATGGTA GCGATAACCC TTCGTATCCG GTTGGTATCA CCTATTTGCT TCGTATACCT
CGAAGGCTCC GTCCATACCT CTTGGTCTCC TTCTGCCTCT TCATTTTCGG ATTCATCCTC
GTGAACAGAG CAGTGTCAGT TTCTCAGGGC ACACGCGCTG TTACTATGCA AAAGCATTTC
AGCCACTCAA CGCACAAATA CATACCGATT GACAGGCTAT ATGGAACCAA TGACGGAAGT
TCATACCTTT TATCAAGTCA ACGGGCAACA GAAGCAGGTA TCGCGGAGGT GAATGCGCGA
GCTAGAAAAA AGGATGCCGA GCTTTTCAGG TTTGAGTCAA TGGAAGAAGA ACTTGCTGCT
TTGATTTCAG TGAGCTTAAA GCCATGAGTC GCCCGATGGG AATAAAAACT TATGTCGGCC
GATAGTTTGT CACTTCTACT ACCTCAAATG TCATCCCTCA ACTTGATCCA TCGGTCCCAT
TAGACCCCTC TGTTATTCTC GACTTCGACC CTTCACACCC AAATGCCCGA GACGACCTTC
TTCTTTTGCA GGCGGAAATT AATGCTGTGT ATCCTCTGGT TCTTTTTGGG AGGATGCGCG
ACCCTCATCA CAGGGCGATC AAGCGCTTAT TATCGGAAGT CAAAATAACA CCCGCCCCTC
TTGTTATCGA AGTCGATCAG CGCAAGGATC GCAAAGTCTT CATACCAACT GTGGCAAGAC
TACTGGGGGA TGAACTTCCC GTCATAACCC TACAAGGCAA GAGGCTGGGG GGTTATAAGG
AGATAATGGC AATGCACGAG GCGGGTACCT TGAATGACCG TCTCCAGAAA GATGGAGCAG
TACTGGTGAG AGAGATAAAG AAAAAGAAGA AGGGAGCTAA GGAACAAGAG AGAATGGAAA
ACGAAAGAGT CTTGGGACCG GCGCCAGTTG TAGATGATGA ATAAGTTAAG GATTAAGAGA
TATTGCACGA CTGACAACGA GTGTTTCTGG TCATCTTATG TGTGTAATTA TATATATTTT
TTATATATTG C
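
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step; the function names `clean_sequence` and `gc_content` are hypothetical, and the short demo string is a toy input, not taken from this record. Pasting the full gene sequence into `gc_content` should give a value comparable to the GC content field reported in the gene information.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout as the record.
demo = "GGCC AATT"
print(len(clean_sequence(demo)))  # 8
print(gc_content(demo))           # 50.0
```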
 
Protein sequence
MPDPNCHSPN DTTPLPTSSV SFSDTSDYSP TSSVTSSEMR LSDQLPSPKF ATSRGFPLLP 
PSHPLRAQLC TQVEGEMSPP RTPKGHPIDQ LPNSYFPFPP SKAIKHRSQG IANPYGSDNP
SYPVGITYLL RIPRRLRPYL LVSFCLFIFG FILVNRAVSV SQGTRAVTMQ KHFSHSTHKY
IPIDRLYGTN DGSSYLLSSQ RATEAGIAEV NARARKKDAE LFRFESMEEE LAALISFVTS
TTSNVIPQLD PSVPLDPSVI LDFDPSHPNA RDDLLLLQAE INAVYPLVLF GRMRDPHHRA
IKRLLSEVKI TPAPLVIEVD QRKDRKVFIP TVARLLGDEL PVITLQGKRL GGYKEIMAMH
EAGTLNDRLQ KDGAVLVREI KKKKKGAKEQ ERMENERVLG PAPVVDDE