Gene CNB03900 details

Gene Information

Locus tag: CNB03900
Symbol:
ID: 3255689
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 1149043
End bp: 1151638
Gene Length: 2596 bp
Protein Length: 609 aa
Translation table:
GC content: 52%
IMG OID: 638255036
Product: transcriptional regulatory protein, putative
Protein accession: XP_569219
Protein GI: 58264126
COG category:
COG ID:
TIGRFAM ID:
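
The listed gene length agrees with the start and end coordinates if both are read as 1-based and inclusive, which is an assumption here and is not stated in the record. A minimal sketch of that arithmetic follows; the function names are illustrative and do not come from any database API. The coding-length helper further assumes a single stop codon after the 609 codons.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein, assuming one stop codon is included."""
    return (protein_aa + 1) * 3


length = gene_length(1149043, 1151638)
coding = coding_length(609)
print(length)           # 2596, matching the "Gene Length" field
print(coding)           # 1830
print(length - coding)  # 766 bases of the gene span that are not coding
```

The 766 bp difference is consistent with the record marking the gene as spliced, although the arithmetic alone cannot say how much of it is intron and how much is untranslated sequence.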


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAACA ACGCCCCCAG CAGCTTCCAG GGGTACCCCT TCCCCCCGCA GCAGTCAGAC 
CTGTACACAT TCGCCCCGGT CAGCCAGCAG CAGCAGCAGC TCTACTACGC AGCACAGCCA
GCCACCGACC AGCACGCCTC CGCAGACTCC TCCGAGAGCG CAGGGCCGTC GCAGAAGAAG
ACGCCCGCCG GGGCCAGGAA GCACGGCGTG AAGGAGGAGC CAGAGGAGCA CGGCAAGGCG
GATGAGCCAC GGAAGAAGCG CACGAAGAGG TCGGCCGGGA AGGCGTGCGT GTACTGCAGG
CGAAGGTGGG TGATGTGTCC GTGTAGGAAT GGTCGAATGC TGAATACTTG CAGCCATATG
GTGTGCGAAG GCGGCAGGCC GTGTGAGCGA TGGTGAGCAG CACGTCTGCA GCGTGGACGG
TGGCTGACGA CTATGCGCAG TATCAAGCGA GAAATCCCCC ATCTCTGCCG TGACTGCACC
CCGCCTCCCC AGAACCAGCA GTCTCCCCAT AAGCAAGAAC CTCAGGTGCG TCGTGTGTGT
CGAGTAAGAC CCAGCTGACA GGCTGGCGGT GCGAGCAGCA ACAACAACAG ACTCAGCAGC
AACAGTCTCT TAATCAGTCC CAAGTCTTGC CATCCTCAGC CCAGCCGATG CCAGTGTATA
CCGACTCCAA CTTTGTCCCT AGCTGGCCAT TACTTCCCGA TGCTGGCGCA GCTCAAATCC
CGTTCAGCGA TGCTCCTTCA GAACAGATTC AAAACCCAGG GGATGCGGGT ATCATGGGTC
CGCCGCCATT GGGATCGTCT AGAGAGGACG GGGAGCTGGC TGCTTTGAGG TGGGCTGATT
TCATGCCTAT ATTCCAAAGC ACATCTGACA TTCTAACCTG TAGCAAATTC ATGAAAGACT
TGGGCGTCCC TAATCTTCCC AATGATTTCC TCTCCTTCAT GAACCAGCTG GACAAGCCAG
ACGCTAGCAA CAGTTTATCC ATTGCGAGTT CTTCCGACGC GAATCTCTTT TCTTCGTCTG
GGGCGCCATA CGGTGTTGGG TTAAACAAGG GCAAAGGCAA ACTCAACCAA ATGTCCAGAA
TGCAAGTCTT TCTCTCCCTC TTAAAAACGA TATTCCCACT GACATAGTAA CGTCTTCTAC
AGCGACAAAT ACCTCATGGC GGCTGCTGAT CAGCCAAACG GTACCCGTGC CTCTCGTCTT
GCACAAGTCA TCAAGGCCAA GTACGATGCC GGTCTTCTCA AGCCTTACGA CTATGCCAAG
GGATACGAAC GTATGAACAA GTGGATGGAG TCTGGACGGG CGGCACCCAG GATGGACAGT
CGTGCCGGGT CCGAGATACC GGACGAAAGC CCGCAGAGGT CTACGGCCGC AATGAGGAAC
GGTAGATTGT CTGTTGCTCG TGCGTCTTTT CTTTTTTCTC CAATTTAGTC GCTTCTGATG
CTCATGATGA TGAACGAGAG CAGCTTTGGC GCCCAATACA CCTGCATTCG GGAAGAGCAT
CTCACCTGAA TCTCGCCGAC GAATCCTTGC TGCTCTTGCT GGCTTCAGAC CAAAGTTTAG
GCAAATCGCA AAGACATTGA CCAATGTCGA CTTGGTATTT GTAGAAGAAG CTATGGAGCG
ATGGATGCTC GAGTATGACA GGGCTTTTGC TTGTGAGCCG TCCATTCCAC ACTAGCAAGT
CGCTGGCGGT TAATCGCTGA CACTTTGAGA CAGCTATCCA TACCCCTTCA TGTATCTGGC
GACGAACGGG AGAAATTCAA AAAGCCAACC AAGAGTTTTC CAACCTGACC GGTATTCCCG
CATACATGTT CCGTGATGGC CAACTTTGTG TTTACGAACT CATGGATGAA GACAGTGCGG
TAAGATATTG GGAAGGATAC GCGAAAATCG CGTTCGACCC TAGTAAGTTT ACCCATCATC
TGTGAAGAAA GTCCCCCCCC CCCCGCGGAA AAAAGTCGTC TAACAAAGAG GATCAGGCCA
AAGAGCAATG TCTATATTCT GTACTCTCCA CATTCCCCTT TCCCTTACTC GCCACCCACC
GCGTCACCTC ACCAACGCCA ATTCCAACCA CATTTCCAAA TCAGCCACTT CCGGGCCCCC
ACAAGCACCC TACACGCCCG ACCTTGCGTT GCCCCACCAA AACATGCTGT TCAACGACGG
TGCAAGCTCG GATGCGGGCA CAGCCATCGG GGAAGAGTAT AGGGAGATGA AATGTGCGTT
TAGCGTGACG ATCAGGAGAG ATGCATGGGG TGTACCGGTC GCGATTATGG GACAGTGGAT
TGTGAGTCGA AATCTCTTTG GCCAAAAAAA AAAGAAAAAG CAAAGGTAAA AAAAGAAATC
AGACTGACGG ATGATTGATG TAGCCCATCC AATAGATTCT TTCAACTCTA GCACTTTGCA
CATCTTGCAG AATTATTATT TTTTTCTCGG GCTTTTCTCA GGCTTCATTC ATATTGCATC
CATCACTCAA CACTTAATCA AAGCATATCA ATCGTGTCTA ATGATGTAGC GACCAAATGT
ATATAGATGC ATGGTAATAA ATTGATTTTT CTCTTCTCAT GAGAGGGGAT GCAAATGCAA
CAAAAAAGAC TGGCTA
 
Protein sequence
MQNNAPSSFQ GYPFPPQQSD LYTFAPVSQQ QQQLYYAAQP ATDQHASADS SESAGPSQKK 
TPAGARKHGV KEEPEEHGKA DEPRKKRTKR SAGKACVYCR RSHMVCEGGR PCERCIKREI
PHLCRDCTPP PQNQQSPHKQ EPQQQQQTQQ QQSLNQSQVL PSSAQPMPVY TDSNFVPSWP
LLPDAGAAQI PFSDAPSEQI QNPGDAGIMG PPPLGSSRED GELAALSKFM KDLGVPNLPN
DFLSFMNQLD KPDASNSLSI ASSSDANLFS SSGAPYGVGL NKGKGKLNQM SRIDKYLMAA
ADQPNGTRAS RLAQVIKAKY DAGLLKPYDY AKGYERMNKW MESGRAAPRM DSRAGSEIPD
ESPQRSTAAM RNGRLSVAPL APNTPAFGKS ISPESRRRIL AALAGFRPKF RQIAKTLTNV
DLVFVEEAME RWMLEYDRAF ASIHTPSCIW RRTGEIQKAN QEFSNLTGIP AYMFRDGQLC
VYELMDEDSA VRYWEGYAKI AFDPSQRAMS IFCTLHIPLS LTRHPPRHLT NANSNHISKS
ATSGPPQAPY TPDLALPHQN MLFNDGASSD AGTAIGEEYR EMKCAFSVTI RRDAWGVPVA
IMGQWIPIQ
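
The sequences above are printed in blocks of 10 characters, so the spacing and line breaks have to be removed before the lengths or the GC content can be checked against the Gene Information fields. The following is a minimal sketch of that check; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all characters of a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10).
first_line = "ATGCAGAACA ACGCCCCCAG CAGCTTCCAG GGGTACCCCT TCCCCCCGCA GCAGTCAGAC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this one line only
```

Passing the full gene sequence through the same two functions would give values to compare with the listed 2596 bp and 52%. Passing the protein sequence through `clean_sequence` gives a length to compare with the listed 609 aa.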