Gene CNG04340 details

Gene Information

Locus tag: CNG04340
Symbol:
ID: 3258552
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 1226566
End bp: 1228522
Gene Length: 1957 bp
Protein Length: 544 aa
Translation table:
GC content: 48%
IMG OID: 638258058
Product: transcription factor iiia, putative
Protein accession: XP_572108
Protein GI: 58269904
COG category:
COG ID:
TIGRFAM ID:
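
The reported gene length can be checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (this is consistent with the 1957 bp figure but is not stated in the record). The function name span_length is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields above.
gene_length = span_length(1226566, 1228522)
print(gene_length)  # 1957
```

A 544 aa protein needs 544 * 3 + 3 = 1635 bp of coding sequence including a stop codon, so the rest of the 1957 bp is presumably non-coding, which fits a gene marked as spliced.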


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0235343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACCTCC ACTTTTTCGC GGTAGGGGGG GACCCTTCAG GCAACAGTTT TCGTATTCTA 
AATCCTCGAC ATCTTCGCTC TGCAACTTCA GTTATAACTG GCTCATCTCC GCATAGGATG
ACGCTCGTCC TAAAACCACC GTCACTGGGT GACCTTATGG TATTTGAGGA CTTTCTCACT
CATCCTACTC ATCACCAACC CAAGTCCATG GCTGAGAGAA AACATGACTG GCCAAGTCGG
ACGGAAAAGA GATACAAATG TGCCTATGAA GGATGCGATA GAGCTTACAC CAAACCATCA
AGACTTGCTG AGCATGAGAT GACCCATCGA AATGAGGTGC GTATCGTTTG TGCCCGACTC
TGTAATTAGC TGACGGAATA CCCAGCGACC CTTCGCCTGT TCACAATGCC CTCGGACATA
CTTCAGAGAA GACCACCTCA AAGCCCATGC GCGCACCCAT TCGAACGTAA AGATCAAGCC
CTTCCCGTGT ACCCGTGAAG GCTGCAAACA GTCTTTCTGG ACTGCATCAA AACTTCGTCG
ACATGAAGAG GTTCACGATA AGGACGGTGC CTATCCTGTG AGCTGAAGCA CTAGCGTCTA
TATATTGTCC GATGCTGACG CCGGCCACAG TGTGACAAGT GTGAGGCTGC GTTCAACAAG
CACCACCTTC TCCGAGAACA CGTCGCTGTA GCCCATATGC CTCCCGGTAC CAAACCTTTC
ATCTGCACTC ATGAAGGTTG CAGCGCTTCT TTTGCAACGA AAGCTCATCT CAAGAATCAC
GAGAAGACCC ACGACGGTGC GTTACCTTTA GTTGTCATTC CCTTGCCATA CTGATATAGG
CTTTAGAAAG ACGGTATATT TGTTCTCACC ACGATCATGG CGAAGATTTC CCCAAATTTT
CCAAGTGGAC AGAACTTCAA AAGCACATAT CTACGGAACA CCCGCCCACA TGTCCTCATC
CTGAATGCAA CGGTCGAATT TTCAAGAACA ATCAACGTCT GAGGGATCAT TTGCGTGTAC
ACGCGGATCA ACAGGCCGAC AAAGCTGCGC TTGCGGACCG ACGCGAGGAA GAAATGCCAC
AATTGCTGTT AGAAGGGTTG GGCAAAAGTA GAAAGAAGAG GAAATCTTTT GCGCAACGGG
AGGCAGAGGA CAACGGACCT AGGAAGTTGA GAAAGATTCT CAACGGTGAT GCTGGAAAAG
ATTGGGCTTG TGAGCATGAG AGCTGTGACA AAAAATTCAA ATCTGTCAGT TCCTTTCTTC
ACTGACTTCA TCTTTGCTGA CTATCTTGCA GCGATATGCT TTAGAAACCC ATATCAAAGT
TGTCCACCAA AACATCCGTG AACATGTTTG CCCCCGTGAA GGGTGTGGCA AAGCGTATGC
CTACAAAACC AATCTGAATC AACATCTCGC CAAACATAAT TTATACGCGG GACCTTCGAA
AACCGCGACA TCTGAAAGCG GGATGTTGAC TGGGATGGTC AAGGAGATGA GGAGATTCAT
CTGTCCTGCA TGGGCATTAG GCGTCTTTCC AGAGAACGGG GATATGATTG TCACTCCACC
GCGGCCTGAG CTTCTTACGG AGGACAATAA TGGTAATGAA CAACTTCAAT CTATCGTCAG
ATGCAGAGAC CCTGCACCGG AGTCAACCAC TACCGCCAAC ACTTCAGCAA TACCAACACA
ATCGACACCC GAAGATTTGA TTGGCAAGAG GTGTATCCTT CGATTCTGGA GAGTGTACGA
CGTTCGACGG CATCTCAAGT CAGAGCACCG TGTCGAGCTT GACGATATGC AAATTAGGAG
ATTGTTGCTC AGCACTGGTC AAACCGGGGA ATAGAATGTT AGAATGTAGA TGGATAGAAG
GAAAGATGGG ATAATGTATT AGTAGTAGAT ATTACTCGTA TTGTTGTAGT TACTGGCTAG
TAGGATTTGT ATTTCCGACT TTTTGCTGTG GTTTTTT
 
Protein sequence
MYLHFFAVGG DPSGNSFRIL NPRHLRSATS VITGSSPHRM TLVLKPPSLG DLMVFEDFLT 
HPTHHQPKSM AERKHDWPSR TEKRYKCAYE GCDRAYTKPS RLAEHEMTHR NERPFACSQC
PRTYFREDHL KAHARTHSNV KIKPFPCTRE GCKQSFWTAS KLRRHEEVHD KDGAYPCDKC
EAAFNKHHLL REHVAVAHMP PGTKPFICTH EGCSASFATK AHLKNHEKTH DERRYICSHH
DHGEDFPKFS KWTELQKHIS TEHPPTCPHP ECNGRIFKNN QRLRDHLRVH ADQQADKAAL
ADRREEEMPQ LLLEGLGKSR KKRKSFAQRE AEDNGPRKLR KILNGDAGKD WACEHESCDK
KFKSRYALET HIKVVHQNIR EHVCPREGCG KAYAYKTNLN QHLAKHNLYA GPSKTATSES
GMLTGMVKEM RRFICPAWAL GVFPENGDMI VTPPRPELLT EDNNGNEQLQ SIVRCRDPAP
ESTTTANTSA IPTQSTPEDL IGKRCILRFW RVYDVRRHLK SEHRVELDDM QIRRLLLSTG
QTGE
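
The sequences above are printed in blocks of ten characters. A minimal sketch of how such a listing could be normalized and summarized follows. The helper names join_blocks and gc_fraction are hypothetical, and the short strings in the example are fragments copied from the listing, not the full sequences.

```python
def join_blocks(listing: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(listing.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = join_blocks(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence and the last line of the protein sequence.
dna_fragment = join_blocks("ATGTACCTCC ACTTTTTCGC")
protein_tail = join_blocks("QTGE\n")
print(len(dna_fragment), gc_fraction(dna_fragment), len(protein_tail))
```

Applied to the full listings, len(join_blocks(...)) can be compared with the Gene Length and Protein Length fields, and gc_fraction with the GC content field.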