Gene CNM00130 details


Gene Information

Locus tag: CNM00130
Symbol:
ID: 3255057
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006682
Strand:
Start bp: 28831
End bp: 30494
Gene Length: 1664 bp
Protein Length: 340 aa
Translation table:
GC content: 52%
IMG OID: 638254173
Product: hypothetical protein
Protein accession: XP_568381
Protein GI: 58261942
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG5242] RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4
TIGRFAM ID:
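
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the helper name `inclusive_span` is hypothetical and used only for illustration.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = inclusive_span(28831, 30494)

# A 340 aa protein needs 340 codons plus one stop codon.
coding_bp = (340 + 1) * 3

print(gene_length, coding_bp)
```

Under these assumptions the span matches the listed 1664 bp, while the coding portion needs only 1023 bp. The remainder is consistent with the record marking the gene as spliced.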


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00163838
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGCCC CCCCCTCCAC GCTCATCCTC GTCCTCGACA CCCACCCCCT CTCCTGGCAC 
CTCCTCGCCC ATCTCCCCCC AGCGCCCCCC CTGCCCGACA ACAAGATCCT CGACAATGCC
ACATCATCCC CGACATCGCT CGACCAGTTC ATCACCATCC TCACCGTCTT TCTCAACGCC
CATCTCGCAA GCAAATGGGG CAATGAAGTC GTCGTCTATA CTGCATCCGC CGGAAAGGCC
GAGCTGATCT ACCCGCCTTC AAACGAGAAG ATACGGCAAA GGGGAGAAGG GGCAAAGCCG
AGTGCGAATA TGTACAGGCC GTTCCAGATT TTGGATCAGG GGATTGAAGA AGGGCTAAAA
GAAGTTGTGA GGGAGGAAGA GGGAAAATTG AATACCGAAG GCGCCGGGTT TATAAACCAA
CCTCCCGCCA TGGTCTCGGC ACTCACTAAA GCTCTCTGCT GCAAGTCGTC GTCTCCCAAG
CTGTTTCGTA TTGTCTGCTG ATAGCGTTGT GACACTGATA GTCATTAACC GACGGATATC
ACCTTCAGTC CCTGCAGATC CAACAGCCCT CCCACTATCT TCAGACCCAA ATAGCGGCAC
CTCGGATACT TCTGGCGGCC TTTTACCAAG TAAAGAAGTT CGGATATTAG TTATAAACGC
GACTCCCGGA GCTGCCGTGG GAGGTCGAGC GGATCCGGAT GGTCGTCCAG GAGGAGGCGA
TGGGGCGGAG AAGGAAAATG GGGATGCAAA CGAAGAAGAA CGGAAAAATC AGAGACGGCA
ACAACGCATG CGCGGTGGTT ATGTCGGACT GATGAACTGT GTCTTTGCCG CTCAAAAAGC
TGCAAGTGAT CCCTCTTTTG TCTATCTACC GCGTTCGTCT GACATTCAGT CCTGCAGAAA
GTCCCGATCG ATATCCTGTC CCTTCCTCCA TCGACGATCG ATTCTTCCCC TCCCGTCTTC
TTGCAACAAG CCGCCCATCT GACGGATGGA GTGTATTGGC AATGGAATGG AAGAGGCGGT
TTGCTCCAGT ATCTTCACGT ACGTTCTTTT TTCTCGCCCT TTTATCATCT CATGCATCTG
CTTGCATTGA AGCTTGGTAA GATGTGATAG CAGGACAGCT GATGCGGTAA TGTCATAATG
GGAACTAGAG TATATACCTC ACACCACCTT CTCTCCGACA TAACCCGTTC GTCACCCCGC
CACAAGACGC TGTCAACTTT CGTGCAGTAT GTTTCTGTCA TCACAGGACG TTGGACGTGG
GATTCGTATG TAGCGTGTGT CTCTCCAGTA CGTTCCCCTC TTTTTTCTCT CCTTCACTTT
TCTCCCGCTT TTCGGTAAAG GGCAAAAGCT AACCTCACCC CTACAACACG TGTAGTCTTC
TGCGAACCAA AACCCATCTG TGCAATGTGT AAAACACGGT TTCCTATCAA GTCCATTCCC
AGACTCCGGA CGCTGGCCGG GTTGAATACG AGGATCCAAG TGCCTGATAC GGTCGTGAAA
CCCCCGGCAC CGAAATCGAG CAGCACAAAC GCCACAGGAG GAAAGGCCGG AGTTACAGGG
AGGAAGGGAG ATGATAGGGG TGAGCCAATC GTGATTGATT AGGATGGGTT AGAGTAGTGT
TTTCTGTTTT GTTTTTGTTT TGTAAGCATG TATCGACTAG TTTT
 
Protein sequence
MPAPPSTLIL VLDTHPLSWH LLAHLPPAPP LPDNKILDNA TSSPTSLDQF ITILTVFLNA 
HLASKWGNEV VVYTASAGKA ELIYPPSNEK IRQRGEGAKP SANMYRPFQI LDQGIEEGLK
EVVREEEGKL NTEGAGFINQ PPAMVSALTK ALCFINRRIS PSVPADPTAL PLSSDPNSGT
SDTSGGLLPS KERRQQRMRG GYVGLMNCVF AAQKAAKSPD RYPVPSSIDD RFFPSRLLAT
SRPSDGWSVL AMEWKRRFAP VSSRTVYTSH HLLSDITRSS PRHKTLSTFV QYVSVITGRW
TWDSYVACVS PSSANQNPSV QCVKHGFLSS PFPDSGRWPG
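
Both sequences are displayed in groups of 10 residues, 60 per line. To compare them with the lengths or the GC content listed under Gene Information, the grouping whitespace has to be removed first. The following is a minimal sketch; the function names `normalize_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first two groups of the gene sequence, not the full record.

```python
def normalize_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among all bases; raises on an empty sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two groups of the gene sequence above, used only as a short demo input.
demo = normalize_sequence("ATGCCGGCCC CCCCCTCCAC\n")
print(len(demo), round(gc_fraction(demo), 2))
```

Applied to the full gene sequence block, `len(normalize_sequence(...))` can be compared with the listed gene length, and `gc_fraction` with the listed GC content, which the record rounds to a whole percent.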