Gene CND04120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND04120 
Symbol 
ID3257040 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp1141992 
End bp1145564 
Gene Length3573 bp 
Protein Length902 aa 
Translation table 
GC content51% 
IMG OID638256347 
ProductPol II transcription elongation factor, putative 
Protein accessionXP_570584 
Protein GI58266856 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCGA GACGCCTCCG CCCAGTGATC GAGCTCGCGC CCCTCCCCCC CGCCGTCCGC 
GCCCTCTACG CCCGTCCCCC CTCCGCTGCG CCCGCCCACA CGCAGCACCG CACAGTGAGC
CGCCCCCACC CGCCTCCACC CGCGATTCAC CACTCACCAC TCCCAGTTCT GCGAAAAATG
CCGCCGGCCG CCCGCAGCCC TCATCCTCGA ATCCCTGCAC AGCCGTCCCA AGAAACGCCC
CAGAACGAAG AGGCCGTCCC CGGAAGACGA CGATCTGCTG AGTGACAGCG AGCTCGTCCA
TATCCTTCAG GGCTGGGTCA CCGTGAGCCC GCCCAGCTTT TCCAGAGAGC GCAGATACTA
AGCACAATCC CATAGTGCAG ACGGTGTGTC GTCGCCTCCC ACTATGGCTG TCTGTCGGCC
AGCCAGAAGA AGGCAGTCCT CGAGTCACTC CGCTCGCAAG ACCTGGCTGC GCTTGGAAAC
ATTGACCCTA CCACGCAGGC ATCAGACGTT CCTCTCCGGA AGACTGTGCC TATTCAACAA
GAGGCGGACT TTCTGTGCGC GAAATGCGGC GAAGGAGTAC CCTGTTTTGT TTGCTGCAAA
GACGAACTTC TAAACGTGGA TCCAGTCACA GTGGGCCGGA GAAAAGACGA TGACGCCGGG
GTAGTGGACG TTGGCAAAGA GGACAGCATG ACGGACATGG ATTCATCAAA GGCCCCCCAG
CTTCCACCCT TGGCGGAGGC AGTCTCTGAC AAGGAAATCG GTTCACCACA GCCCAAAAGC
ACTCGACCTC TTTTCAGGTG TTTGCGATGC AGACAAGCAG CTCATTATGA ACACTGCAAG
TAGCTCGGAT TGTTCCCCGC ACGACTGGCC ACTAATGTCT AATGTCTATT TGACGTACAG
TAAAAGTTCC TGAACCGCTT GGCGAAAACC CAGACCTGGC AGAGATAGCA CACCACTATC
AGACTCAAAC GGACGACGGT GATGCCTGGA CGTGCCATCA ATGTCGTGCA TCACCATGGG
GTATTGATAT CGTGCGTCAT TCTTTCGCTT CGCAGGTCAC ACCTTGCGAG ACCCTCTTCT
TATGATGCAA CCCGACGTAG GTGATCGCAT GGCGTCCTCT CCCTGCCACA TCTCCTTTCC
ACTCTTTGCC CCCTCCTCCA CCCAAAACTC CACTATACAA GACCGTCCTC CCCAGAGAAT
ATCTCATCAA ATACTCCTCT CGATCGTTCC GCCATGTCGA ATGGGTTCCC CACGCCTGGC
TCTCCGGCAT CGCCCCTATG AAACTCCGAC GATTCCTGGA AAAAGGGCCC CTACTGGACT
TGGTAACAGA TGAGACATTA GAAGCCAGAG GTGATGAAGT GGTCAGGCCT AGTATCGCGG
ACATTGATAA CCGTGCTCAC GGAAAAGATG TACATCATGA AAAAGGGGTG GGGCTGATAG
TAGATGCGGA AGAGGATGCG GAGAGTGGTT TGCCCATTTT ATGGAGTGTA AGCGGTTTGC
CATGATCGAA CAAGGCTGTT ATAAAAAAGC TGACAGCTGG GTGAATAAGA CAATTGACAG
AGTCTTGGAT GTCTTGCTGA TTCGACCTAC CGCTGCCCAA ATCAAGGGGA AGAGCAAAAA
GTCTCAATCA GGAAATGACA GGCGGAATCA GCGACGTATC GTCTCTCTGT CTCCTTCTCC
ATATGACGGT GCCGCCCCAT TTCCACCCGA TTCCAGTCCC AATGCACAAG CAGTAGCTGT
CAAGACGCCT TTTGAACAGC TTCAGGCAGA TCTCAACATC CCGGACGGCC TTCCATTACC
TGACGACGAG CTCATCGAAA TTGAGGAATG GGAAATACTT ACGGGAAGAA GGCTGCTCGA
GACAGATGTA GAGGAAGTCG CAGGATTGGT AGGATGGGGC CTGTTCAAAT GGCAAGATTT
GCAGTATGAT CAAGGTAAGA CAACCCGGTG TAACTTTTGG CGAACGAACT TGCAAAGCTC
ACCCGCGGAC CGAAAAAGCA TGCTGGGATA CACCTCCACC TTCGTCATCA CCGCTTTACA
CTGCTTATAA GCACGGTCTT TCCAAATACC TTGCAGCCCG GCACATCACC ATCCCCGTCC
TGACCCCAGC CCAAATTCGC GCCAGGGATA ATGATCCCGC AAGGGGCTTT GTGCCCCCGC
AAGAACAGCC GGATTGCATC GCCGGGGGCA ATTTGATGCC CTTCCAGATG GAAGGGTTCC
AGTGGTTGTT GTACAAATAT TTCAAAAGAG AGAGCTGTAT TCTGGCGGAT GATATGGGTT
TGGGCAAGAC GTAGGTGGTG TTTGCACAAG ATGGTGGCGC CCGTGCTGAT GGGTTGTTTG
CTTTCTTGTT TATACTTGCA GTGTGCAGAT CGCCTCTGTC TTGGGTTATC TGGGCTCTGC
CGAACATGAG ATCTATCCCT GTCTAGTAGT CGTGCCGAAT TCGTACGTGT GGCAAGACAA
AAACGTGGAA TCTGAGGCGG GTAGCGTCAT ATACTGACAA ATATGGTTGC TATCGCTAGA
ACAATCACCA ATTGGGTCAG AGAGTTTGAA AAGTGGAGTC CGCACTTACG AGTTGTGAGT
TTTTTTTTTT TTTTTTTTCT TTGCGTTCTG TGACACTCCA CCTGTTCTAT GGCTGATAAA
TCGCGCCTAG GTTCCCTTCT ATGGTGAAGC GGCTTGTAAG TGTTTCCCTC CTTTCAATGA
AAACTAGAGA GTTTAATCCT TGCTGATGGC CATCATTAGC ACGCGAGATT ATTTCAAAGT
ATGAATTATT TCATAAAGGG ACGCAAGGTA AACCAGTGGG CCTCAAAGCG TGAGTCTGTC
GATCACACTC TTCCAAAAAA AAGAAAGAAA AAGCTGCTGA CTGGCAAGCA CCCGGATCAG
CCACATCGTC TTGACAACAT ATGATATGAT AACTTCTTCA GAATTTCGTG TTTTCTCTGC
CATCCCTCGA TGGGAGGTGC TATGCGTGGA CGAGGGGCAG CGACGTAAGA TAATAACCGT
TCTTTTCTGC ATTTTTGCCT CACCACGCTA AATGTTTATC AGTCAAATCT GATAATAGTA
AAATATTCAA CAACCTCAAA ACTCTCAACT CTGTGCACCG AATACTTCTC ACGGGCACGC
CGCTGAACAA CAATATCCGT GAACTATTCA ACCTGCTCAA TTTCTTGGAT CAAGATAATT
TCAAAGACTT GGAATCTATG GAGCAGGAAT ACTCAGACTT GAATGAGGTC AAAGTGCAAA
AGCTGCACCA GATGATTAAA CCATACATCC TTCGAAGGAT CAAGGCCGAT GTGCTCAATC
TACCGCCAAA GGTAAGTTCC ATCCATATCA CACAATTACT TTTTTGTCAT TTGACCAGAT
GTTGAATCCC CTCGGCAGGT CGAAATTATC GTCCCCATCT CACTCGCCCC CTTGCAGAAG
CAAATGTACA AAGGTATATT CGAAAACCAT GCCGGGATAA TCGAAGACAT TCTCAAAGCG
AGGCAAAAAA GGCGACAGGC AATAAAGTCT GTTCAGTCCG CAGTGCCAAC CAATTAGTCG
AGCATAATTA TAATGGAAAG GGTCGGGATA ACA
 
Protein sequence
MPARRLRPVI ELAPLPPAVR ALYARPPSAA PAHTQHRTFC EKCRRPPAAL ILESLHSRPK 
KRPRTKRPSP EDDDLLSDSE LVHILQGWVT CRRCVVASHY GCLSASQKKA VLESLRSQDL
AALGNIDPTT QASDVPLRKT VPIQQEADFL CAKCGEGVPC FVCCKDELLN VDPVTVGRRK
DDDAGVVDVG KEDSMTDMDS SKAPQLPPLA EAVSDKEIGS PQPKSTRPLF RCLRCRQAAH
YEHLKVPEPL GENPDLAEIA HHYQTQTDDG DAWTCHQCRA SPWGIDIVIA WRPLPATSPF
HSLPPPPPKT PLYKTVLPRE YLIKYSSRSF RHVEWVPHAW LSGIAPMKLR RFLEKGPLLD
LVTDETLEAR GDEVVRPSIA DIDNRAHGKD VHHEKGVGLI VDAEEDAESG LPILWSTIDR
VLDVLLIRPT AAQIKGKSKK SQSGNDRRNQ RRIVSLSPSP YDGAAPFPPD SSPNAQAVAV
KTPFEQLQAD LNIPDGLPLP DDELIEIEEW EILTGRRLLE TDVEEVAGLV GWGLFKWQDL
QYDQAHPRTE KACWDTPPPS SSPLYTAYKH GLSKYLAARH ITIPVLTPAQ IRARDNDPAR
GFVPPQEQPD CIAGGNLMPF QMEGFQWLLY KYFKRESCIL ADDMGLGKTV QIASVLGYLG
SAEHEIYPCL VVVPNSTITN WVREFEKWSP HLRVVPFYGE AASREIISKY ELFHKGTQGK
PVGLKAHIVL TTYDMITSSE FRVFSAIPRW EVLCVDEGQR LKSDNSKIFN NLKTLNSVHR
ILLTGTPLNN NIRELFNLLN FLDQDNFKDL ESMEQEYSDL NEVKVQKLHQ MIKPYILRRI
KADVLNLPPK VEIIVPISLA PLQKQMYKGI FENHAGIIED ILKARQKRRQ AIKSVQSAVP
TN