Gene CNJ00400 details

Gene Information

Locus tag: CNJ00400
Symbol:
ID: 3254253
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006679
Strand:
Start bp: 104471
End bp: 108218
Gene length: 3748 bp
Protein length: 828 aa
Translation table:
GC content: 48%
IMG OID: 638253197
Product: DNA topoisomerase type I, putative
Protein accession: XP_567296
Protein GI: 58259767
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID:
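
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, that the protein length excludes the stop codon, and that the listed span runs from start codon to stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 104471
end_bp = 108218
protein_length_aa = 828

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the protein length does not count the stop codon,
# so the coding sequence is (residues + 1 stop) * 3 nucleotides.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever remains within the gene span would be non-coding
# (consistent with the record marking the gene as spliced).
noncoding_length_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)       # 3748
print(coding_length_bp)     # 2487
print(noncoding_length_bp)  # 1261
```

Under these assumptions the computed span matches the listed gene length of 3748 bp, and roughly a third of it would lie outside the coding sequence.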


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.772015
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTCC TGTGCGTCGC TGAGAAGCCA AGTATAGCGA AGAGTATCAC CGAGATACTC 
TCCGGAGGAC GGTGGGATAC TGTAAGTCCC TATCTCCATT TCCTTCTCCT TTCGTAAATG
TCAAGAAAGG GGCAGAGACT GATAAATTGG CTTCCCTTTT GGACAGAGGA ACGGAAGACA
CCAATATATA CGCAATTACG ATTTCTTATA CAACCTTGCT CCGCCGCTTG GGAATGGAAG
AGGTGCCAAT TTTACCGTCA CAGCAGTCTT GGGACATTTG ACTTCAAGTG TTAGTCACCA
GTCAATTATT CGATTAAGTA GCTCTGTAAA CGGGGATGTG AAGGGTGTAA TGAAACAAAT
GAGAGAAAAG GAACGCTGAT AGAAGCTTTT TGGGAAGTAG GATTTTGATG ATGATCATCG
GAAATGGGGG TCTTGTGATC CGTTTGCGCT GTTCGATGCA CCAGTCATCA CTTTCGTCGA
CCAAGTACGT CTAATCTCTT GCATTCCCGC GGCAAATGTG ATTAACATGG TACCTTTCTG
GCGTAGAAAT TGAAAACAGT AGAATCAAAC CTCCAAATTG AAGCCCGTAA CGCCGACATA
CTCATGATTT GGACTGATTG TGATAGAGAG GGAGAACATA TCGGATCTGA AGTCGTAGCG
GCGTGTAAGA AAGTAAATAG GAATATCCTT GTCAAAAGAG CGAGGTTTAG TGCCATTATC
CCTGCGTGGG TCTCCTTTCC CAGGATGAAG GCTTGAAACT GATAAGCTAA CGCGGTATTT
GTGATGATAT TAGGCAGATC CACCAAGCAT GTAGACAGGC GAACGACTTG GATATGAGGC
AGGCAGATGC GGTGGCCACG AGGATAAGCT TGGATCTCAA GGTTGGATCG GCGTTCACAA
GGCTTACAAC GATGACTTTG CAAGTACGAG TGCCCGATCT AGCAGAGCAA CTCATCAGTT
ACGGTAAGTC CTTCCTTCTC GCATCCCCGG TTTTTTTCCA AGAAGACCAC ACTGGCTGAC
CCAAGACCGC CTTCCTTCCA TTATATCCTC TTTCAGGTCC ATGCCAATTC CCAACTCTCG
GCTTTGTCGT CGAACAATAC ACCCGCGTCC AAGCCTTTGT TCCTGAAACG TTCTGGTACA
TCTTCGTCGC CCTCGAGCGA GAGCATGAAG ATGGTGAACC CAGCACTGTC GAGTTTAGGT
GGAAGAGGAA CCATCTGTTC GATTTAGATC TGGCGGCGGT GTTGTACGAA CAGTGTACAG
TCAATCCGCA GACGAGGGTT TTGAAGGTCG AATCAAAGCC CGCGACCAAG TGGTGGGTTC
CCATGGTTTA TAATGAGGGA GGGTCAGTAC TAAAAGCTAA GAATAGGAAG CCATTACCGT
TGACGACAGT TGAGCTTCAA CAATCTGGTT CGAGGTTACT TCATATGACT CCTAAACGAA
TTCTCGATGT AAACTATATC ATTACCAACA TCACTTGCAT AAGCTGATGG TGTTTGTAGC
TTGCCGAAAA ACTCTATCAA AAAGGTATTG TTAGTTATCC TCGTACCGAA ACCGATCAGT
ACGATCCCAA ATTTGACTTC AACTCTCTCA TCCAAAAGCA AACGCTTGAT AACCAATGGG
GAGCTTACGC TCAAAAGTCA GTCATTTTCT TCTCACCCAG AGCCGAATTT TTACTAATTT
ATAAATGCTA AAAAAATGAA GGCTACTAGA CGGTGCCTTT CAGAAACCCC GAAATGGCCG
TAAAAACGAT AAAGCCCATC CCCCTATCCA CCCTACCGCA CACGCGGGCA ACCTTGAAGG
CGACGAACGC CGCGTATTCG AACTCATCAC CCGTCGTTTT CTCGCTTCTT GCTCGACCAA
CGCCGAAGGC CAAAATACCA CTGTCGAAAT CTCCATCGCC GATGAGATCT TCTCCACTAC
AGGGCTTGTC GTCCTGAGGA GGAATTATTT GGAGGTGTAC CCGTATGACA AGTGGGCGAC
GCATGCGTTA CCGAATTTTG AGGAAGGGGA GGTTTTTATA CCGGATGTAA TTGATTTGAA
GGAGGGAACG ACGAGTCGAC CAAGTTTGTT GACTGAGGCT GATCTGGTTG GGTTGATGGA
TAAGAATGGC ATCGGTGAGT TTGTTTGTCA TGTTCGCTTG CCATCTCTCC ACCTTGGTCA
GTTATTGACC GAGACATGTA ACAGGTACCG ATGCAACCAT CGCTGAGCAT ATTGCCAAAA
TCATTGAACG CGGGTATGTT ACAGAAAAAC AAGAAGCTCG AATAAAATAC CTCGTGCCAT
CTACTCTGGG TATCGGTCTG GTCGAAGGGT TCAATGCTAT CGGTTTTGAC CGTTCTTTGA
GCAAACCACA TTTACGACGA GAGGTGAGCC TTTTTTTATC ATCACTTACA CGTCGAACAT
CAAGCTGATC GATGAATGTA AAAGACTGAG CATCGAATGC AATTGATATG TGATGGTGTA
CGAGGGAAAA GAGAGATTCT CCAGACTACG ATTGAAGAAT ACAAGGAAGT TTTCGTCAAG
GCCCGTCGAG AGTTCCAGAC CGTTATCGAT GTCAGTTTTC CCGCCTTCTT CCCAACGAAA
ATGCAAAACT AACGTGTGTT TTTCTTATAA TCAGTGCGTG GAAAACTACC TTCACGGAGC
TGGCGAAGCC CAAGAAGCTC TTCGAGCAGC AGCACGGGGT GGTCGAGGCG GACGAGGAAC
TAGAGGAGCG AGAGGTGCAA GAGGGGGGAG AGGAACTGGA GGAAGAGGCA GAGGTGGTGG
AGGAGTTGTC GCTCCCAGAG GCGGTCGAGA TGATGACAAT GACGACGACG ACGATGATGA
TGATGACCAA GGTCCGCCGA GAGGTGGTGT AAGAGAGAGA GCGAGAGGTG CGGCGACGAC
GAGAGGGAGG GGAACAAGTA CTGCCAGAGG ACGAGCGGGA GCTGCTGGAG CTGGGACGGC
TGCTGGTAGA AGGGCAGGAT CGCCTACTTT TGGTAAGTCG CCTTCTTCTC CCCATTGTCA
GAATCTAACT AAGCTTGAAT TGATTGATTT ATTCGGTAGG CGCGGATGAT GGCGGTGGCG
ATACGAAAAT GTGCAACTGC GGTAGAGAAG CTGTCTCTCG CGTTGTTGCC AAGGCGGATT
CGGCGCATAA GGGAAGATCG TTTTGGACAT GTCCTCAGCC TCAAGGAGAA CAATGCGGCT
TCTTCGTCAG TCTACTGCTT GCGTATCACA ATAGGCGGCT ACTGACGTGA GGCGCTTCAA
GGAATGGGGG ACGGATGGAG ATATGGCACG CTCGGCTGGC CCTAGTAAAT CACGAACAGC
CAGTGCTCAA CCGCCGCCAT CCAAAAGACA AAAAACCACG GTAAGTATGT TCCATGCCCC
CTTCGCGTCG AAGGAAAAAG GCACTGATAG TGAGTTTCAG AATCGGAACG ATCCCCCTCC
TTCGAAAGAT GGCGTACCTT CGTGTAAATG TGGCCTTGAT GCGGCTTTCG CCACAGTAAT
CAAAGAAGGT CCAAACAAAG GTCGACAGTT CTGGTAAGTT ATCAAGCTTT ACACGTTTGT
GCCGATGAAT GCTACTTGCT AAAGTGCAAT TCCGTCTTAC TTAGGGCTTG TCCGAACAAC
CCGAAAGCCC GTTGTGGGTT CTTCCAATGG GAAGATGATC CTAATCTCGG CAGTGGTGGT
TCTAATGGAG GTGATACTTA CAACGGAGGA GGAGGAAGGA GTGGAACCTC AGGAGGTTTG
TGCTTCGTGC TTGAAAGGTA CACCCTAA
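
The sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tooling, and the record does not state how its own GC content figure was calculated. Only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGCGTGTCC TGTGCGTCGC TGAGAAGCCA AGTATAGCGA AGAGTATCAC CGAGATACTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To check the full gene, paste all sequence lines into one string and pass it through the same two functions.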
 
Protein sequence
MRVLCVAEKP SIAKSITEIL SGGRWDTRNG RHQYIRNYDF LYNLAPPLGN GRGANFTVTA 
VLGHLTSSDF DDDHRKWGSC DPFALFDAPV ITFVDQKLKT VESNLQIEAR NADILMIWTD
CDREGEHIGS EVVAACKKVN RNILVKRARF SAIIPAQIHQ ACRQANDLDM RQADAVATRI
SLDLKVGSAF TRLTTMTLQV RVPDLAEQLI SYGPCQFPTL GFVVEQYTRV QAFVPETFWY
IFVALEREHE DGEPSTVEFR WKRNHLFDLD LAAVLYEQCT VNPQTRVLKV ESKPATKWKP
LPLTTVELQQ SGSRLLHMTP KRILDLAEKL YQKGIVSYPR TETDQYDPKF DFNSLIQKQT
LDNQWGAYAQ KSKPRNGRKN DKAHPPIHPT AHAGNLEGDE RRVFELITRR FLASCSTNAE
GQNTTVEISI ADEIFSTTGL VVLRRNYLEV YPYDKWATHA LPNFEEGEVF IPDVIDLKEG
TTSRPSLLTE ADLVGLMDKN GIGTDATIAE HIAKIIERGY VTEKQEARIK YLVPSTLGIG
LVEGFNAIGF DRSLSKPHLR RETEHRMQLI CDGVRGKREI LQTTIEEYKE VFVKARREFQ
TVIDCVENYL HGAGEAQEAL RAAARGGRGG RGTRGARGAR GGRGTGGRGR GGGGVVAPRG
GRDDDNDDDD DDDDDQGPPR GGVRERARGA ATTRGRGTST ARGRAGAAGA GTAAGRRAGS
PTFGKNGGRM EIWHARLALV NHEQPVLNRR HPKDKKPRIG TIPLLRKMAY LRVNVALMRL
SPQSKQRSTV LMILISAVVV LMEVILTTEE EEGVEPQEVC ASCLKGTP