Gene CNK00740 details

Gene Information

Locus tag: CNK00740
Symbol:
ID: 3254551
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 239383
End bp: 241414
Gene Length: 2032 bp
Protein Length: 475 aa
Translation table:
GC content: 46%
IMG OID: 638253563
Product: hypothetical protein
Protein accession: XP_567636
Protein GI: 58260452
COG category: [L] Replication, recombination and repair
COG ID: [COG1041] Predicted DNA modification methylase
TIGRFAM ID:
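
The Gene Length value follows from the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption, although it is consistent with the numbers in this record. A minimal sketch of the arithmetic, using a hypothetical helper named gene_length:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(239383, 241414))  # 2032
```

Under a 0-based, half-open convention the same coordinates would give 2031, which does not match the reported 2032 bp.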


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.635907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GACCCAAGCA TGCCCCTTTA TGCCCTCCGA CTGGTCATAG ACCACAAGAC GTTCCGGCTC 
CCATCGCTTC TTTCCATATC GCAAGTCTTC GGCTTTCCCA TTCGATTTGT GTCAGAGGAT
AAAACCCGAG GATTACTTGT GATAGAGCTG GAAAAAGATG ACGATATGGA GAAAATTTTG
GACAGGGAGA CATTAGTGCT GTAAGCCTTA CTGCCTTGGT GCTGTAAACC TTACTGCCTT
GGTCTTTTTT TTTTTTGAGA GCAACGCTGA CTGCCGTTGT AGCTCTGCAG CTGAAGTCTA
CGCAGAAGGC GCAACTTATG AGGAGCTTCA TGCGCAGCTG AAATCAAAGC TCCATGTGCT
GGATCCTTAC AAACACTCTT CCTTCAAGGT TATCATTGAA AGTGCTCACC ACTCCATCCC
TGAGCCTCGA CAAATGTAAG TTCAGTCATT TCGTAATTTA AGATATAGAT TATGTTGCTG
ATTGACCTGC ACAGAGAAAC TATCAACTCG TTTAAATACA CTGGTTTAGA AGGCAAGATC
AGGTTAAAAA ATCCAGAAGT GGAATTCATT GTTTACGAGG ATTGTGAGTT GTTCTCATTG
CCTTTTGCGT CATATTAGAT TGACAGAGCG CATCAGACGA TTGGGTCGCC GCTAACACCC
CCGAGCAACG CCTTGCGCGA GATGGCAAGT TTCACCGAGT TTACTTTGGT CGAAAGGTAA
GTTTCATTTC TAAAGTGATC TGTGAACTGC CTCTCACCGA GCTAGATCGG AAACGGTCGA
GCTAGGCAGT TGATTATCTC TCACTCCGTC AAAACTCGAG CATACTACGG CAACACCTCC
ATGGAGGCAC AGATGGGATT CCTTATGGCT AACCAAGCTC TAGTGAGTGA TCGCTTGGAA
TTCCTCATGG CAGGCTAATG GTATTCTTGT AGCCTGCCCC GGGAAAACTC ATCTATGATC
CCTTTGTTGG AACCGGTTCT ATGCTATACG CAGTCGCACA ATTCGGAGCT TATGTTATGG
GTTCAGATAT TGATGGCAGA CAAATCCGAG GCAAAAGTAG GTTGATCTAT TCTGGCTTCC
AATTGCCTGA ACTGAACAAT ATCATTAGAG AAGGGCAAAG GAATCAAGCC CGGAATTCTT
CGTGCTGCTG AGCAATACGG ACTCCAGGAT AAATTCCTGG ACTTCTATAT TTTCGATGTC
ACTCGAGGTC CCATCCGTCG GGGTGGATGG ATTGATGCTA TCATTACAGA CCCTCCTTGT
ATGTCGTATC CTTTGTTCGA TTACCCGACA CTAATGGTAC GTGCTGGTAG ATGGCGTTAG
AGCAGGTGCA AAGCGTCTCG GTCGCAAGGA GGGAAAGAAG CCTTTGAGGG AAGAGCCGTA
TCAATTACCT GATGGTACCT ACTCTCACGA GTACGTTGTT ATCTCTTCCA TCAAGCATTA
CATAGGTATA CTGACTTCAT TTTCCAGACG GTCTGACTAT ATCCCCCCTT CTCGCCCTTA
CGAACTAGCC AACCTCACTC TCGACCTGAT TCTCCTTGCG CGATGGATCC TTGTGCCCAA
AGGCCGTTTA GTTTTCTTCT TGCCAACCGT TAATGAGGAT TATGACGAGA TCGATATCCC
TAAAGTAGAA GGAATGAGGG AATTAAAGAT CGGGGATGGA AGTGTGCAGG ATTTTGGAAA
GTGGGGTAGA CGAGTGTGTC ATGTTCGTTC ATTATATTTT CGAACATTAC ACTGACGAGC
TGAGATAGTT AATTACAATG GAGAAGACCG CACTTGATGA TGGTGAGCCC CCAATGTTTG
AGGACCACGA AGAATTCAAG GAAGGAGCGG AGGACTTGCC GGGTCATTTC GGCTTCTACA
AAAGAGTAAG CCAAGATCGC TTTTGTACTG ATTGACGGAC AGGAGGAGCT GATAGCTAGT
TTAGTATTTG AGCGGGTTCA AACCAAACTC AAATTCAGCC AGTCCCGATC CATCGACCTC
AATGGCTAAG TCAAGAGACA CATAGACATC CGCGTAATGG ATGCGGCGAG AC
 
Protein sequence
MPLYALRLVI DHKTFRLPSL LSISQVFGFP IRFVSEDKTR GLLVIELEKD DDMEKILDRE 
TLVLSAAEVY AEGATYEELH AQLKSKLHVL DPYKHSSFKV IIESAHHSIP EPRQIETINS
FKYTGLEGKI RLKNPEVEFI VYEDYDWVAA NTPEQRLARD GKFHRVYFGR KIGNGRARQL
IISHSVKTRA YYGNTSMEAQ MGFLMANQAL PAPGKLIYDP FVGTGSMLYA VAQFGAYVMG
SDIDGRQIRG KKKGKGIKPG ILRAAEQYGL QDKFLDFYIF DVTRGPIRRG GWIDAIITDP
PYGVRAGAKR LGRKEGKKPL REEPYQLPDG TYSHERSDYI PPSRPYELAN LTLDLILLAR
WILVPKGRLV FFLPTVNEDY DEIDIPKVEG MRELKIGDGS VQDFGKWGRR LITMEKTALD
DGEPPMFEDH EEFKEGAEDL PGHFGFYKRY LSGFKPNSNS ASPDPSTSMA KSRDT
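
The sequences above are laid out in blocks of ten characters, six blocks per line. To compare them with the Gene Length, Protein Length, and GC content fields, the spacing has to be removed first. The following is a minimal sketch. The helper names clean_sequence and gc_fraction are hypothetical, and it assumes that GC content is the fraction of G and C bases over the whole gene sequence, which this record does not state.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Illustrative input: only the first line of the gene sequence shown above.
first_line = "GACCCAAGCA TGCCCCTTTA TGCCCTCCGA CTGGTCATAG ACCACAAGAC GTTCCGGCTC "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence block instead of first_line gives values that can be compared with the reported 2032 bp and 46%. Running clean_sequence on the protein block and taking len of the result can be compared with the reported 475 aa.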