Gene CNN00640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00640 
Symbol 
ID3255345 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp207494 
End bp209642 
Gene Length2149 bp 
Protein Length607 aa 
Translation table 
GC content49% 
IMG OID638254481 
Producthypothetical protein 
Protein accessionXP_568733 
Protein GI58262646 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCTCCCTCA AAAAACATCT TCTGCTTCAG CGGATACAGC GTACGCCTCG GTACAGCGAC 
GTCCGCACTC CCATATCCGG TACTTGCACA CCTGCAGGCC GGATACTCTA CAAAAGTATG
GGCTCTATAA AAGTAAGTCA TCCCCTCCAT CCGCGTTGCC GCCCACCCGT ACCCGCGCAA
GACATTGACA AAGGTACACC AGAGTCACGT CCCTTTGGAC GCATATAATA ACATAAATCG
CATTGATAGA CAGAGAACGC CATGTCTGCC TGCAGCTACA GGTTTGGGGC TCATCAACAA
TAGTGATAAG GAAAACTTGG ATGGTGGATC AACCAACGCA GAGCCGAACG CCCTTGCACT
CGAACGGGAG GCGAGATCTG TACCGGCCCC GATAATCCAC TTTTTCAACC ATCCGTCTTT
GGCCAAGCGG CAGTCGACTG AAAAAGAGTC CATTGCGCCT GCTGTCACCA CCCCTCGTCC
TTTATCGAAT TCTCAGCTTA TCAATAGTGA TAGTAATAAT AATAATCCGC TCGACACGCT
TAATCGGGCT TACGTCAATG CCTTTGATCT TATCATGCGA CATAGAAATG GTCTTGGTAT
TATTCCTGTA GGTGAAGGAG CTAGAAGCTC GGGGACGATG AATGGGAATA CTGCAGACCA
GATTGGTTCT GGAGAAAGCG AAAAGATTCC AGTTGGAAAA GTGGACAAGC CAGTGGATGA
GGTGAGTTTA AGCTCTTCAA GCAGCTATAC TTTTAGCAAA AGGACCAAGC TGACTGCAAC
CTGATAGGAA CTCACTCCCA AATCAAACCT CACTATTCCT TCTAAACCGT CAAAACAGAA
ATCCATCCAC GGTTCCGAAA CAACCTCTCC TTCAACATCA TCTCTCTGTC TTTCGCCGGC
GACCTCTTAC GATTCAGATT CTAGTTTGGG CCATGGAGAG TGTAGCCATG ACATGCCTTT
AGCGGAGAAG GAAGGTGAAG ATGATAATGA GGATGAGGAT GAGGAACATG CGGGGGTGAA
GGGAGCGCTG GGCTTGTCAA CTGGTCTTTT TGTCAGATCC AGCGGGTTGA ATTTGGGCGA
TTCGGACGAG GAAAAGGAAG AAGAGGAAGA AGATAACATT GTCAGAGAAG ACGTACATCA
TCAAAGAACA GCGCAACTCT CTGATTTTGC TTCCTGTCTT TTCACCGTCC ACAGCTCGGA
GAGCGAAGAC GAAGATTTAC CTTACGAATC TGACCGTGAT GATCAAGACG ATGTCGACTC
AGTAGCATCC TCCTCTTCCT CCTCGTCATC CTCCTCCAAT CGCCTTCGCC CAAATTACTC
TTCCATTCAC CACAGTGTTT TCGTATCCCA GTCACATGAA TCATCCGACT CTGACTCTGC
TTCCGACATG GATGCAGACT CGGATCCGGA AGATGGTAAT ACTGGACACA TACGGGATGA
CGACGCGCTG TACCTCCCCG AAAGTCCCTG GCTCATCCAA TCTCCCGTTC CCGTGATACC
TAAAGGGAAC AGTAGCCCAT TAAAAAGCTC TCGGTCGACT TCGCCACTAC CTTCAAGGAC
GATAGCCTCT CCTTTGAGTT CAATATCATC CTTTCCACAA TCACAACACC GAATATCTAC
TCTTCCATCC CAGCCCCAAC CCCAGCCCCA ACCCCAGTTC CAACCACAGC CTCAGCACCA
TCATTATCAC CAAACCCCTC CAAACCATAT TCAACCCCCA CCATCCCCCC ATCCCGTCCA
GCGATACATC CTCACCGAAG CACCCGCTCT TCCCGTGTAC GAGCGATATC GACCCATCCC
AATCCGCGTA CCCGTCCTGC CCTGTCCGTA CTCGTATTCA AATATGATAA TGTACAGTTG
TTATTCTTAT ACCGCTGGCG GACCTTTGGG CGGTGATCAG AGTGAGGTTC AACCTCAAGA
GCAAGTCTTT GGGCAGGGAA AGAGGGAGAT TAATGAGCAG GAGAGGTTGG ATGAAGAGTG
GGCGAGAAAA GAAGAAAGGA AATTACGAGA AGAGGAAGAG GAGAGATTGA GAATTCGGCG
GTATGCGGCA GAGTATAGTT CGCCGAGGTT GTATTAGCAT CGCTCTTTCG AGTATCATCT
ATTTGAAAGA CGTAGTCACG TCGCGCCAGA CTGGCGTTCG AAAACATTG
 
Protein sequence
MGSIKSHVPL DAYNNINRID RQRTPCLPAA TGLGLINNSD KENLDGGSTN AEPNALALER 
EARSVPAPII HFFNHPSLAK RQSTEKESIA PAVTTPRPLS NSQLINSDSN NNNPLDTLNR
AYVNAFDLIM RHRNGLGIIP VGEGARSSGT MNGNTADQIG SGESEKIPVG KVDKPVDEEL
TPKSNLTIPS KPSKQKSIHG SETTSPSTSS LCLSPATSYD SDSSLGHGEC SHDMPLAEKE
GEDDNEDEDE EHAGVKGALG LSTGLFVRSS GLNLGDSDEE KEEEEEDNIV REDVHHQRTA
QLSDFASCLF TVHSSESEDE DLPYESDRDD QDDVDSVASS SSSSSSSSNR LRPNYSSIHH
SVFVSQSHES SDSDSASDMD ADSDPEDGNT GHIRDDDALY LPESPWLIQS PVPVIPKGNS
SPLKSSRSTS PLPSRTIASP LSSISSFPQS QHRISTLPSQ PQPQPQPQFQ PQPQHHHYHQ
TPPNHIQPPP SPHPVQRYIL TEAPALPVYE RYRPIPIRVP VLPCPYSYSN MIMYSCYSYT
AGGPLGGDQS EVQPQEQVFG QGKREINEQE RLDEEWARKE ERKLREEEEE RLRIRRYAAE
YSSPRLY