Gene CNK00440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK00440 
Symbol 
ID3254401 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp131800 
End bp133564 
Gene Length1765 bp 
Protein Length467 aa 
Translation table 
GC content49% 
IMG OID638253538 
Productexpressed protein 
Protein accessionXP_567613 
Protein GI58260406 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCATACTGA GGGGATATGC CTTAGTACGA CATGTCAAAT TCATCCTCTG TTAGAAATCC 
TCTTCAGCGC GCCCATGCTC TCTCAGCCCA AGCATCTAGC CTTTTGGATT CCGATCAGCC
GTCGGCCCAA GCAATGAGCC AGGCCCTACA GGCTTACAGA GAAGCAGCCA GTCTATTTGA
AGCTTCTGCT GGTGATTTGG ATGTCGATGA GAGCACGAAG GTGACTTTGA AGTTGTTGAC
AACGCAGCAT CGCAAACTGG CAAGGGACCT CGAGCGGAGG ATTAGTATTG CTGCCAAAGG
GAAAATGGCC ACTCCTGTGA ATACGCGACT TGCTGGAATG CCCCCAAACC GGAGATATTT
CTCTGAAGGT TCACCTGCCA GAGGCTATCT CCCTGATAAC AAACAGGTCG CGGGCGGAAT
GACACCTCTG CGGGTTGACT CGAGTAGGTT TTCCGTGCCT TCCTCTCATG GACCAAGCTC
ATAAGGCTTT AAGCCGCGCA GCACTTATCA CATACTCAGG AGGTACCACC ATTCGCATAT
AGGCCTCCTC CAAATCCGTC CGGTTCTCAG CCAAATTCTG CTTCCATCTT CTCGCCATCC
GATGCTCCGT CACCTTTACA ATCCTCTTCG TCATCTTCTG AAGCGCCTGA GGAGTCTTAC
GTGCATTTCG GCGCTCCCCC GGATATAACC GACCCATTCA ATCGCTTTTG GGCCATGCTG
GATAATATGC TGGAAGATAT ATCTAACCCC ATTGCTTTTG CCAGTGCACC TCTTGATACA
CTAGGTCCAA CCATTCCACA ACCGAAAAAG TCACAGGAGA AACGCAATAG CAAGAGTGGT
AAAGAAGAAA AGAGAAAGGA AGAGTCACCA TCACCTACTG ATTCCTTTTA TTTGGTTCAT
CCAAAGGGCA AGATGACACC CGAAGGATCC GAGGATGGGG ATAAACCTCG CGGTAACCCG
TGCGTTAATA TCGATGTACA AGTCCATCAA ATAGTGCTGA CATACCGTTG TAGAGCCCCT
TTAGCCAAAA CGCCAGAGGA ATTGGCCCTG GAAAATACAT CTTTACGGCA TTCCCTCGAC
ACCCTTGCTT CACACACTCA ATCTCTAGAA CAGACAAACC GCCTTCTCAA AATCCAGTTA
GAAGAGCGGG ATAAGAAGTT TTTGGCAGCC ATGGCAGGTG TAAAGAAGGA AGCTGCTCGG
GCCAAACAAG GACAAGAGCT GTGGCGCAGT CAAGTCATGG CTGGATCTAT CATACCGGTT
AGAGCGCCGA GGGTGGACGG AAGTCCGATT AACAGTGTTC CTGGGTTGAA GGATGGGCCA
GGATCGGATA GCACGGGTCA GTTCCTTACA CATCATTCAA AGCGACAGAA TGTCCTGACA
TATCGCTGCA GTCTTGCGCA AGAGGATCAA AGAATTGGAG GAGGAAGTGA AGGCCCTGAA
GCTCGAATCA GAGAAGCAGA AAAACCACAT TGAAAAGTAC AAAGGAAAGG TGGGTCCGAT
TTGTCTAAAC TCTGGAGCGC GACAGGAAGT TGATATGGTC TTAGTTCGAA AAGCTGAAAG
CAAATGCGCG AGCCAAGAAG GAAGCTAAAT TAGCTGCAGC TGCCGCTGAA GCCACCGCGC
AAGGACAGAA TTCTTCTACC CCGACATAAA GCTGAACGTC TTAGACAACT GCAGCATCGT
ATGATTATCA GCCATCTCAT CTTGTAATAC GGAATTGTGT TTTCTCTGGC ATAGAACCAG
TGATGTTGTA CTATCTTTAC AGGAA
 
Protein sequence
MSNSSSVRNP LQRAHALSAQ ASSLLDSDQP SAQAMSQALQ AYREAASLFE ASAGDLDVDE 
STKVTLKLLT TQHRKLARDL ERRISIAAKG KMATPVNTRL AGMPPNRRYF SEGSPARGYL
PDNKQVAGGM TPLRVDSTAQ HLSHTQEVPP FAYRPPPNPS GSQPNSASIF SPSDAPSPLQ
SSSSSSEAPE ESYVHFGAPP DITDPFNRFW AMLDNMLEDI SNPIAFASAP LDTLGPTIPQ
PKKSQEKRNS KSGKEEKRKE ESPSPTDSFY LVHPKGKMTP EGSEDGDKPR GNPAPLAKTP
EELALENTSL RHSLDTLASH TQSLEQTNRL LKIQLEERDK KFLAAMAGVK KEAARAKQGQ
ELWRSQVMAG SIIPVRAPRV DGSPINSVPG LKDGPGSDST VLRKRIKELE EEVKALKLES
EKQKNHIEKY KGKFEKLKAN ARAKKEAKLA AAAAEATAQG QNSSTPT