Gene CNF03490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03490 
Symbol 
ID3258404 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1033529 
End bp1035382 
Gene Length1854 bp 
Protein Length489 aa 
Translation table 
GC content48% 
IMG OID638257467 
Productsplicing factor u2af-associated protein 2, putative 
Protein accessionXP_571423 
Protein GI58268534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGCT GTACGCCGAC GAACCGCAAA TCCTTGGGTA TACTGTAGCT CTAGGTGATT 
TCTGGGTCTT ATGTGTTCGT TCTTTTTTGT ATATGTTTTT TCTCCCAGGC TACATATCAG
CCATGCTGAC GGAAGAGATG ACCAATCGAG TAGATCCGGA GAGCCAGTGG AGTGAATTCT
TTACGTGACG GAGAAAAGTG ACTGACGAGT TGACGAGGCG AAGGAGGGAG CCGTCCCACG
TGCGCTTGGC GTGGATCTCG CGTGGCTTAT CCTACCACCG TCATGGCCCT CAAGCCGTCG
CGGACCAGAA GCAATCGAAC TATCCGTTGT TCGAGTCTCT TCAGTTATTG AGCTGTCTAT
ACGTTTACGT GCCACCATTT GGTAAAATGC CAAACGCCCC CATACCCGGC CAGTTCGAGC
AGGACACTCG AGTCTCTTTT GACAAAGTTT CCGGCAAGTG GCAGTACGAA GATGATGAAG
GCACAGAACA TGAATGGAAT GGCACTGCTT GGATTCCCAT TGTACGTATC ATGCAGACTC
GTACCAAACA AAGGCTGATC CATCTGTCTA GATTGACGAT GAGCTTGTAA GAGCACAGCA
AGCAGCGTAC TCGGTACCCG GTGTAGACGA ATCAGTACGT CGGTTATCAA TCGTAAAAGA
ATTAAATCCT CATCAACCGA TTGATAGACA CCTTCCAATG CGGCCATCGC AAGAGAAGAA
CGCCGTAACA AGAAGCGTAA GAAGGGAGAA AAGGATTATA CCTCAAATAC CTCCAACGCC
CCAGCTGCTG CGACCGAGGC CTCCAAACCT GCTCCTGCCC CGTCTGCGCC CAAGAAGACT
GGTGTTTGGG TCACAAATCT TCCGCCAAAC ACCACTATCC AGAAGCTTGC CGATGTCTTC
TCCAAGGCTG GCGTCTTGCA TATTGATGAT GAAGGCAATC CCCGTATTAA GATGTACTAT
GATGACGAAG GGAATTTCAA AGGCGAAGCT TGGGTTGTAT ATTTCAAGGA AGGCAGTGTG
GACCTCGCCA TCACACTTTT GGATGACACT GAGCTCGAGC TGGGTGCTGG TTATCCGCCT
ATGAGAGTCA AAGTCGCGGA ATATTTTAAA GATCAGGAAA AGGGAAAAGA TAAAGAGAAG
AAAGAGAAAA CTGAAGGAGA AAAGAAGAAA TTGACGGCCG AAGAGAAGCA AAAAATGAGC
AAGAGGATGA AGACTCTTCA GAGGTGCGTG ATTAGTTTCT TATGATCTTG ATCTGGATTG
GACTGATTGT ATGTAGTAAA ATCACGTGGC GCTCGGATGA TGAGTCTGAC GACCCTGCTG
CTCCTCTCGG AGGTGCTCCT GCCCCGACAA ACAACCGTTT CGCTCGTGTG GTCGTGTTGA
AGGGAATGTT CGTCCCCGAG GAATTAGAAA AGGATCCTGC GTTATTGCTA GAGCTGAAAG
AAGAGGTCAG AGAAGAAGCA GAGACGCTTG GCCAAGTCAC GAGTGTTATC TTGTATGATG
TAGGTTACTA CATCTTGACG GCTGATTTCG ATGCTTACAA TGAACCATAG AAGGAGGAGG
ACGGGGTAAT GACCATCAAG TTCAAGGAAC CCGTGTCAGC GCAGGCGTGT GTAGCGAAGA
TGAACAACCG ATATTTCGAC GGTCGAGTGG TATGTCTGGC TTATGATCCT TTTTCTAATG
TCGCTAACAC ATTCAACGCA GATCTACGCC GGTCTCTATA ACGGAAAGGA AAGATTCAAA
AAATCTGGTG GACGGACGTT TGATGAAGAT AATGATCAGG AGGAGAAGGA GCGACTGGAC
AACTTTGCGC ACTGGCTGGT GGAGGGCGAG GATGAAGAAG CTGCCAAGAA GTAA
 
Protein sequence
MASCYISAML TEEMTNRVDP ESQWMTDELT RRRREPSHVR LAWISRGLSY HRHGPQAVAD 
QKQSNYPLFE SLQLLSCLYV YVPPFGKMPN APIPGQFEQD TRVSFDKVSG KWQYEDDEGT
EHEWNGTAWI PIIDDELVRA QQAAYSVPGV DESTPSNAAI AREERRNKKR KKGEKDYTSN
TSNAPAAATE ASKPAPAPSA PKKTGVWVTN LPPNTTIQKL ADVFSKAGVL HIDDEGNPRI
KMYYDDEGNF KGEAWVVYFK EGSVDLAITL LDDTELELGA GYPPMRVKVA EYFKDQEKGK
DKEKKEKTEG EKKKLTAEEK QKMSKRMKTL QSKITWRSDD ESDDPAAPLG GAPAPTNNRF
ARVVVLKGMF VPEELEKDPA LLLELKEEVR EEAETLGQVT SVILYDKEED GVMTIKFKEP
VSAQACVAKM NNRYFDGRVI YAGLYNGKER FKKSGGRTFD EDNDQEEKER LDNFAHWLVE
GEDEEAAKK