Gene CNF01220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01220 
Symbol 
ID3258227 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp366843 
End bp369745 
Gene Length2903 bp 
Protein Length529 aa 
Translation table 
GC content46% 
IMG OID638257245 
Producthypothetical protein 
Protein accessionXP_571265 
Protein GI58268218 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.904942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTGTCTGGA CTGTTTTCCA ATACCAAACA GGGTTTAACT GCCTTACAAA TTGTATCAGC 
CGATAGGCAA GTCTGGAAGG GTATAAGTGT CTTCGGTATC CACCATTCTG GTGAATCCCT
ATGTTGCTAA AGACTAGCCT GACAGCTAAT TACAGGCGTA CTCCTTAGTC ACCATCGCAC
CTCGAAACTC ACGATAACCA TATACACATC ACGAAGCTAC GATTTAACAT GTCTGCCGAG
ATTAAAGGAG AACATTTGAC ACATCACTAC TTGACCAATG CCCAGCTGGA CGACTACCCA
GATGAGAAGC AGGTTGTCGG GAATGGAGAT TTGGTGACGG TAGAGGAAGT CAAAGTCGCT
GAAGACGCGT GAGTGATGAC TGTAGATTAT TGATGCAAAC CTATCATTAA TATGGTTTCC
AAGCATCCTG GCGGGGGATG AATACACGGA AGAGCAATAC AAGAAGCTCA AGCGCAAGGT
AGACTGGGTC CTTTTGCCTC TGATGTGGTG GTGTTACGGT ATCCAGCAGA CGGACAAAAC
TGGGTGCGTT TTTGTTTCGC CCATAATTTG CTATGGGATG ATACTGACAA GAAACTATGA
TAGTCTCGGT ACAATGGTAT GTCAGCATAT GCGCCCGTGT GATTCTTCTC TGACGACTTG
CAGAACCTGT ACGGCGTGCA GGCTGACACT GGCATGCACG GCAATCAATA CTCCCTGTTA
ACCGTGGTGT TCTGTAAGTA AAAAATCTGA TAGACAACGA CCAATCTACT TATGGTGTCA
CGCAAAGATA CGGCCTATGC TGTCTGCGAG TTCCCTTCAA ATTTTCTCCT TCAACGTTTT
AGTAAGTTTA TAAGTAACCA GTAGCTGTAG TATCGAACTG ACAAAACGTC AGACATGGGC
AAATGCTTGA CCATCTACAT GTGAGTGCAT GTTTGCATCC GCTGGACCTA GAGCTGATAT
CTACCAGGTT CTGCTGGGGT ATCATTGTCC TTGCACAAGG TTTCGTCAAG TCCTTTGCGC
CTTTCCTCGT TTTGCGACTG CTCCAAGGTG CTTTCGAATG CACAATCAGC CCCGGTTTCA
ACCTCATCAT CGCCAACTGG TACACATCCC AAGAACACAA TTCTCGCTCC CTCATCTTCC
AGAGTGCCAA CGCCGGCTGG GGTATCGTTG TCAGTTTGAC AATGTACGGT ATTGCCCAAG
CTGCCAACAA GAACCCTGGT GGTTTCGCGG CATGGCGAGG GATTGCCGTC TTCCTAGGTG
GTCAAACTTT GCTTGCTGCT GGTGTAGCCT TCTTCTTGCT CGGTACCCCC AACGAGGTCA
GGTGGCTTAA GGCAGAGGAG AAGAAGATCG CCTATGCCAG AGTCATGAAG AACAACGCTG
GCACTGACAC GACTGGTAGA AAGACCTGGA AGTGGGGCCA AGTACGCGAA GCGTTTTTAG
ACCCTGCATT GTACTTTCAG TTTATCAACG CCTTTTTGGT CTCTGTGGTG GGTACACTCA
TAGCCGCGTG ATCGTGTTAC TTGAATATTC ACGCCTTTTA GTGTAATGGT GCTCTTACCA
CCTTTGGTGC TGTCATCACT CTATCTTTTG GCTTGTAAGA CCCATTGATC TTTTCCGCCT
CGGAGCTGAC GAAATACAAT AATAACAGTT CTGAGAGTCA AGTCATCTTG TACGGTATAC
CTCAAAATGT TGTCTCTGTC CTTTGGTTCG CCTTTGTCGG TTTCATGACA CTCAAGTTCA
AGGGACTCAG GATGTACTTT ATGATGATCA GTGTCATCTT CCCCTTCATC GGTGTAAGTC
ATCGTCCCCA ATTTATTATT CTCCTTTGGA TCGGTAACTG AATATAAACG ATCGGTAGCT
TCTTTTCATG GCTTTGCTTC CTGAGGACAC CAGCTACCGA TGGACCAAGT GGGGCATGTA
CTTTATGACT GTCACGTAAG CCTGTCCTTG GCGTCATGCT ATCCAAGTAC CTTGCTGACC
ATTTTCAACA CAGCTTTATT CTCCCTCTCT TCTCTGGATG GGCTCTCATC TCTTCCAACA
CTGCCGGTCG TACCAAGCGA ACTGTAATGA GCTCCACGAC CTTTATCGCC TATTGGTGAG
TTTTCTTCCG TCTCTATTCT CTTAGGACAA CGTAGGTATG TAGTGACTAA TAAACACGAT
ACTTTTAGCG CTGGCAATAT TGCCGGTTCT GAAGTTATGA AGTCCAAAGA CGCCCCACAC
TACATTCCCG GTACCGTACG TTTCCATCTT TCCCTCTCTT CTGGCCTTAC CTGCTAACGC
TCTCTTCCGC CACTTTTCTC AACTTATCTT TTAACTAAAT TCACACACAA TGCAGATCGC
CATCGCATGC TGTATGGGCG TTGAATTCGC CACGCTCATT ATATGGCGTA TCTATCTCCA
ATACTGTAAC AGGAAAAAAA CCAGGGCCAT AGCTGAGATG GGGTTGAGTG AGGAGGAGAT
TACGAAGAAG GGACAAGCGT TGGGTGCCGA GGATGCGACG GACATGAAGA ACCCTTTCTT
CCTGTGAGTA TATAAACCTG AATCTTCATC GATCTGTTTA AAAAAGGAAC AAGGCTGATG
GGTTTGATAG CTATTCCACC TAGTTCTTTT GGGATCTCTC TTGAAGCAGT TGGAGGAGAA
GATGGATGTG CGAGAGGGCC ATGGAAGCTT CAGTTGTGTT GAAGGTGACT AACAAAGTTG
TACTCTTCTG GATCACTATA GAGTTGTAAA GTCGAAGTTG TGCTGTCTAG TTGTTATCCT
TTTTCATTTG TTAATATACC GAAAATTTTG TAGATAGGGA AAATTTTTAA TTGTTCAACG
GGAAATGGAA TAGATCAAAT GACTGACAAT GTAAACAGCC ATCTGATATA TAGCTGAATT
GGGACCAGTG CAGTGTACAT GCG
 
Protein sequence
MSAEIKGEHL THHYLTNAQL DDYPDEKQVV GNGDLVTVEE VKVAEDAILA GDEYTEEQYK 
KLKRKVDWVL LPLMWWCYGI QQTDKTGLGT MNLYGVQADT GMHGNQYSLL TVVFYTAYAV
CEFPSNFLLQ RFNMGKCLTI YMFCWGIIVL AQGFVKSFAP FLVLRLLQGA FECTISPGFN
LIIANWYTSQ EHNSRSLIFQ SANAGWGIVV SLTMYGIAQA ANKNPGGFAA WRGIAVFLGG
QTLLAAGVAF FLLGTPNEVR WLKAEEKKIA YARVMKNNAG TDTTGRKTWK WGQVREAFLD
PALYFQFINA FLVSVCNGAL TTFGAVITLS FGFSESQVIL YGIPQNVVSV LWFAFVGFMT
LKFKGLRMYF MMISVIFPFI GLLFMALLPE DTSYRWTKWG MYFMTVTFIL PLFSGWALIS
SNTAGRTKRT VMSSTTFIAY CAGNIAGSEV MKSKDAPHYI PGTIAIACCM GVEFATLIIW
RIYLQYCNRK KTRAIAEMGL SEEEITKKGQ ALGAEDATDM KNPFFLYST