Gene CNF01760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01760 
Symbol 
ID3258066 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp511494 
End bp514943 
Gene Length3450 bp 
Protein Length725 aa 
Translation table 
GC content49% 
IMG OID638257301 
Producthypothetical protein 
Protein accessionXP_571515 
Protein GI58268718 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TAGACTCCGG CCTCTGTCCC ATATACATCC CTTCCATCAC TATACCACCA GCGGGGAGAC 
TCCGCCATAG AGACAGATGC CTCGAGACAC TTTCTCCAGC AACCCCTTCA TATCCTCTCC
GGTCTACCAC GACAGGTCTG CCGACATCTC CTCTTTCTCC TACGATGCAA GGCGGTACGC
GCCCTCGCCT TCATCCGAGG CGCTGCCCCA GGAGCTACCT GCAGGAGCGA TGCCTGCTGT
TGCCGACACT TTTTCTGTGC GGAACTTGTC TTCGGTGGGT CACTATGTCT AACATTGGTT
TTTGGCTGCA AGATTCGCTA TGATGTGCCT GAAAGGAAAA TGCGATGTTA CGGATCACAA
ACAACCGCGC GCTTTTTCAT CGTAGTTCTG CGTTTTCCCA CTTTCTTTGG CGTCCATCCT
TCTCCCCGCT CATTTGCTGA CTAGATCCTA GTATTCCCAG CGGCCCAGTA TGTCGACCAC
CCCTAGGATG AACTCTTCAT TCAGCGGTAG GGATAACCAG CTGGAAAAAG ACAACCTGTT
TGGCAGTAGT CCAACGCACT TCAACTCTAG CTCGCGTCTG GCCTACGAAG CAGGGCCACA
TAGCAATTAT ATGGCCCCCG AGCCTTTAGG CTATGGTCGA ACCCGGACCA AGAAGCCGAC
ACGCACTTGC ATACCCACGA ACCCAACCAA ACGTCGACTT TTGTTTTTCG GCGTTCCTAT
TCTCTTGGTC GTTGTGGCAG CTGCTATCAT TGGAGGTGTG GTAGGATCTC AGAAGCGCCA
TAGTGCGAGC AATGACAGCA GTAATGGGGA TTCTTCTTCC GGTACTTCTG GTGGCGGCGG
GTCAAATTCT ACCAGTGATA CAGATGCGAA CACTTGGAAT TCGTTTATTC AACCTGGCTC
GGGCGGTGAT GGGTCTATTG TCACCACAGA TCTCGGTGTA AACTTTACCT ACTCGAATGC
TTTTGGTGGA ACCTGGGCTC AAGATCCTTA TGACCCATAT TCTGTGGGTT TGCTTCACCA
TATCGGGCCC GTTGAAGCTG ATAACAATTT AGGTATCTGG TCAAGCTCAG AGCTGGAGCC
CGAGCTTGTT GGAAGATTGG GTATGGGGAG AACATATCGT TCGAGGGTAA GTGATCATCT
AGTCTTCGCT CTTGTCATGT CAGCACTGAC GACCATTCCT TCCATAGCGT CAATATTGGT
GGATGGCTGG TGACTGAACG TAAGTTTCTT CCCTCCATAA CCATGTTAGA GTCTAACCAG
ACATAGCTTT TATCGTCCCT AGTCTGTATG AGAAATATCA AACTTCCACA CCAAAAGCCA
TCGATGAGTA TACGTTGTCG CAAGCGTATG TCTTTTATTT TCTGGTTTTC TGTGCTGTCT
AATCCCAGTA TTAGAATGGG AGACAACCTC GCTACTGAGA TGGAAGAGCA TTACAAAACA
TTCATCGTAA ACCATTTTAT GTTACTGATC GTACAAGAAG AATGAGCTAA TTTTTTTCTT
CCTTCTAGAC TGAAGAAGAC TTTGCTTTGA TCGCTGGGGC GGGTCTCAAC TATGTTCGGT
ACATTCGTTT TGAGTATCAA ATTATCAAGC AATGCTGAAC CTTGAACTTG TCTTAGTATT
GCATTAGGCT ATTGGGCTGT TGAGACAATC GACGGCGAAC CATACCTCGC AAAAGTTTCT
TGGAAGTCAG TCATTCCTGG AATTTTTGAA AGACAGTCGT CGCTTACCAC TAAGATCAGC
TACTTCCTCA AGGCCATTGA TTGGGCTCGA AAGTATGGTC TCCGTGTTTT AGTCGACTTC
CATTCGTTAC CAGGCAAGTC AGATTCATCT CTTGAACCCT CTACTTAACT AACTCCTGTA
TGATAGGATC CCAAAACGGC TGGAATCACT CTGGCAAATC CGGGTCTGTC AATTGGATGT
ACGGTGTCAT GGGTATCGCC AATGCGCAAC GCTCTCTTGA GACGCTCCGA TCGATCGTCG
AGTACATTTC CCAAGATGGT GTCAAACAAG TCGTGCCTAT GATTGGGCTT GTTAATGAAG
TTCAAGCAGA GACCGTCGGC GGAGATGTGT TGGCTGCCTT GTAAGTGTCA ATCTCTTGAT
TGAAGGTTTT ATATTTGTTA ACTTTATTAT GTTTTTAGCT ATTACCAAGC CTACGAGATG
ATCCGAGAAA TAACCGGCTA TGGTGCAGGC AATGGTCCCG TCATCTTACT GCACGAAGGT
TTCTACGGGA TCGCGGCTTG GAATGGATTT TTGGCAGGTG CTGACAGAAT GTAAGGGTGT
ACACATCTAT CATTGTACTC TGGCTAAACT TCTGCCATCC AGTGGCCTCG ATCAACACCC
GTACTTGGCG TTTCCCACCA CCCAGATCGC TGACAACCAC ACTGTCCAAG CCCATACTGC
CTGTGGCTGG GGTGGCGGTA CCAACGACAC TTCCACTGGC TACGGCATCG TCATCGGTGG
TGAATGGTCC AATGCAATCA ACGATTGTGG CTACTGGCTC GACGGTGTTG ACTCAACTCC
TCAATTCGAG GTCACGGGAA CCGGAAGCTG TGCCGCGCTC GATGAATGGT TCAACTATTC
CGATGAAACC AAGCAAGGCA TCATGGGCTA TACTTTGGCC AACATGGATG CTCTCCAAAA
TTACTTTTTC TGGACTTGGA AAATTGGGAA CAGTACGGTG AAGGGATATC CGACCAGTCC
GATGTGGCAT TACAAATTGG GATTGGAACA AGGATGGATG CCAAAAGACC CTAGAGTTGC
AGGTGGTTAC TGCCAAAGCA TCGGTGTAGG AGGGAACCAA GTAAGCTCTC TTCACATTCC
TGCATTAGAA ACCCGAGTGC TAACGCTCTC TTCAAGTTTG CCGGGACATA CCCTGCTTCA
GCTATCGGTT CCTTCCCCAC CGAAGTTGCT ACACCAACTA TCGATCCGAC GCAAGTTGCT
TCTCATTCCG TTTGGCCTCC CACCGCCCTT GGGCCGTCTC CATCCTACTC TGCCGCGCAA
ATAACTCTCT TCCCCACACT TACCCAGACC GGCACTAGAA ATGTTCTTGC AACGCCTACA
CACCCCAGTA ATGTGACACT GGAAGGTGGT TGGGCAAATG CCGCCGATAC CACCGGAGCT
TGGGTGAGAG TGGCTGGATG TGATTACCCA GAGTAAGCAC TTAGTTGGTT CCCTTCAAGG
TGGCCATGCT GACGATTACG GCCTGTAGTG AATATGATGC GAATACGGTG GCTGTTCCTA
CCGCCCAGTG TACCGGCTCT GGCTCTACAG ACAGTATGCG GAAAAGGAGT TTGCGAAGAT
GATAAAGAGT GGCGGAGCGG CCTTTATAAC AACTTTGGTT TGAATCAACG GACTTCTTGT
GGCATATTAG ATGAGCTTTA AATCGTTTCG CATTATCTTT ACATGATTTT TATATTGAAT
ATATCTCCAT AACCTATACG TAACATCGGG
 
Protein sequence
MSTTPRMNSS FSGRDNQLEK DNLFGSSPTH FNSSSRLAYE AGPHSNYMAP EPLGYGRTRT 
KKPTRTCIPT NPTKRRLLFF GVPILLVVVA AAIIGGVVGS QKRHSASNDS SNGDSSSGTS
GGGGSNSTSD TDANTWNSFI QPGSGGDGSI VTTDLGVNFT YSNAFGGTWA QDPYDPYSVS
GQAQSWSPSL LEDWVWGEHI VRGVNIGGWL VTEPFIVPSL YEKYQTSTPK AIDEYTLSQA
MGDNLATEME EHYKTFITEE DFALIAGAGL NYVRIALGYW AVETIDGEPY LAKVSWNYFL
KAIDWARKYG LRVLVDFHSL PGSQNGWNHS GKSGSVNWMY GVMGIANAQR SLETLRSIVE
YISQDGVKQV VPMIGLVNEV QAETVGGDVL AAFYYQAYEM IREITGYGAG NGPVILLHEG
FYGIAAWNGF LAGADRIGLD QHPYLAFPTT QIADNHTVQA HTACGWGGGT NDTSTGYGIV
IGGEWSNAIN DCGYWLDGVD STPQFEVTGT GSCAALDEWF NYSDETKQGI MGYTLANMDA
LQNYFFWTWK IGNSTVKGYP TSPMWHYKLG LEQGWMPKDP RVAGGYCQSI GVGGNQFAGT
YPASAIGSFP TEVATPTIDP TQVASHSVWP PTALGPSPSY SAAQITLFPT LTQTGTRNVL
ATPTHPSNVT LEGGWANAAD TTGAWVRVAG CDYPDEYDAN TVAVPTAQCT GSGSTDSMRK
RSLRR