Gene CNN01070 details


Gene Information

Locus tag: CNN01070
Symbol:
ID: 3255536
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006683
Strand:
Start bp: 329403
End bp: 332739
Gene Length: 3337 bp
Protein Length: 861 aa
Translation table:
GC content: 50%
IMG OID: 638254523
Product: conserved hypothetical protein
Protein accession: XP_568746
Protein GI: 58262672
COG category: [S] Function unknown
COG ID: [COG5594] Uncharacterized integral membrane protein
TIGRFAM ID:
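
The Start bp, End bp and Gene Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length but is not stated in the record; the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature from its start and end coordinates.

    Assumption: coordinates are 1-based and inclusive, so both the
    start and the end position count toward the length.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields of this record.
gene_length = feature_length(329403, 332739)
print(gene_length)  # 3337, matching the Gene Length field
```

Because the gene is listed as spliced, the 3337 bp span is longer than the 3 x 861 = 2583 bases needed to encode the 861 aa protein; the record does not say how the remainder is divided among introns and other non-coding regions.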


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.935111
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGCCAACATC CATTCTCATT TTCTCTTTCT CAGCGCCCGC AACTGACCAG CCTGTCTCCC 
AGCTGCAAGA GCGGAGAACG AACAACCATA TTTCTACAGG TGACGCGTAT CAAACGAGGT
GAGTCGCCCG GGAAACAAGC ACTTCGGGTA AACGGGGTTG AAAACTGGTT ATTCCCCACT
AGCCCACCCG AAATAGTCCC GAATGAAAAG AGTCCAAGCA CAGGAGCGCT GACTTGCGGT
CAAATTTTCG TAGCATTTAC CATCAACTCA TCGTTACACT TCGCCCCTCG CCCATCCATC
TTTATTCGCC ATGTCGGCTA CTAACGCGGA CGTGCAAACG TCCACGACCT CATCCTTCGT
CGCGGCGCTT GTCGTGGCAG GTATCACGGT CGGAGCTTTC TCAGCTCTCT GGCTCGTCCT
CCACGGAAGG AAAGACCTGC AGAGGGTTTT CCAGCCCAGG ACAATCCTTC CTCCTGAGAG
GTAAGTTTAA TCTTGAATGA AAAGGTCTAT GGCCGGCACT CAATTCTGTG TGCAGCAAAC
GACCTCAACC TCTCCCATCG GGTATAATAG CGTTTTGGAA AACGTTGATC CAAACACCGG
ATCAGGACAT CATTACTTCC AATGGTCCCG ATGCCTACTT TTACGTCCGC TTCCTCAAGG
TCTTTGGTTT TCAAATGCTT ATCCCCTACG AAATCCTCAC TTGTGCGATC CTTATTCCTG
TTTCTGTTAT TTCTCCTAAC CAGGGGAACA CGGGCTTAAA CAAGTTGACT TTCGGCAACG
TTGGTGAAAC AGACCAGATC AGGCATGTTG CCCATTTCCT TGTCGCTATT GTCTTGATGA
GCTGGACCGT TTACTTGATC TGGAGAGAGT ACAATCACTT TGTCGATGTT CGACAAACTT
GGTTGACCAC TCCACAGCAC TTGTCTCTTG CGCGCGCTAG GACCATTGCC ATCACCAACA
TCCCTGACAG CCTCAACTCT TCCACTGGCA TCAAGGAGCT CGCTGGTGTC GTCTCTCGTG
TCGGTGCTGG TAACGGCTCT GGCACCAATC TTCTTGCACT TGCCAACCCC TTCTCTCGTC
AATCTACCGC TACTGAGAAC ACCGGTGCCA CTGGCGACTC TGAAGGTGGT GTAAGGCGCG
TTTGGTTGAC TCGCAAGTGC AAGAACGTCG AGAAAGTTTG GAAGGAGCGA GATGCCGAGT
GTGCCAGGCT CGAAGGTGGT GTTGCCAAGC TCCAGAAGCG CGCTGCGAAG AATGTCCGCA
AGGGCAAGAC CCCCGAGACG CAAGGTAAGT TTTTAGAATT CAGGTCAATC AACAAAGTGC
CTTACATTCT TGTTATGTCA TTTGCTTACT CTTCCCATTT AATAGGCAAA TACGATGCTG
AATCTTCTGG TGGTGATCTC ATTGACCGCT ATGTCCTCCC CAAGAAGCGT CCTTCGTGGA
AACAAGGTCT TCTTGGTCTT ATCGGTCAGA AGCAGAACCT CGAGACCTCC CCCGACTACA
TCCACGAGCA CAATGTCAAG CTCGACGAGT TACGTGAAGG TATTGAGGAT CTTCCTCAAG
GCAACACTGC GTTCATCCGA TTCTCTTCTC AGTTCGAGGC TCACGCCTTC GCTAAGCTGG
CAAGCAAAAC TGACAAGTCC AACATGCACA TCCGTGGCGG TATCGAAGTC GTCCCTGAAG
ATATTGAATG GTCCAACATC TCTATGAGCC CTTGGGAACG TCACGCCCGA ACTATCGTCT
CTTGGTGTCT CACTGTTGGC TTGATCATTG TCTGGGCCAT CCCTGTTGCC TTTGTCGGTA
TGATCTCCAA CGTTGATACC CTTTGTGCCA ATGCCAGCTG GCTGGCTTGG ATCTGTGAAC
TGCCCCCTGC CGCCCTTGGT ATCATTAAGG GTGTTCTTCC TCCTGCGTTG CTTGCTGTCC
TCTTCATGCT CTTGCCTGTC GTTCTCCGTT TGATGGTCAA AATGCAGGGT GAGATCCGAA
AGAGGTGAAT TTACATCTAT TGTCTCTTGT AATCACCTGT TTTGCTGACT CGGATGTAGC
GACATTGAGC TCAAACTCTT CAGCCGATTC TGGCTCTTCC AGGTCATTCA TGGTTTCCTT
ATCGTCACTC TTGCTTCCGG TTTGATCAAT GCTCTCGGCA ATTTGGGCGA CACTGCCGGT
GAAGTCCCCA CCTTGCTTGC CACCAAGCTT CCTGGTGCTT CCATCTTCTT CTTGACCTTT
ATCCTTACCG CCACCTTGTC CGGTGCCGCC AAGACCTACG CTCGTTTGGT TCCTTGGATC
ATGTACTTGC TCCGTGACAT TTTGGCCGGT GGCACTCCCC GAAAAGTTTA CTTGAAGAAG
TACAAAATGG ACTCCTTCAC TTGGTCGACT GCCTTCCCTC CTACCTGTCT TATCATTTGC
GTTACCATTG TCTACTCTGT CATCCAGCCT ATTATCACTG TGCTCGCCTT GGTCGCCTTC
ATCCTTCTTT ACTGCGCCAA CAAGTATGTC ATCCACTGGT GTGCTGACCA GCCCGACGCT
GCCGAGACTG GCGGTCTGTA CTACATCAAG GCTCTCAGGA CCGTCTTCGT GTCTCTTTAC
ATTCAAGGTG TCTGTATGGC TGGTTTGTTC TTCCTTTCTA CCGATGAAAA TGGAAACAGG
TCCAAGAGTG GTTTGGGATG CGGTGCCGTC ATGGTAAGTA TGATCATTTG GTTTTGATTG
TAGCAACTGA CATGTGTGCT TTAGTGTGTT ATGATCGTCT GCATTGCTGT CATCCAGATT
TACATCGACT GGTTCCGATT CACCAAGCCT TACCTCATTT TTGTGCACAA CACCCCCTCC
GTTCCCCATT CTTCTTCTGT CGAGCCCAAG GTTGGCGGCG CCACCGACTC ACCTGATGAA
AATGCGGTCG CCGGCCCCGA GCTCGGCAAC ACCTCTGGCT TCCACAATCG TGCCTTTGAC
CACCCCGCCT TGTGGAAGAA GCAGCCCGTC ATCTGGGTTG CCGCAGACCC GCATGGTTTG
GGTGCGTTTG AGGTTGAGCA GATCAACGCC AAGGGCGTGG AGGCGAATTT GGAGTATGCG
GTCATGACTG AGAAGGGAGC GATTGACGTG GAGAGGTCTC CTCCTGATGA GGCTTGGTAT
GAGGGATTCA CTGCGTAATT CGTTTTCAGG ATTTGGTAAA TATAATTTAT GGTGTAAAGA
GAGGAGATGT GAACACACGA TAAGAAGTTA TACCTGTTTT GATATCGATA CCAAATAGAA
GTATCTGTTT TCTTTCTCCA TACACGGTCG TACGGATGTT TATTTCCGAG TCTATATTAT
TTTATCGACA TTATGAATGT GAGCGCGCTG TTTACCT
 
Protein sequence
MSATNADVQT STTSSFVAAL VVAGITVGAF SALWLVLHGR KDLQRVFQPR TILPPESKRP 
QPLPSGIIAF WKTLIQTPDQ DIITSNGPDA YFYVRFLKVF GFQMLIPYEI LTCAILIPVS
VISPNQGNTG LNKLTFGNVG ETDQIRHVAH FLVAIVLMSW TVYLIWREYN HFVDVRQTWL
TTPQHLSLAR ARTIAITNIP DSLNSSTGIK ELAGVVSRVG AGNGSGTNLL ALANPFSRQS
TATENTGATG DSEGGVRRVW LTRKCKNVEK VWKERDAECA RLEGGVAKLQ KRAAKNVRKG
KTPETQGKYD AESSGGDLID RYVLPKKRPS WKQGLLGLIG QKQNLETSPD YIHEHNVKLD
ELREGIEDLP QGNTAFIRFS SQFEAHAFAK LASKTDKSNM HIRGGIEVVP EDIEWSNISM
SPWERHARTI VSWCLTVGLI IVWAIPVAFV GMISNVDTLC ANASWLAWIC ELPPAALGII
KGVLPPALLA VLFMLLPVVL RLMVKMQGEI RKSDIELKLF SRFWLFQVIH GFLIVTLASG
LINALGNLGD TAGEVPTLLA TKLPGASIFF LTFILTATLS GAAKTYARLV PWIMYLLRDI
LAGGTPRKVY LKKYKMDSFT WSTAFPPTCL IICVTIVYSV IQPIITVLAL VAFILLYCAN
KYVIHWCADQ PDAAETGGLY YIKALRTVFV SLYIQGVCMA GLFFLSTDEN GNRSKSGLGC
GAVMCVMIVC IAVIQIYIDW FRFTKPYLIF VHNTPSVPHS SSVEPKVGGA TDSPDENAVA
GPELGNTSGF HNRAFDHPAL WKKQPVIWVA ADPHGLGAFE VEQINAKGVE ANLEYAVMTE
KGAIDVERSP PDEAWYEGFT A
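
The sequences above are printed in space-separated groups of ten residues, so they need to be joined before they can be measured or compared with the Gene Length, Protein Length and GC content fields. The following is a minimal sketch of that cleanup, not the database's own method; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first three groups of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block into one contiguous uppercase string.

    All whitespace (spaces between groups and line breaks) is removed.
    """
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three groups of the gene sequence, split over two lines to
# mimic the layout of the record.
example_block = "AGCCAACATC CATTCTCATT\nTTCTCTTTCT"
example_seq = clean_sequence(example_block)

print(len(example_seq))
print(round(gc_percent(example_seq), 1))
```

Applied to the full gene sequence block, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field. The same `clean_sequence` helper applies to the protein block for checking Protein Length.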