Gene CNB02270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB02270 
Symbol 
ID3255595 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp657696 
End bp660558 
Gene Length2863 bp 
Protein Length799 aa 
Translation table 
GC content49% 
IMG OID638254878 
Productperoxisome targeting sequence binding protein, putative 
Protein accessionXP_569156 
Protein GI58263492 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.655025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTCCCACAC ATACCGCCAT GTCTGCTTTC CTATCGGGCG CTTCGGTCCA GTGTGGACCT 
ACATCTGCTC TCAAAAACGT CTCAGACAGG ATCAACGTTG ACCGTTCTCT ACAACAGGTG
CGCTGATGAG TGTCGATCAC TTTGCAAGCT TATTTATAAT GAGCTAACTT ATCCCGTGGT
AGGATCGACT AGCCTACACC TCGAATGCCT CTGGATCATC ATCAAAGGTG GGTATGGAAG
TATGCATTAC ATCCTATATT GATAGCTTGC ATAGCAGCCA TTCAGAGCTC AACTTGCTCA
GCTGCCAGTC AACCAGTCCC CGAAGCAAGT TCCAGGACCT CCATCAACTT TTGACCTCTC
TTCACTGAGG CAACAGCTTT CTCCTGTTCC ATCAACTTCT CATGCCTCAG ATTGGGCCAG
CGATTTCGTC CCTCTGACAG GACCGACAGC ATCGAAATAC ACCCGTTCTG TAGCTTCCAC
AAAGAGCGGT TGGCAGGAAG AGTTTAGCCA GCATGTTGCA GCCGCGCCTC CACTTGGAAC
CACAAGATCT CAGAAACCCC TATATAATGC TGGTCTTGCA CCATGGGAGG TTCCAGAAAC
TCAATATCAA CTTCAGCGCC CGGCATTGAT GCGACCAAGC TTTCGGTCGC ATGACATTCC
GATTTCAGAA GCAAGCCTGA CATTTCCTCG ACCTGATATC CATGCAGCGG CTCCCGTTAG
TAAATATGAA GAGCAAATTG GTCAACCAAC CGGATCAGAA ACTCTGCCTC TTGATGAATC
TCAACAATTA CTAGCTCGGA CGGCCCGGTC CTTTGTAAGT AATCTGGAGA CCCAGTCAGA
CATTTTATCT GCCAATCCCA AATTTGCACA AAGTAAATTC TTGTCTCTTC TGAGAGGCTT
AGGTGACGAA CAGGTTGTGG TCAAAGAAGG GCAAGAGGTA AAAGGTGAGG AGGTGGGTGA
GGGTGCGACA TTCGTTGAGC GGAACATTGT TGGAAACAAT TGGGCGGAAG GTTTTGCCAA
GCAAAAAGAG AAATCAATAC CGCAAGAAGC CCCCACCTTG GCTGAAGCTG AATACCTCGA
ACGAAGATCG CCTTATCCTC CAGGTCAAAA AGAGTATCCA GCCCTGAATT CTTGGGTTCC
GGCTTTGCCA ACGCACACAT TGGTTCCTCC CCAGACAGCT CCTCAAGCTG CCCGACTAGC
CGCGAATAAT GGTGCTTTGT GGGATCAACA GTATCATGAT CAAGAAGCCC TCATACAATC
TTCGGAATCG CCCGCACCCG AGCCAAGAAA GAATGTCCAC TTTGACGAAC ACCCCGCTTC
TCGAGAAAGG AGTGGTGTAC CCAGCACTCT CGAGGAAGCC ATTTCGTCTC CTGGCAACAT
CCCCGGCGCA GGTTGGGGTT GGAATGAACA AGGGCTAACT CACGACTTTG ATGAAGATGT
TTTTGAAGAG TTCAATGGTC AACTCAGACG GGCCCAAGAA AGTTTGGAAG GCGGAGTGGG
GAAGCAAGAA AGTTGGGATA GGCTGCAAAG TGATTGGGAG GAATTTCAAA GGACAGAACC
TGGCGTTGCA CATTTCAGAG GCATGGGGAC TGGCGACCAG TCAGAAAGAT ACATGTTCCA
AAGTCGAAAC CCTTATTCGA CCGATGAAGA GGAGCTATAT TTCGAAGTGT CTAGAGACTC
ACCTACCTTA AAAGTGAGTT GCATATCCCC TTCTTCTGTT TGATTCAGCC GTCTGACGTA
CATCCCAGGG TATACTTGAG CTTGAATCTG AAGTACAGAA AGATTCTACC TCACATGAAG
CATGGTATGC CCTCGGCCTC AAGCAACAAG AGAATGAGCG CGAAGACCAA GCTATCTTGG
CCCTATCCAA AGTCATTCAA CTGAATCCGC AGTATCGGCC AGCGTATCTT GCCCTCGCTG
TCAGCTATAC AAATGAAGGG GAAAATGAGG CCGCGTGCAC CATGCTGGAG GATTGGATTA
GACTGAAGGA CAGCAAAAAT ACAACCGGCG CTGATGGACA AAAAGGAAAA GACAGAAACA
AACTCATTGA GAGTTTAATA GAAATTGCGA GGCAAACGCC GCACGAGATT GATGCTGATG
TTCAGGTTGC ACTCGGGGTG CTGTTCAATA TGAGTGGGGG CCAGGTAAGT CGTCTTTCTT
GTCAGCCATT GCGCGATTAC TTATGACCAT CTGGGACAGG ATTATTCCAA AGCTGAGGAT
TGCTTTTTGG CGGCCTTGGA GGCCCGGCCT GAAGTAAGGC AGCTTGCTTC CTGTAATACA
TCTCGCAGAT GCTAACTGCC CCTATAGGAT TGGTTGTTGT ACAACCGTCT CGGTGCAACA
CTTGCGAACA GCGGAAGATC CAGCGAAGCT GTACAGTACT ACCATCAAGC TCTGAGGTTG
CATCCCGGTT TTGTCCGAGC TTTGTAAGCA TTTCACCACT ATATCTCGGG TGATGCGAGC
TTTCTAACGC AGCTCTTCTC TTCAGGTTCA ATCTTGGCAT CGCATACATG AATCTCGGTG
AATATCAAAC TGCCGCCCAA TCAATCCTCG ACGCCCTGAG GCTCCAGCAC TCTGAGGCAA
GTGAAGCCTA TGCATACGGC CAGAATGGTG GAGGTGCAAA AGGTGTTACG AGCGAGACGC
TATGGAATAG TCTAAAAGGC GCCTGCTTCT AGTAAGTGCT ATATTCACCT CTCCAAACGT
TTTTGATTCC TGTGTTAACG GATGGGTATC AAGTATGAAT CGACAGGACC TTGTAAAAAT
CGTAGAAAAG AGAGATTTAT CTGGTGAGTA GCCCTATCTG ATGTGCGCCA TCGCTAAATT
TTTGTCAGGT CTGCCATTGA AATTTGTAGA TGAAGAGCTC TAG
 
Protein sequence
MSAFLSGASV QCGPTSALKN VSDRINVDRS LQQDRLAYTS NASGSSSKQP FRAQLAQLPV 
NQSPKQVPGP PSTFDLSSLR QQLSPVPSTS HASDWASDFV PLTGPTASKY TRSVASTKSG
WQEEFSQHVA AAPPLGTTRS QKPLYNAGLA PWEVPETQYQ LQRPALMRPS FRSHDIPISE
ASLTFPRPDI HAAAPVSKYE EQIGQPTGSE TLPLDESQQL LARTARSFVS NLETQSDILS
ANPKFAQSKF LSLLRGLGDE QVVVKEGQEV KGEEVGEGAT FVERNIVGNN WAEGFAKQKE
KSIPQEAPTL AEAEYLERRS PYPPGQKEYP ALNSWVPALP THTLVPPQTA PQAARLAANN
GALWDQQYHD QEALIQSSES PAPEPRKNVH FDEHPASRER SGVPSTLEEA ISSPGNIPGA
GWGWNEQGLT HDFDEDVFEE FNGQLRRAQE SLEGGVGKQE SWDRLQSDWE EFQRTEPGVA
HFRGMGTGDQ SERYMFQSRN PYSTDEEELY FEVSRDSPTL KGILELESEV QKDSTSHEAW
YALGLKQQEN EREDQAILAL SKVIQLNPQY RPAYLALAVS YTNEGENEAA CTMLEDWIRL
KDSKNTTGAD GQKGKDRNKL IESLIEIARQ TPHEIDADVQ VALGVLFNMS GGQDYSKAED
CFLAALEARP EDWLLYNRLG ATLANSGRSS EAVQYYHQAL RLHPGFVRAL FNLGIAYMNL
GEYQTAAQSI LDALRLQHSE ASEAYAYGQN GGGAKGVTSE TLWNSLKGAC FYMNRQDLVK
IVEKRDLSGL PLKFVDEEL