Gene CNF01410 details

Gene Information

Locus tag: CNF01410
Symbol:
ID: 3258306
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 410988
End bp: 414672
Gene Length: 3685 bp
Protein Length: 667 aa
Translation table:
GC content: 45%
IMG OID: 638257265
Product: expressed protein
Protein accession: XP_571579
Protein GI: 58268846
COG category:
COG ID:
TIGRFAM ID:
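
The gene length listed above agrees with the start and end coordinates if they are read as inclusive, 1-based positions (414672 - 410988 + 1 = 3685). A minimal sketch of that check follows; the inclusive convention is an assumption, and the function name is illustrative rather than part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(410988, 414672))  # 3685
```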


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AACAAGCATC AAAATGGATG AAGATAGCTC ATCAGTTCCT CGAAAATCTG CTACTGGCCC 
CGTAAGAGTC AGCCAGAGGC AACCACAAAG GTGAGTAGAT GAATTCTTGC CTTGTTCGAA
ACAGGAACGA AAAGCCTGAC GCTCGCTTTT CATGCGAAAG CTGCACGGAA TGTACCAGGT
GAGCATTAAA CCAAACTTCC AAACTCTGCT GCTATGCTCA TCCTCGGTGA CTGTGGTAAA
GGCGCAAGAC AAGGTGAGGG AAATTAATCA AGCGCGTGCA TTTAGAGGTA GTTTAATCCA
ACAAACATAG GTGCGACAAA CGGGTTCCCT GCGTCAACTG GTGAGTATAA ACTTACAGCT
TCTCTCAAAT CACTCATGAT AATTACAGTT GCAAACGAGG GGTACCTGAG AAATGCCAAA
TCGAACAAGT AATTCCTTCA AAGAATTTGT ATGTACACTC TCCGGCCATA GATCAATGCT
AAACGTTTCT TTTCAGGAGT ACTGCAGCGC AGATCAAGAC CTTAAGAGAT GAGATGTTAG
CAGCCGATCA TGAGCTGAGA CAGCGTGTAG AAGCCTTAGA GAAACTAGTC AAATCGTTAA
CCGCAAGCCG AGAGGCCTCA GAAAAAGCTA CTAAAGCGAC CACAACTGTG CTGTCTCCTT
CCATTTCTCC ATCGGCTATG AATGATCAAG TGAGTACACT CATACTGGCG CTTTCGATGG
CATAGTTGAT GGAATGAAAG GGTAATAACA CCGACGAAAA TCTGGATGAG TCTGTCGGTG
GTGGTTACGA TGACGATGAA GCGGAGGCTG CAGCCACCCT TGAATTGTAA GCGCTTTGGA
CATGTTATCT TAGTAGCTTC AGCTTATTAA TCTGGCATGT AAGTCTTGCC ATTGGACGAC
TTCGACCAAA GCCTCCAGGT GGATCGGACG TGATATACAA TGGGGACGCA AATCAAGTTG
ATGGTGTGAG TATCCATCAC CGTAATTACA AGCGATATTT CCTCACAATT CTGAGTAGCA
GGCTTGTTCC GAAGATGGGG AGATGAACGA ATCACCTCCG ATCATCCTGT CATTATCTGA
TCCTACTCAC GCATCCACTC ACCCTCAAAC GTATCTTCAT TTACCTCCAG TCAAATTGTC
TCGCCTCAAC GACCGCCGGC TTGCTTTGTA TCATGAAGTC GTCAGAAAAG ACATTCTGTC
TCAACTGCCT CCCGCTACTG TTGGAAGGGC TTTGGTCCAA TTCGACATGG ATAATGTTGC
CTGGATGCAC TGGTAAGTCT GAATGTTGGA TTCCGCGATC AAAACTCATA CGTTTTTTGG
ACAGCTGTTA TCACGGTCCT ACTTTCCGTA GGTATAATTA TTGGATTACA TAGGCGTAGA
GCTGACAAAA GACAAGAACG TGAAGCCGAT CTGGTCTGGG CAGAGCTTGG ATCAGATGAC
GTAGAGATCA ACTGGAGCTT TATGGCTCTA TTATTCGGCG TTCTGATGGT CAGTTTTTAT
CTTTTGCGAT AGACAAAGAA CTGACGAGCA GATAGTCGGG AGCATATCAC CTACCTGAAA
GCACATTCCA GTATCTTTTT CCTTCGCGTA AGTGTCTAAT TGCCGTCCTT GCTTGCTTTG
GCTAATACAA CTTTCTAGAA ACTCGGTTGG ATTTAATCAA TCAATGGTTT ACGGCTTGCA
TCATGTGTCT TCAGGAAGCC GATTGGATGC GTTCACACAG CCTGTAAGTA GCTTTTAAGT
GTTCTTCTTG AGTGTTTCGC TGATCATTCG CTAGTTACGC CATTCAATGC GTTGCTATTA
TTACTTCCGC AGCGAACCAT ATTGGTAAAG CAGATTTATA TTTCACTCTT CTTGGTGCGG
CCGTAAGGTA AGTAGAACGA CGGTAATCAT ACCCTGCTAA TTCTTATGGC CCAGAATTGC
CCAAGCATTG AACCTTCATC GACTCGGTCC AGATTCCGAG CTTTTATATA AATCTAAGAA
TTCGAAATTG ATCATCGCAC GAGAGGTTCG AAAACGTACA TGGTATCAGC TCCTATATCA
GGGTAATTCA AGTTGTCCGC CTTACTTTTG GGTACTAATA CATGACTTCA ATAGACGCAT
TCCATGTGGC GTTCAATGGT GCTTGTTGTG AGTATCTTGA TTGCTTCTTT CCTTTGAGCG
CTCTAATCAC GATATTCGAC TAGCTATCAA TCGAGCCCAG TTTAACACTG CGCCTCCCAT
CCACTGTATT GACAGTATGT TGGGTCTTGG TATGGAAGAA GCCAAAGTAT CCTACGACAT
TCCGACCACA GATTCGCACG TCTGTCAACT CTACAAGTGT AAGTTCCACA TACGCGGTGC
AAAACTGACC ATTTAGTGGC TTTGCAAGTG AATAGGTAAT AATACCGACA GGGATCCTTT
AACCAAAGCT AATTTTTTGG TAGAGCATAC TCCTCCACTG AAGCCATGGC CCCTTTACCC
TATGATGTAA TTCTTCAGAT TGATCGTGAC ATCCATACAA TCATGAACGA AGCTCCATTA
TGGATGCGGG ACGAGAACTA CGTACTTGAT CCAAATGCTC CTGCATGGGC TGACTGGCAG
AGAAAGGCGT ACATCGTACG TATCTATGAA GCCTTGAATA AGTAATATAC TGATATCCCA
TCACAGCTTT CGGCCTCCCA CAAAGTGAGT TGTCAATATG CTCGATATAG AAATTGCACT
GATTGTTGTC AGGTTATTGT CCTGCATCGT CCTTATCTGG GCCGGGCATT CCGAGGCGAT
CAACGTTATA TCCGGTCTCG CGAGGTGAGA AAGCATTGAT ATATCCTCAA CGTCGCTGAT
ACCTTTTTTA GAATTGTTTA CAGCACGCCC ATAAAATCCT GCGGATATTC AAAGGCTGTT
CACTCGTCCA GTTTCGCACT ACTTGGTCGG TTGCCTATTT CAAGCTGCAA GGATTGGAGC
TTACTTCATT CGATTAGGAC CGTCTTAGTT CATGCTGTTG CGGCGAGTCT TATCCTTCTC
CTTGACGCCT CGCAGCAAAC GAATCATTGC GCCCTCAACT CAATCCAAAT TGTGCGCGAG
ACGGTCGATA TCCTGCGCCA GCTCTCTGAC CTCTCTATCA TCGCCAAAAA GGGTTTTCTC
GTCATTTCAC CTTTAATTGA AGACTCCGCA CAACTCACCG AGGCTGGACC TGATGTAAAC
GTCAGAAACA GAAAAGCTTT TCTGGGTAAG GCGGAAACCG AACTGAAACA AGCCCTCGAT
GTTTTTCGGC GTGGGCACTC CACCGCTCCA CCTTCTGTGT CCCCATTCAG TGCCTCAGGG
CATGTGGAGT TGTCTGCTAT ATGCAACCAG GATGCCAGCA TGGCTTCGGA CTCAGAGGCG
ATGGTCGCTA CTTCGTCCCT CATAGGAGCA TCACAAGTAA GGGGGTTGGA AGCCACGGAT
GTATGGAATC CCTTTTTATG GAACTCACAG GGCGACATGG GGTCCTCCAC CAATCCTAAC
ATGATGATGG ATTGGGGAAA TCCAGAATCT CTGAGTTTGA GTGGTGATAC AAGTGGAATG
GACTGGATGT CTTATACATA AGCAGAAAAT GGGCTAGTTG TACATGTTAC TAGAATAAAT
CAAGTTGTAC TAGATTGGGA ATTGATAAGT TCATAGGAAT ATGCCTCTGT TTTTTTCTTA
AATATTAATG AAGGAACTTG AGGCT
 
Protein sequence
MDEDSSSVPR KSATGPVRVS QRQPQSCTEC TRRKTRCDKR VPCVNCCKRG VPEKCQIEQV 
IPSKNLSTAA QIKTLRDEML AADHELRQRV EALEKLVKSL TASREASEKA TKATTTVLSP
SISPSAMNDQ GNNTDENLDE SVGGGYDDDE AEAAATLEFL AIGRLRPKPP GGSDVIYNGD
ANQVDGQACS EDGEMNESPP IILSLSDPTH ASTHPQTYLH LPPVKLSRLN DRRLALYHEV
VRKDILSQLP PATVGRALVQ FDMDNVAWMH CCYHGPTFQR EADLVWAELG SDDVEINWSF
MALLFGVLMS GAYHLPESTF QYLFPSRKCL IAEADWMRSH SLYAIQCVAI ITSAANHIGK
ADLYFTLLGA AVSLTLRLPS TVLTVCWVLV WKKPKYPTTF RPQIRTSVNS TSLSASHKVI
VLHRPYLGRA FRGDQRYIRS RENCLQHAHK ILRIFKGCSL VQFRTTWTVL VHAVAASLIL
LLDASQQTNH CALNSIQIVR ETVDILRQLS DLSIIAKKGF LVISPLIEDS AQLTEAGPDV
NVRNRKAFLG KAETELKQAL DVFRRGHSTA PPSVSPFSAS GHVELSAICN QDASMASDSE
AMVATSSLIG ASQVRGLEAT DVWNPFLWNS QGDMGSSTNP NMMMDWGNPE SLSLSGDTSG
MDWMSYT
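
Both sequences are printed in space-separated blocks of 10 residues, 60 per line. The sketch below shows one way to strip that layout before checking a sequence against the reported lengths or estimating GC content. The helper names are hypothetical, and the GC calculation counts only G and C over the full sequence length, which is an assumption about how the record's figure was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "AACAAGCATC AAAATGGATG AAGATAGCTC ATCAGTTCCT CGAAAATCTG CTACTGGCCC "
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applied to the full gene sequence text, `len(clean_sequence(...))` can be compared with the Gene Length field, and `gc_fraction` with the reported GC content.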