Gene CNB01000 details

Gene Information

Locus tag: CNB01000
Symbol:
ID: 3256101
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 301086
End bp: 304024
Gene Length: 2939 bp
Protein Length: 597 aa
Translation table:
GC content: 44%
IMG OID: 638254751
Product: hypothetical protein
Protein accession: XP_569106
Protein GI: 58263392
COG category: [S] Function unknown
COG ID: [COG5594] Uncharacterized integral membrane protein
TIGRFAM ID:
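
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive (which matches the stated gene length), and that the gene span runs from the start codon to the stop codon with no untranslated regions. The record does not state either assumption, and the function names are hypothetical. Under those assumptions, the difference between the gene length and the coding length estimates the total intron length of this spliced gene.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 301086
END_BP = 304024
PROTEIN_LENGTH_AA = 597


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein: 3 per residue, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

# Assumption: no UTRs inside the gene span, so the remainder is intronic.
estimated_intron_bp = gene_length - cds_length

print(f"gene span: {gene_length} bp")
print(f"coding sequence incl. stop: {cds_length} bp")
print(f"estimated intron total: {estimated_intron_bp} bp")
```

The computed span agrees with the Gene Length field. The intron estimate is only as good as the no-UTR assumption.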


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.77991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTTATC TTCTTTTTCT CCGACTTTTG AAATACCTCT TCTCCGCAAT GTCCGTTCTG 
GCAGGTCTTT TGGCCATCAC AAACTACTAC CTCAACACGC AAACCACATA CGGCAGCACG
AGTACGATCT CTTCTTCTGG GGGCGAGGAT GATATAACAA AACGAGACAG TCCAAGTTCA
TCAAGCTCAG AAACTTCGAA CAACACCTCC ATAATAGATA ACCCCCAGCT ATTAACTGCT
GCCAATGTCA CCAGCAATGG TCTTTTGGTC CACATCTCGT TCGAATGGAT CGTGACAATG
ATGATCATAG TCTTTGGTTA GTTACATATT TCAACGTGAA AATCTTCGTG ATTGACTAAC
GCGAAACAAT CTGAAGTCCT CAAGACATCT GCTCATCACT TGAAAATTGT GCAAGAATGG
ACTCATTTGT AAGCTCTTGA TACCCATTCA TGATTTTGGG TCCTAACGGC AATCCAGGAA
TTATAATGAA GTCGCTTTCA AAACGTTGAT GATCACCAAT CTATCGCTTC GCCAGAACAA
AGCAAAATCA ATTGTGACTG TTGCAGATGC CAAACGTGAG ATAAAATCTC TTGTTCTCGG
TGCAGAAAGG GGCAAAATCG ATGCCAGGGT TTGGTTCGCC ATACATAACA TGAATCCTTT
GCATGAGAAG ATGGAAACAT TCAAGAAGAA GCATTTCAAC TTGGCGATTA AAGCCGTTGC
CATGGAAACT TTCCATGGAA GAGGTGCGGG AGTTCTTTAC GATAGCTGTA CAGGAAGAAT
GTGTGGGCGA TCCAAAGTAA ATCTTTCAAT TATCGGATGG AAAGTTGTTA CTGATGGGCC
TTGAGCCAAA CAGTCTGCTT CGAGTAGAGT GTAGGTAAAG TCCATTTACA AAAGAGGGCT
AATTGTGTCT GACATGCCCA GGGTTGAAGC GTTCAAGGAG AAACTCGAAA TTGAAGAGCT
TCAAGATCGT ATCCGCCAAG GACAAGTTGA TGTTCGTCAC ACGGATCTTT CCGGCACAAT
CACATCAGCC TTTGTCACTG TTCCCAGTGC CAAACAAGCT CGTGAAATCT TGAAAAATGT
GAAAGACGAC ATGAAGCGGG CAGGTTACCA TATTCAACGA GTGTGTCGGA TTTCGCTTGC
CTTCTAAAAA AGCTCGCTGA TAGACATATA TTTAGGCACC ACGTTCTCAC AACGTTGTAA
GTGACTTGTA ACTTCATATC ACGTGGGACC GACCTTGATG AACGATGTCT GTGCAGCTCT
GGAAGAACCT TGAAAAGGAT GTCAAGTCAC GCCATTCACA TGCAATCATA GGCAAATTTG
CTCTCGTAAT TATTTGTTTC GTGAACACTA TTCCGCTCAT GATTGTAACT GTCTTAGCCA
ATCTGGGTAC AGTAAGTCTT AGTTTTCTTA CGGCAAATCC CAAGTAGATT ACTTGAGAAG
TATTAATACG TGAAATCCGG AAGGCCATAG ATCGCTGGCC AACTCTGGCA AAGCTCGAAG
ACTCCTCTGA GATCTGGAAA GCCATCTTCA CCGTCCTTGC AGGAGTTCTT CCAGCCACTA
TTTCGGCCAT GTTCTCCTAT ATCCTTCCAT ATATCATGCG ACGGCTTTCT CGTTGGTCAG
GCGCTCTTAC TCGGGGTCAA TTGGATAAGG CCGTCATCAG ACAGCTCTTC ATCTTTCAAC
TAGTATCCAA TTTCATTGTG TTTTCTTTGC TTGGCGTCGT GTATGAAACA TATCTAACCA
TCTCGGAAGA CATTGGGAAA GAAAGCTGGT CCACTATCTA TGCAGGTCTG GGTGATGTCC
CAGCCAAAGT CACTCAAGCA TATATCTCTG AAAGCCTGTA CTGGCTGTCA TGGTACCCGT
CAGTCATATT TCCTGACTAC GAGGAGTATC ATGCTCATGC GTATTTTAAG GATTCGCTCA
GTAGTGGCGT GCTTACAGCT CCTCCAAATA CCAAGACTCA TTTTAAAGAC GCCTCAGTTA
CTGATGATCA AAACACCTCA TGACCTGGCG GAAGTGGCGC AGCCAGAAAA TTTTGAGGTA
AGTGCAGAGA TATGTGTCTC CTATACAAAC TTATTCTTGC TCTTAAGTAC GCGATCGAGT
ATTCACACGT GGTGAGTTCT ATTTTATAAT GCATGAGGCT CTATTGATCC AGATTGAGCT
AGCTCTTTGC TATGGTAGTA GGGTAAGCGA AGACATGATA GGCTGTTAAG TTCTTGCTCA
CCACTTTCTA GTCTGATGTA CGCTCCACTG GCCCCAATCA TTGTTATATG CGCGGCCATT
TACTTTTGGG CACTATACAT CATTGTGAGT CGTGGTCATA TCGAGAAATT GAAAATGTCA
TCTGATGCTG CGATGAACTG ACTTCTTCAC CTCGGTAGCA CAACAATCAG CTTAAATTTG
TATTTGACTC CAAGGAAACA GATGGAAAGT GCTGGAAGAT CTTGATAAAT CGCGTCCTTA
TCGCGACCGT CTTCATGCAG CTGTTCATGG TGTTAAGTGA GTTTTCCTAT AGTGCTGATG
TACTATGACT CCAGTTAATT CCTTCACTGA TAAATAAATC TTCAGCCTGC ACTCTTAAGA
CGCAGTCGGC GGCGATGGCA GTTGGTGCTG GACTTCCGGT TGGCATTATT TTCCTTTTTA
AAATGTATCT TCGGCGTCAT TACCATCCGG ATGGCGAGGT TTTCTCGCAG TATATCGACA
AGTATGAAGA CGATGATACC AGACATGGGG AATGGGCCCC TGAGTATGAG CATGAGTTAC
TGAGAGAAGA TTGGATGCCA AAAATCAAGA CGGTAAAGAA TGCCAAGCTC ATGAGTGTCG
CTATGCGTGA ATTCCCCAAG TTGAAAGAGC TATTAAGGGT TGGCAGGAAA GCGGACGGTG
AAAAATATAG AGGCTTGATG GACAAAAAAC GGCGTAAAAG GGTGCGAGAG AAGGGATGA
 
Protein sequence
MVYLLFLRLL KYLFSAMSVL AGLLAITNYY LNTQTTYGST STISSSGGED DITKRDSPSS 
SSSETSNNTS IIDNPQLLTA ANVTSNGLLV HISFEWIVTM MIIVFAFKEK LEIEELQDRI
RQGQVDVRHT DLSGTITSAF VTVPSAKQAR EILKNVKDDM KRAGYHIQRA PRSHNVLWKN
LEKDVKSRHS HAIIGKFALV IICFVNTIPL MIVTVLANLG TLEDSSEIWK AIFTVLAGVL
PATISAMFSY ILPYIMRRLS RWSGALTRGQ LDKAVIRQLF IFQLVSNFIV FSLLGVVYET
YLTISEDIGK ESWSTIYAGL GDVPAKVTQA YISESLYWLS WYPIRSVVAC LQLLQIPRLI
LKTPQLLMIK TPHDLAEVAQ PENFEVSAEI CVSYTNLFLL LSTRSSIHTC LMYAPLAPII
VICAAIYFWA LYIIHNNQLK FVFDSKETDG KCWKILINRV LIATVFMQLF MVLTCTLKTQ
SAAMAVGAGL PVGIIFLFKM YLRRHYHPDG EVFSQYIDKY EDDDTRHGEW APEYEHELLR
EDWMPKIKTV KNAKLMSVAM REFPKLKELL RVGRKADGEK YRGLMDKKRR KRVREKG
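
The sequences above are displayed in space-separated blocks of 10, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that. The helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tooling. For brevity only the first and last display lines of the gene sequence are embedded; the full sequence would be handled the same way.

```python
def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence, copied from this record.
first_line = "ATGGTTTATC TTCTTTTTCT CCGACTTTTG AAATACCTCT TCTCCGCAAT GTCCGTTCTG"
last_line = "AAAAATATAG AGGCTTGATG GACAAAAAAC GGCGTAAAAG GGTGCGAGAG AAGGGATGA"

first = join_blocks(first_line)
last = join_blocks(last_line)

# A full display line holds 60 bases; the final line holds the remainder.
print(len(first), len(last))

# The gene span begins with a start codon and ends with a stop codon.
print(first.startswith("ATG"), last.endswith("TGA"))

# GC fraction of one line only; run on the full joined sequence
# to compare against the GC content field.
print(round(gc_fraction(first), 2))
```

Running `gc_fraction` on the complete joined gene sequence, rather than a single line, is what would be comparable to the GC content reported in the Gene Information fields.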