Gene CNE00020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE00020 
Symbol 
ID3257642 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp1785 
End bp3474 
Gene Length1690 bp 
Protein Length405 aa 
Translation table 
GC content52% 
IMG OID638256584 
Productconserved hypothetical protein 
Protein accessionXP_570691 
Protein GI58267070 
COG category[T] Signal transduction mechanisms 
COG ID[COG3448] CBS-domain-containing membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGAA TTCTCACAGT GAAAAACTAT GAAGGAAACT ATCCAGGTTC GAGTCGCGCC 
GCCTGCAAGC CGGGTATGTA TATGATTTTA CAGAGCTTAA TAGGGCTTAT AGTGTCATAA
GCTAATGCTC TCCCGGCTGT TAGATCGGAT CAAAAATATT CATGCGCTAT TTATATACAT
CAGCGGCGAT TACCCTCCGT CCTGCCCGTC TGTCACATAC AAAAAATTTC CACCACAGCG
CTTATCGGGC ACCATGCTTT TGATTCTCTC CCCCAGCTGT TAGGGTGATG CACGGGTACG
TCAGCCGTTA TTTAGCGTTC GTGAGTAGTT TGCAATTCGT TCTCGATTCT ACATCGCAAA
AAAGGACAAT GACGTCGCCG GCTCGCACTG TATCAAAGGT GACATGAATA GCTAAAGCGA
AAAGAATGGA AAAAAGCCTT ACCATCTTGC CCCCTTTGCT TCCCCATTAT TTCCGCCGCC
CGGTTCCTCT TCATCTTCGT CCCGTCGCTG CAACCCCTCG TTTTCCCATT TTTTCCCGAC
TTCGAGACCC CACTGTCTGG CTCCGTGTCT TCTCAATCTA AAGTCGTCCC TGCCGCATGG
TCAGTACGTC GAACAACGAG AAGGCGAGAA AACCCACATC TCGCCGCCAT CGCTACGAAC
GCTTTCTTCT ACGAGATGAT TATCGGGGAC GGCTTCCCCG ATGGATCTCG CGTTTCACAG
GCTACCGTCA GCCCGGCGAT GAGCCGCCTT ATGATCCTCT ACCGTTCCCG CCTTTTAACT
GGTTGGTCAA GATTCCTTTG AGGGTCGAGG TATGGATATT CGCTTGGATA GGCTGCTTTG
GAGGGATCCT CCTCATTGAA GCCATCATGT GCACCAACAC AGCTTTTCGC AATGTGTACT
CCTCCCCTAT CATCATCACT TCCTTCGGCG CATCGGCCGT CCTTCTTTTC GGTGCAATCG
AGTCACCGCT TGCTCAACCT CGCAATTTCA TTGGCGGACA CTTCGTTTCG GCTCTGGTTG
GGACGGCTAT CACCAGGCTC TGGGTCCTCA ACCCACGTTA CCAAGATTAT CTGGATAATA
CAGGCTTTCA TGGCAACACT TTTGTGAATG GGGGACTGTG CATGGCAACC GCGGCCTTGG
CGATGCTGAT CTCAGGAATG GTCCATCCAC CGTCAGTGCA GCACCTATCA GATTTCCAAA
ATGCGCAGGC GCTTATGACC TAAGCCTCAG GTCCGGGGCC ACTGCACTCA ATGCTGCAGT
CCAAACCTCG GTCGTTTCCT TGTCTTGGCG GTACCTACCT GTCGTTCTCG CCTCCGCCCT
CATCATGCAA GGATGGGCCC TCATCATCAA CAATCTGGGC CGCCGGCGGT ATCCGATCTA
TTGGTGGTCA CCAAAGCAGG TATTCGTCCG ACCCGAGGTC TTAGAACACG AGAACGACGA
AGAAACAGCG CTACGAACGC TTCAAGAAGG CCCTCTTCGT AGTGCGGAGG ATGCGGGAAG
GACCAGGGAG ACGTTGTTGG AGGCTAGAAT GCAAGGAGAG GGAGCAGGCG GCGCGGATTT
TATGCAGGAT CCATCCGGAC AAAATAATAT TCCCATGGGG GCCGTCGAGA GTCCGGAAAT
TGGTGTTCCA GAGCCGCGAT TAAAACCGAT GACTGAAGGC GACCGACGTC GAGCAGAAGG
CTTTGATTAA
 
Protein sequence
MRRILTVKNY EGNYPGSSRA ACKPALPSCP LCFPIISAAR FLFIFVPSLQ PLVFPFFPDF 
ETPLSGSVSS QSKVVPAAWS VRRTTRRREN PHLAAIATNA FFYEMIIGDG FPDGSRVSQA
TVSPAMSRLM ILYRSRLLTG CFGGILLIEA IMCTNTAFRN VYSSPIIITS FGASAVLLFG
AIESPLAQPR NFIGGHFVSA LVGTAITRLW VLNPRYQDYL DNTGFHGNTF VNGGLCMATA
ALAMLISGMV HPPSGATALN AAVQTSVVSL SWRYLPVVLA SALIMQGWAL IINNLGRRRY
PIYWWSPKQV FVRPEVLEHE NDEETALRTL QEGPLRSAED AGRTRETLLE ARMQGEGAGG
ADFMQDPSGQ NNIPMGAVES PEIGVPEPRL KPMTEGDRRR AEGFD