Gene CNH03220 details

Gene Information

Locus tag: CNH03220
Symbol:
ID: 3259043
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 200117
End bp: 202187
Gene Length: 2071 bp
Protein Length: 403 aa
Translation table:
GC content: 48%
IMG OID: 638258162
Product: conserved hypothetical protein
Protein accession: XP_572491
Protein GI: 58270670
COG category:
COG ID:
TIGRFAM ID:
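
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length, and assumes the coding sequence ends with one stop codon. The function names `span_length` and `min_coding_nt` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp]."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_nt(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (assumption: one stop codon)."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(200117, 202187)  # values from the record above
coding_nt = min_coding_nt(403)

print(gene_len)              # 2071
print(coding_nt)             # 1212
print(gene_len - coding_nt)  # 859 bp not accounted for by coding sequence
```

Under those assumptions, the difference of 859 bp fits the record's "Is gene spliced: Yes" flag: the genomic span is longer than the coding sequence alone. How that remainder splits between introns and any untranslated regions is not stated in the record.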


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0411031
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAATTATATC TATCTTCCCC ATTATCTTGC ATGTCATCAT TCGACGATCA TGTCTAGTCT 
CCACCATCTC GTCCCACTTC TTTTTCTTAT CGCTCCCGCC CTCTCTCAAT ACACTGCAAC
CTATTCACCT TACTCGCTGC CAGAAACAAC TGAACAAGGC CAATATGGCA CCAATGCCTG
CGGGACATCA TCCTCCCAAG ATTCAAAATG TCAGAATGTG TACGTCAATG GAGTAGACGA
TTTCTGTCTC TGGGGACCCC CGGAGACAAC AAGTAATGAA GGAGATGGAA CTAGTAAAAT
TGGGAATGTG GAACAGATAG TCGTTAGGTG GGTTATGTTC TCCCATTACC GCTGTGAAGG
GGAAGAGCGA TGGACCGGGA ACTGCATGGA GTGCAGTCGC GTGACCATTG TGGGGACTGA
TAAATTGAAG TCACGCTAAA AATATCGCAC CTTTATAGCT ATTGCTTGAA GGACGGTTAT
GGCACACGTC TCATTCCTCC CGGTTCAATT ACAGGCGCCC ATTTTGTTAA AGTCCTAAGC
GAAAAGGTCT CGTACGTCCA AGTCACTGGC GTCGGAGATG TAAGTGGGCT CTAGAATCCA
ACAAAGGGAG GTTATATTTA TGATGGATGC ATGCTAATGG TACCTTGTAA CCCCTAAGCT
TACCAAACTT TTGATTCCTG CAGGAGATGA CGGTGGTGAA CTCGACGTAA GTCTATTCGT
TCATTTTAGG GGATAGCTGG CTGATAACTT GCTCCGTAGC CGCACTGTAT GTCTACATAA
TTCTACTAGC CCCACCTACC CATTCGTCCA CAGTTGATAC GGCAGCTTAC TGACTTAATC
GCCAACAGCT TGGACCGGCC TCGGTAACCC GCAAGGTGGT TTAGTATTCA CCAATGCTTT
CACGGGATCT TATGAGCAAA CTCACGAGTG GACTAGCTTC ATGTCTGCAG GTATGTTTTC
TTCCCCAGTC CATTGTGCCA TCTTCCCACC TTAGCTTCTA GTGCTGACTT ACTGCTCTGT
TCGAAGATGA ATTCTGCATA CGAGCATGTC GGGACGGAGA CAGTAAGCTT TCCCCCGTGA
CGGCACCTGG CTTTCGACGC TGATCCCTGT TTACCCCAGA CGCTGCCGCT TATTGCCAAC
ACATCTACGA CGTTTGTGAG TTCTCTTTTC TCCTCTTTTA CCCTTCATTG AGATCTCACC
CATTTCATGC TATAGTAAGC TGCAGCTTCA CAATCCCAGG CGACATGTCC CCCGGCTTCG
ACTCTTGTCT CGGTGAACCC ACTGAAGAAG CACCAGGTGT CTACTCTGGT TCCACTTTCC
GTCAAGGCGA CCCTACGACC CCGGCGCCTC ATCCGGCTGG GGCAACTTCC GAGTGTCAGG
TTTATTCGTC AGTTGGTGGG GGTAACGCCA ACATTGGCCA AGGCGCTGTT GTGTCAGCAG
CGACAACGAC GCGGAGCTCC AGTAGTGCAG AAGAGACGAA CAGCCCGAGC TCAATTTCAG
CAATAGATTC CACTACATCG TACACGTCCT ACACCTCACC TTCAAACACC ACCGTCTTCG
CATCTGCCAC CTCCACCCTT TCAAACTTGA GCACTGTAGC GACCACCACC ACTATCATCA
CTTCCGCCTC CTCATCTTCT GTCCCGTCCT TTTCGTCCAC GTCTTCATCT TCATCTAATG
CTGCGACAAC AACAGCCCTC AACAATGCCC AATCCAACGG ATATCCAAAG GCTATCGAGG
GGATGGGGAT CGCGAGTGTT TGCATAGCTG GAATGTTCGC ATTTTCGTTC TTTATTTGAT
AAGAGCGAGG AACTCCACCC ATAATTTGCC GGGATGCTCG GGGCGGTTGG AATAGGTCGT
TAGAGAGGAG AATGGATATA ATGCCGATCT ATGGTCGCTA GCTTTTCGGT GTCAAATACG
GTTTAGGGGG TGTACGAGTA GCGGTCTTGT TGTACATCAT ACTTGGAGCA TATGGACGTT
ATACGGACAT TATTTGAAGC CGCAGACTTT TTTAAGAATT TATTGCAGTT CTATAAAACC
TTTACCACTT TTTAGTTAGG ATCATATAAT T
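
The gene sequence above is printed in space-separated groups of ten bases, sixty per line, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch of that cleanup follows. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "AAATTATATC TATCTTCCCC ATTATCTTGC ATGTCATCAT TCGACGATCA TGTCTAGTCT"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full sequence block instead of `first_line` should give a length that can be checked against the Gene Length field and a GC fraction that can be checked against the GC content field.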
 
Protein sequence
MSSLHHLVPL LFLIAPALSQ YTATYSPYSL PETTEQGQYG TNACGTSSSQ DSKCQNVYVN 
GVDDFCLWGP PETTSNEGDG TSKIGNVEQI VVSYCLKDGY GTRLIPPGSI TGAHFVKVLS
EKVSYVQVTG VGDLTKLLIP AGDDGGELDP HSWTGLGNPQ GGLVFTNAFT GSYEQTHEWT
SFMSADEFCI RACRDGDNAA AYCQHIYDVL SCSFTIPGDM SPGFDSCLGE PTEEAPGVYS
GSTFRQGDPT TPAPHPAGAT SECQVYSSVG GGNANIGQGA VVSAATTTRS SSSAEETNSP
SSISAIDSTT SYTSYTSPSN TTVFASATST LSNLSTVATT TTIITSASSS SVPSFSSTSS
SSSNAATTTA LNNAQSNGYP KAIEGMGIAS VCIAGMFAFS FFI