Gene CNA06920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA06920 
Symbol 
ID3253702 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1884512 
End bp1886307 
Gene Length1796 bp 
Protein Length444 aa 
Translation table 
GC content47% 
IMG OID638253014 
Productconserved hypothetical protein 
Protein accessionXP_567009 
Protein GI58259193 
COG category[C] Energy production and conversion 
COG ID[COG5231] Vacuolar H+-ATPase V1 sector, subunit H 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.17206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGGCTATAAC CATTCGAGCA GTAGTAAAGT ACGAGACAAC ATGGCCACGA CTGTACACCC 
TATTCCACCT CCCTTCTTCT CCCCTTACCT AGATGACCAG TGCAACAAGA TCAACGCTAA
ACCTGTTCCG TGGGAGGTAA GCTTGCTCGT ACGTGGGTTC GGTATAGGCT GACATCGGTT
CGCTGTGCAG GGTTACCAGA GGGCAAAGCT TCTTTCAGCA GATGAACTTT CACTTTTGAA
GTCTCTCAGC AAGCTTGTAC GTCGCCCGAC ATTATTGTTA ATTTACTGCC ATTAACAGAT
AACATAGCCC TCCGCTCAAC GACCTACGGT ACTGGCTACT CAAGGGCCTC AGTATGCTAA
GCTTTACATC GACTTACTTC GAAAGCTTCA AAGAGTAGAT ACCGTTCAGG CGGTGCTTGT
ATCTATCAGC GACATGCTCG CTGACAACTC GACCATCCCA TACTTTCACA ATCTTGCATC
GCCGGAGCAC CCTGACGATC CTTACGGGCC TATCGTCAAG TGTTTGAGTA TGGATGAAGA
ATTTCCTGTC TTGGGAAGCC TGAGGATATT GTCACTTTTG ATTGCGTGAG ATTGTCCTGG
GAAAGCATGG CAAAAAGCTA ATGTAGATTC CAGCACCGAT CCCAAGCCCT TCCCCAACGA
CCTCGTTCCC ACTTTACTCT CATCCCTCCA AAAGCTCTTA AACGGCAGTC GATTGCCTCT
ATGGGAAGTT GCAGCCCAGG TCCTCGGTGC TGTTCTCGGG ACTAAACAGT TCAGGAAGTT
CGTATGGAAT GAAGAAAATT GCCTCTCAGG GTGTGTCTAG GAGTTCAGCC ATCTTTGTGT
CAAGCTAATC GGCGGGGGTA GGCTTATCAA ATCTTTGAAG ACGAACCCCA ACCCCCAAGC
GCAATATTGG GCTATCACTT GTCTTTGGCA ATTGTCGTTC GAGAAAGAAG TGGCGGAGAA
CTTGGACAAG AAGTATGATG TCGTGGCGAT CCTGACCGAT ATAGCTAAGG CTGCGGTGAA
AGAAAAAGTC ACTCGGGTTG TAGTGGCTAC TTTCAGGGTA AAACCAAATC GTTCAATATA
TGTCCATAGC TGACAAGGAC AACTTAGAAC CTACTCGCCA TCGCGCCTTC CCAGAACCTT
CCTTCCATGT TTGTTACAAA ACTGCTACCC TTCATTGTTT CTCTTCAGTC GCGTAAATGG
TCCGATGAGG AGATTGTTGA AGACCTTGAC TACCTCAAGG ATGAGCTCAA GTCTCGCTTG
GATGGGCTTA GCACCTATGA CGAGTACGTC AAGGAGCTTG AGAGTGGTCA TTTAGTCTGG
TCACCTGCAC ATGAGACGGA TGACTTTTGG AAGGAGAATG GAATTAGGAT TGGGCAGGAA
GAGGGCGGGA AGGCGGTCAA GTCAGTAACA GAGTTCATCT TGATTTGAGC CTTTGCTGAC
ATGACAGCAG GCGCTTAGTC GAGCTTATCA CGACAAGTAA AGATCCTCTT GTTCTTGCTG
TTGCCACGCA TGATATCGGT CAGTTTGTCA AGTACGGTGG TGACCGATCT AAACAGTATG
TATATCACTT TGTTGTTCAG TGCCTATCTA ACATATACAT AGAATCATCG ACAACCTGCA
CGGCAAGACG CGTGTGATGG AACTGATGAG CCACGAGAAT GCGGACGTAA GGTATCAGGC
GTTGATGACG GTGCAGAGAT TGATGAGCCA ACACTGGTCA AAGTAATTTG GAAATGAAAT
CATCAAACTA GTTTGATCAC ACCAGAGTTG TAAAGCATGT ATGGAAACAT TTAGAC
 
Protein sequence
MATTVHPIPP PFFSPYLDDQ CNKINAKPVP WEGYQRAKLL SADELSLLKS LSKLPSAQRP 
TVLATQGPQY AKLYIDLLRK LQRVDTVQAV LVSISDMLAD NSTIPYFHNL ASPEHPDDPY
GPIVKCLSMD EEFPVLGSLR ILSLLIATDP KPFPNDLVPT LLSSLQKLLN GSRLPLWEVA
AQVLGAVLGT KQFRKFVWNE ENCLSGLIKS LKTNPNPQAQ YWAITCLWQL SFEKEVAENL
DKKYDVVAIL TDIAKAAVKE KVTRVVVATF RNLLAIAPSQ NLPSMFVTKL LPFIVSLQSR
KWSDEEIVED LDYLKDELKS RLDGLSTYDE YVKELESGHL VWSPAHETDD FWKENGIRIG
QEEGGKAVKR LVELITTSKD PLVLAVATHD IGQFVKYGGD RSKQIIDNLH GKTRVMELMS
HENADVRYQA LMTVQRLMSQ HWSK