Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA06920 |
Symbol | |
ID | 3253702 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1884512 |
End bp | 1886307 |
Gene Length | 1796 bp |
Protein Length | 444 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253014 |
Product | conserved hypothetical protein |
Protein accession | XP_567009 |
Protein GI | 58259193 |
COG category | [C] Energy production and conversion |
COG ID | [COG5231] Vacuolar H+-ATPase V1 sector, subunit H |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.17206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGCTATAAC CATTCGAGCA GTAGTAAAGT ACGAGACAAC ATGGCCACGA CTGTACACCC TATTCCACCT CCCTTCTTCT CCCCTTACCT AGATGACCAG TGCAACAAGA TCAACGCTAA ACCTGTTCCG TGGGAGGTAA GCTTGCTCGT ACGTGGGTTC GGTATAGGCT GACATCGGTT CGCTGTGCAG GGTTACCAGA GGGCAAAGCT TCTTTCAGCA GATGAACTTT CACTTTTGAA GTCTCTCAGC AAGCTTGTAC GTCGCCCGAC ATTATTGTTA ATTTACTGCC ATTAACAGAT AACATAGCCC TCCGCTCAAC GACCTACGGT ACTGGCTACT CAAGGGCCTC AGTATGCTAA GCTTTACATC GACTTACTTC GAAAGCTTCA AAGAGTAGAT ACCGTTCAGG CGGTGCTTGT ATCTATCAGC GACATGCTCG CTGACAACTC GACCATCCCA TACTTTCACA ATCTTGCATC GCCGGAGCAC CCTGACGATC CTTACGGGCC TATCGTCAAG TGTTTGAGTA TGGATGAAGA ATTTCCTGTC TTGGGAAGCC TGAGGATATT GTCACTTTTG ATTGCGTGAG ATTGTCCTGG GAAAGCATGG CAAAAAGCTA ATGTAGATTC CAGCACCGAT CCCAAGCCCT TCCCCAACGA CCTCGTTCCC ACTTTACTCT CATCCCTCCA AAAGCTCTTA AACGGCAGTC GATTGCCTCT ATGGGAAGTT GCAGCCCAGG TCCTCGGTGC TGTTCTCGGG ACTAAACAGT TCAGGAAGTT CGTATGGAAT GAAGAAAATT GCCTCTCAGG GTGTGTCTAG GAGTTCAGCC ATCTTTGTGT CAAGCTAATC GGCGGGGGTA GGCTTATCAA ATCTTTGAAG ACGAACCCCA ACCCCCAAGC GCAATATTGG GCTATCACTT GTCTTTGGCA ATTGTCGTTC GAGAAAGAAG TGGCGGAGAA CTTGGACAAG AAGTATGATG TCGTGGCGAT CCTGACCGAT ATAGCTAAGG CTGCGGTGAA AGAAAAAGTC ACTCGGGTTG TAGTGGCTAC TTTCAGGGTA AAACCAAATC GTTCAATATA TGTCCATAGC TGACAAGGAC AACTTAGAAC CTACTCGCCA TCGCGCCTTC CCAGAACCTT CCTTCCATGT TTGTTACAAA ACTGCTACCC TTCATTGTTT CTCTTCAGTC GCGTAAATGG TCCGATGAGG AGATTGTTGA AGACCTTGAC TACCTCAAGG ATGAGCTCAA GTCTCGCTTG GATGGGCTTA GCACCTATGA CGAGTACGTC AAGGAGCTTG AGAGTGGTCA TTTAGTCTGG TCACCTGCAC ATGAGACGGA TGACTTTTGG AAGGAGAATG GAATTAGGAT TGGGCAGGAA GAGGGCGGGA AGGCGGTCAA GTCAGTAACA GAGTTCATCT TGATTTGAGC CTTTGCTGAC ATGACAGCAG GCGCTTAGTC GAGCTTATCA CGACAAGTAA AGATCCTCTT GTTCTTGCTG TTGCCACGCA TGATATCGGT CAGTTTGTCA AGTACGGTGG TGACCGATCT AAACAGTATG TATATCACTT TGTTGTTCAG TGCCTATCTA ACATATACAT AGAATCATCG ACAACCTGCA CGGCAAGACG CGTGTGATGG AACTGATGAG CCACGAGAAT GCGGACGTAA GGTATCAGGC GTTGATGACG GTGCAGAGAT TGATGAGCCA ACACTGGTCA AAGTAATTTG GAAATGAAAT CATCAAACTA GTTTGATCAC ACCAGAGTTG TAAAGCATGT ATGGAAACAT TTAGAC
|
Protein sequence | MATTVHPIPP PFFSPYLDDQ CNKINAKPVP WEGYQRAKLL SADELSLLKS LSKLPSAQRP TVLATQGPQY AKLYIDLLRK LQRVDTVQAV LVSISDMLAD NSTIPYFHNL ASPEHPDDPY GPIVKCLSMD EEFPVLGSLR ILSLLIATDP KPFPNDLVPT LLSSLQKLLN GSRLPLWEVA AQVLGAVLGT KQFRKFVWNE ENCLSGLIKS LKTNPNPQAQ YWAITCLWQL SFEKEVAENL DKKYDVVAIL TDIAKAAVKE KVTRVVVATF RNLLAIAPSQ NLPSMFVTKL LPFIVSLQSR KWSDEEIVED LDYLKDELKS RLDGLSTYDE YVKELESGHL VWSPAHETDD FWKENGIRIG QEEGGKAVKR LVELITTSKD PLVLAVATHD IGQFVKYGGD RSKQIIDNLH GKTRVMELMS HENADVRYQA LMTVQRLMSQ HWSK
|
| |