Gene CNG00020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00020 
Symbol 
ID3258654 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp3211 
End bp4659 
Gene Length1449 bp 
Protein Length183 aa 
Translation table 
GC content48% 
IMG OID638257614 
Producthypothetical protein 
Protein accessionXP_571718 
Protein GI58269124 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.686504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCACCG CCCTCGTCAG CGGGCAAAAG CAACATCCTG AGCGCTTCCT TTGCCAGTTC 
AGGGGCACTC GTCCTCGATG CAGCAGCTAC GTGTATTCTT CGGTGTTTAA GTTCGGCGAG
GATAGAGGGT ACTTCATGGT AGAAAGAAAG ATTTTGTCCA CGCCTGCATT CCGTGAGTAG
CTTGTTACGC GAGCGTACAA AAGAGACAGG GACGAAAGAC TCACCGATCG ACAAGCTGGT
TAACAACGTC TCCATTACGC TTGAGTGGAG GAATAATATG CGTCTATTGA CCATTCTTCG
TTAGTGCTCG TCTCCTAGTT GACAATACGT CGCCCCGTGA AACGCAACGC CCAAACATAC
ATCTATCCAC AGATCCCAAA GGGTATAACT GGTGCATTGG ATATATCAGC CTGCTTATCA
GGATATTATT AAAAAAGTGT AGATACTCAA GGTCGAACGC CACAAGTAGA GGGTAAGCAT
CCAGATCATT CGGCGAAACG GTTCGATACT TTTCGGCAGG TGGTGAAGGT TCGCGGCGGG
AGGCGGCTCT TCGAGGCATC GTTTGGACTT TGTGAAAATG TAAATTAACG AGCTTTCTAC
CCTGCCCGTA AGTTGAAAGG TATTGAATAC GTGAACGAGG TGTAGTTGCA AGTCTCAAAT
GTAATGGCGA TCTTTCGATA TGTGATTCGG TCCGTCGTTG GGTCGTTGTT CGTGCTTTCT
TCGATCCAGT GAGAACAAGT GATGACGTAA GGGTTTTTTT ATTTTCTCAC GGTCGACTTC
TGTTTATCCC CCGGGCAATG TCTCCCGATA GCGGACGGAC CTTGAGTTAA GCCTTAAGCC
TTAGATTGTT GCACCTCGAA GAACATCCTG TAAATAATCG ATAGCCCTCG TAATTCCAGG
AGCAAACAGT GAATATTTAT CATTATGGCT GCTGAAAAAC AACAACAACG TATGACCTAG
CCCCTCTTGA AAGTTCGTCT CTCGCTAATT CGGTTGTGCA GCCTTGCCTC TTGGGAAACC
TGTTCCCGCT AAAGAAGGAG AGGAGAAAGT GGAATTGAGT GATGAAGGAG ATGAAGACAG
AGAGGATCTT GAGGAGGAAA ATGATGAAGA CTTCGATGAG GACGAAGACG AGGACGACGA
GGATGAGGAT GAGAATGAGA ATGAGAATGA GAATGAGGAT GAGGATGAGG AAGACGAAGA
CGAAGAGGGC GACGATGGTA TTGACCACAA GAAGGTGCTG TCGGACTTTT ACAATGTAAG
GCCTAGGGCC TTGCCACAGG ACCTAGTATG ATATCGGAAC TGTGCTAATG GGCCGTTTGG
TAGACCGAAC AAGTGGACGA AGAGGATGAT GAGGATGTCA TCGAAGGCAA AGAGGATGCG
GGAGTCAGCA ACCTGAAGCG AAAGGCGGAC GGTGAGGAAC ATGGCGAGGC AAAAAAAAAC
AAGGCCTAG
 
Protein sequence
MVTALVSGQK QHPERFLCQF RGTRPRCSSY VYSSVFKFGE DRGYFMVERK ILSTPAFPLP 
LGKPVPAKEG EEKVELSDEG DEDREDLEEE NDEDFDEDED EDDEDEDENE NENENEDEDE
EDEDEEGDDG IDHKKVLSDF YNTEQVDEED DEDVIEGKED AGVSNLKRKA DGEEHGEAKK
NKA