Gene CNG00190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00190 
Symbol 
ID3258676 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp50614 
End bp52314 
Gene Length1701 bp 
Protein Length452 aa 
Translation table 
GC content47% 
IMG OID638257633 
Productexpressed protein 
Protein accessionXP_571749 
Protein GI58269186 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.27031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAGCATGTC ACAATCACCC CTCGCCAGGT CCACACGAAG GAGCCTTCCC GCATTTGCTC 
AGCCGCCCTC GAAGTTGTCA AGAAGCAATG TGGCTTCAGA AGACCCAAAT TCGCCCTCGC
CATCGGGGTC CACTCGGACA CCTTTAAAGA AGTTTGCGAC ACCGAGCCGA GAGTCCATCA
GCCGCTATCG CAGTGTTGGT TCCTCTACTA GTACACCGGT GACCACACCA ATAATACATT
ATTCCCCTTA TGCGTTATCC ACTCCGCCAC AAAGTCTGAG CAAGAGTGCT AGTATCCCGT
TTGACATGGT GGCCAGTGCT AAAGCAGCTC GACGGGCTGA AGAGGATGTG AAACTGAGAG
GTGCAGAATC GAGCGCGAAG AAGAAGAAGT TTATCAGGAA GAAGCCTCTC TACCAACGGT
AAGTTGTTGA AGGCTGTAGC CTTAATTATT TACTGACATG TCCTAGGGTC ATTGGATTCC
CTCAGAAAAT TACAGACAAA TTCTTATATC ATACGCCCGC TTCTATTCTG GATATCCTTC
CAGATCCTCA TATGGCAAAT CCCATAGCGT TAGCTATCCA TGTCGTTCAT TATCTGCTTG
TTGCCCCTCT CTTTGCGGCG AAAAGCGATG ACTTTGAGAG CGTTTTGAGG ACAAGTCGTA
CTCGAAACGA TGTGTCAAGC CGCTGGGATG AATGGGAGAA CGAGGAGAAG GGCGGAAAAT
CTGGTCTATT GGGGGGTCGT GTAGTAAGCC TTCTGTACGA CTAATTGAAA AAGCTGATAA
TTTGGTGTAG AGGGCTATCC TTGTACTTTT GCTAATGGCA ATGGCGGTTG GTAACGCAAT
TTACTTATTC ACTCGATTCA GAACATACGA CATGTTGCTT AGAAATGTAA GTAACGGCTG
TGTTTGACTT GATTCATGCT GAAATTTTAC AGGCGCAGGA GACAGTACAC TCTCCACATG
CTTCTCCCGT CCCCGCTCCC AAAGTCAAAG CTGCCAAAGA CGACGATGAC GAGAAGGTCT
TTGAGGCAGC ACCTCGTGCC AAAGCGGAGC CCTGGGCCCC AAAAGTTCTC AAATTTACTG
GCAGATCCCT CATTTTTATA ATCAAGCTGC TTATGTATGT GTAATTTTTT TTTGGTAAGT
AATCATCTAA CGCTATACAG CCATGCGGTG TTTTCAGCTT TCGGCCGTCC GCAAGGGAAT
GCCCCTTCAT TGAAGGACCT TGGTCAAGCT GAGAATAAGA TCCAGTCTTT GCGGGTTTGG
GACCCACCAG AATTTTGTCT CGCCTTCTTT TGGTAAGTGT GCTGTTATTT TTTAGAATTC
AGCTGATACT CTCAATAGCG CATACCCTCC AACTGCGCCC TTCATTACCC ATCTTTTCAC
GCACATCAAC CCTTTCCTTA CCCCTGTTCT TCATGTCTCC ACCACATTCC TCCTTTCCAA
CCTGGCACAA TCATACGCTC AACTCGTCAA AGATCGAATG CTACTTTCGG CAGAGGTTAT
GCGCGAGTAC GATCAGAGGT TTGTGTACAA AAAGATCTTC TCGAACAAGG TAGATAGGGG
AGTGAGCACC AACGAGTGTA CGTAATCTTT TGAATAATCC CTGGGATAAG TCTGACAATC
ATCGTAGCTG AATTCGTTGG TTTTTAAATA CAGTATATGG TCCGTTCGTT CCCTTGAGCA
TTCTGACAAA TGTCCTGCAT G
 
Protein sequence
MSQSPLARST RRSLPAFAQP PSKLSRSNVA SEDPNSPSPS GSTRTPLKKF ATPSRESISR 
YRSVGSSTST PVTTPIIHYS PYALSTPPQS LSKSASIPFD MVASAKAARR AEEDVKLRGA
ESSAKKKKFI RKKPLYQRVI GFPQKITDKF LYHTPASILD ILPDPHMANP IALAIHVVHY
LLVAPLFAAK SDDFESVLRT SRTRNDVSSR WDEWENEEKG GKSGLLGGRV RAILVLLLMA
MAVGNAIYLF TRFRTYDMLL RNAQETVHSP HASPVPAPKV KAAKDDDDEK VFEAAPRAKA
EPWAPKVLKF TGRSLIFIIK LLIHAVFSAF GRPQGNAPSL KDLGQAENKI QSLRVWDPPE
FCLAFFCAYP PTAPFITHLF THINPFLTPV LHVSTTFLLS NLAQSYAQLV KDRMLLSAEV
MREYDQRFVY KKIFSNKVDR GVSTNESEFV GF