Gene CNC05890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC05890 
Symbol 
ID3256833 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1741087 
End bp1742871 
Gene Length1785 bp 
Protein Length433 aa 
Translation table 
GC content55% 
IMG OID638255810 
Productconserved hypothetical protein 
Protein accessionXP_569812 
Protein GI58265312 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.970035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAAC AAACCCCGCT GCTGTCCCCC TCCCCGTCCA CCGTCTCGGC CCCCCCCGCC 
GCCCGCAGAC GGACATCCCT CCTCCTCGCC CTCGCCACCC TCCTCCTCCT CGCGGCCGGC
CTCTCCGTCG GCCTCGTCGT CCGGCATGCC CATAAGGAGC CCAGCGATGT CCTCGAAAGG
GCAAAGCTTT ACCTCAAAGC GTGCACATCC CGCTCCATCT GCAGCCGGCC GGATGCTGAC
TGCGCTGTAG CTCGCCGGTG ATCGATGGCC ATATCGACCT CCCCGAATTC GCCCGTGCGG
TCTACGGCAA CAATATCGAA AAGTTTGATT TGCGTGGTGC CCTCGTACGT CGTTCGCCTC
TTTCTTTGCT TTTCTAGACA GAAAACTGAC GAGATGGTGT CTAGCCCGGA CACTTTGATA
TCCCTCGGGC CAGAGAGGGC CATCTGGGAG CATTCTTCTG GTCCATGTCA GTCCTCTCAA
TCGTTGACTA CCATGGGCCA TGCAATTAAA CTCTTGACCT TGTTATAGCT TTACCGAATG
CCGCGATACG AATGGCGACG ACTTTATGAA CCCGACCTTT GAAGTTCGAG GCGAGTCCGA
AAAACATTCT CCTTCCCATA ACCGTCTTCC CCTTTCCACT ACTAACCTTT TTGCTCTAGA
CGCCCTCGAA CAGCTCGACG TGTCAAACAA CCTCATCTCC AAATACAGCG ACACATTCGC
CGTCGCCCGG ACCGCCGACC AAGTCGAGTG GGCTATAAAG CACGGCAAGA TTGCGAGTCT
TTTCGGGCTG GAGGGTGCGC ATATGCTTGG CAATTCTCTT GCAGTGTTGA GAATGTACCA
CCAGCTCGGG GTGAGGTATA TGACCCTCAC GCATAGCTGT AACAACGCGT TTGCCGATTC
GGCCGGTATC TTTGGAGACG TCAAAGAACG TTGGGGCGGT CTGAGCCCCC TGGGCAAAGA
ACTCGTTCCA GAGATGAACC GACTCGGAAT CTTCATCGAC CTCTCCCACG TTTCCGACCA
AACTGCCCTC CAGGCGCTGG ACCTGACAGA AGCACCCGTC ATCCTCTCGC ATTCCTGTGC
GAGGCATTTC AATAAGATGA ACAGGAATGT ACCGGACGAG GTGCTGGCTA GGTTGGGCAG
CGGAAAGGGA AAAGTCGATG GGGTGGTGAT GGTCAAGTGA GTCTGTAACG CTTCTTTAAT
TAACAGCGGA AAACTGATAG ACCCCGCTTC ATTACAGCTT CTTCCCCGTA TTCGCCTCTC
CCAACCCGGA CCTCGTCGAC GTCGCATACA TCGCTGATGA AATCGAGTAT ATCGCCAATA
AAACTAGCAG GGATCAGTGA GTCACCCCGT CACATTTAAA AACTTGATTC TCAAAGCGCT
GACTGTTTTG GGGGTGGTGG CGCCTGACAG TGTCGGGATC GGATCAGATT ACGACGGGAT
TGAATCAGTG CCCAAGGGTC TTGAAGACGT TTCCAAGTAT CCTTACCTCG TACGTCTCTG
ACTTTTGTCG TTTGCCGTTT TACCCCTGCT AACGTCGCCC CCCGGCCGGC CGGACAGTTT
GCCGAACTAA TCAAACGCGG TTGGTCCAAG AACGATCTCT CCAACCTCGC CGGCGGGAAC
CTCCTCCGCG CCATGCGGGG GATGGAAGAC GTGAGCCGTC GGATGAGGGA CGAACAGGGT
AAACAGCCAA GTATGGCGAA ATATGATAAG CGGAGGGATT TGGATGGGGG CGATTGGGAT
TTCTAGCCGG GGGGGGGGTG AAAATTGGAT TTGCGAGAAG AAACG
 
Protein sequence
MAEQTPLLSP SPSTVSAPPA ARRRTSLLLA LATLLLLAAG LSVGLVVRHA HKEPSDVLER 
AKLYLKASPV IDGHIDLPEF ARAVYGNNIE KFDLRGALPG HFDIPRAREG HLGAFFWSIF
TECRDTNGDD FMNPTFEVRD ALEQLDVSNN LISKYSDTFA VARTADQVEW AIKHGKIASL
FGLEGAHMLG NSLAVLRMYH QLGVRYMTLT HSCNNAFADS AGIFGDVKER WGGLSPLGKE
LVPEMNRLGI FIDLSHVSDQ TALQALDLTE APVILSHSCA RHFNKMNRNV PDEVLARLGS
GKGKVDGVVM VNFFPVFASP NPDLVDVAYI ADEIEYIANK TSRDHVGIGS DYDGIESVPK
GLEDVSKYPY LFAELIKRGW SKNDLSNLAG GNLLRAMRGM EDVSRRMRDE QGKQPSMAKY
DKRRDLDGGD WDF