Gene CNF01370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01370 
Symbol 
ID3258126 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp401651 
End bp403792 
Gene Length2142 bp 
Protein Length330 aa 
Translation table 
GC content48% 
IMG OID638257261 
Productconserved hypothetical protein 
Protein accessionXP_571513 
Protein GI58268714 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCACTGGGA CCTGTTCTAT ATCAGCTCAA CGTTCATGGT TATCCAGAAC AACCAGGAGA 
TCAGGCAATG CGAATACGTA CCACTGGGGA AATCCTGAGC AAGGTGGACG TGCGCATCTA
CGCCCCCGGG CATAACATAT CCCCCTCTGC GTAGGTTTCA GCCGTTGAAT CTCTCGCACC
AACTTCCTCA TCTACTCACT CGGCATCGAC GACCTTTTCC GCAAGATCCT CTGAGATGTT
TGCAGCGAGA TGGGCGATTT TGCCGTCCTT GATGGCGATA TCAAGCTTTC CAGTATCTCC
GGATGTGACT ACGATGCCAT TTTTCACGAC GAGGTCGAAC TGTTTAGACA TGTTGTTGGG
TGATGGTAGT TCTTGTAATG AGGGACGCAG AAAAAAGGAT GGATACCATT GGGGTAAAAT
GAGCAATCTC ATCCGCCCTC TAAAGGTCTG TAGGTCAGGT CTGCGTGAAA CCCGAAGCCG
GATAAGCTTA CGGCGGAGAT AGGTTATCTC TCTCGTTATT CGGAAACTGC TCAGCCGAAC
CCTTGTTCAT ATTTCAACTC TCATATGGAT CACATTCTTG ACCAACTCGA ACACAATTCA
CAAACTCTCA GCTCTTGAAG ATATCAAATA GCACACTCTA GCATTAGACT ATGACCGTCA
CCATTAACCC TGCTGCCTCC CTTCCCATCA TTTCGCTCGC CGAGCACAAT TCGGTTGACT
CCCTTGCTAG GGCTCTCTAC GATTCTTGTA CTCAAGAGGG CTTTATTTAC GTCTGTGACC
ATGAGATCCA ACAAGATCTC ATTGACCAGG CCTTTGCCAT TTCGGCGAAT TATTTTACTC
ACGCCCGCCC AGAGGATAAA GTCGATCTTA AGACCAACCT TGGCTATACT GCAGTGTGAG
TGTCGCCCTT GGAATCTCGC TTGGATTCAG CTGATGTCCA CTTGCAGCCG ACAAGAAAGG
TAAGCTGATG CGTTGGCTAA GTTAACCATT CACTCATTCG CTTGCCCTTA CAGTCTTGAC
TCCACAAGGC CCAGCTCCGG TGATCTCAAG GAATTCTTCC ACGTTGCTGA TAATCATTGG
CGCGTGAGGA ACGGAGAGAG CCCGCAAGAA CTTCCTGAAG CTCTCGAATC CTCTCGAAAA
GCATTAGACG ACTTTATCGA GCAGATTAAT GGTCTTGCTG ACAGAATCTT GAGGGGTTTG
TCAGTGGCAC TCAAGGTGCG TCCTGATCTG AGCGATGATC AGTGTGAAAT ATGGCTAACC
TGATTACCAG TTGAAGCCCG AATTCTTGAC GAATCAGCAC AGGGGAGAGT AAGTTGTGGA
TTTACCCATA ATGATTTCCA TGCTGACAAA TCGCTAGACT CAACCGACTC CGTATGCTCC
ACTATCCACC TGTTGAAGTG GAACAAAATG GAATCAACTC TGATAGGCAA GCCTATCCAT
TTTGAAGCGC ACGACTACAG ACTGACAGAT TTTTTGTAGC AATGAAATCC GAGCAGGGGC
TCATACCGAC TATGGGTCCA TAACTATCCT CTTCCAGCAC ATTGTATCTG GTTTGCAAGT
TCATCGTAAC GGCTCTTGGA TCGATGTTGC GCCTAGAAAA GGCTGTGTCG TTATCAACAT
TGGCGATGCT CTTGAGTTCT GGTCTGGTGG CTTGTTCAAG GTAGGTCATG GGTGATCGAA
ACGTAGACTG AATATAGCTA ACTTTCCGAT CAGTCCACTC TCCATCGAGT TGTCATGCCC
CGCTCCCAAG CTGAAATGGC TTCTAGGTAC TGTGAGTCAC CCTTCAAGCC CAAGGATCAG
AAGATTAATA ATTGATGAGC CTCTTAGCTA TTGCCTATTT TGTTCATGCC GACAATGCTA
GTATTCTGGA GCCTTTCACT GATGGAATGT GAGTTACAAT ACCCTCAATG ATTTAAAAGC
TCTGAGTGCT GACATCCCTG ACAACAGTGA TGAGGATGCG CTCGACGAAA TTATTGCCCG
CAAAGGACTC CCTCGAGGGA CGCGAAGGAT TACCGGAGGA GATTATGTCC AAGCTCGACT
TGCTGCTACT TATGGTATGA AGGTGGCGGC CTGATGGATA TGATCTGGGG GAATGAAGTT
AGTAAAAGAA CTAGAAAAGT TGTAACTATT GTCGGAGATG TG
 
Protein sequence
MTVTINPAAS LPIISLAEHN SVDSLARALY DSCTQEGFIY VCDHEIQQDL IDQAFAISAN 
YFTHARPEDK VDLKTNLGYT AVRQESLDST RPSSGDLKEF FHVADNHWRV RNGESPQELP
EALESSRKAL DDFIEQINGL ADRILRGLSV ALKLKPEFLT NQHRGELNRL RMLHYPPVEV
EQNGINSDSN EIRAGAHTDY GSITILFQHI VSGLQVHRNG SWIDVAPRKG CVVINIGDAL
EFWSGGLFKS TLHRVVMPRS QAEMASRYSI AYFVHADNAS ILEPFTDGID EDALDEIIAR
KGLPRGTRRI TGGDYVQARL AATYGMKVAA