Gene CNN01800 details

Gene Information

Locus tag: CNN01800
Symbol:
ID: 3255311
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006683
Strand:
Start bp: 517538
End bp: 519401
Gene Length: 1864 bp
Protein Length: 551 aa
Translation table:
GC content: 56%
IMG OID: 638254598
Product: conserved hypothetical protein
Protein accession: XP_568684
Protein GI: 58262548
COG category:
COG ID:
TIGRFAM ID: [TIGR00756] pentatricopeptide repeat domain (PPR motif)
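
The gene length listed above is consistent with treating the start and end coordinates as 1-based and inclusive. A minimal sketch of that arithmetic follows; the function name `inclusive_length` is hypothetical and not part of any database API, and the inclusive convention is an assumption inferred from the numbers in this record.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both counted.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(inclusive_length(517538, 519401))  # 1864
```

Because the gene is marked as spliced, this span is longer than three times the protein length.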


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCAGGC TCTGCCGCAC ATGCCGCCTC GGCGCATACA CCCCGCGCAG GACACTTGCC 
ACCGCCACCG CCACCGCGCC CCTTCCCGAC CTGCCCCCGC GGCGGCCGCA GTTCCAGCCA
CAACGCCCAG ACAGAGCAGG CCGGGAACGA GACCGCTTCC GGACTCCCGA CAGCAACAGA
CTCCGCCTCG CAAACATCCT CCAGACACTC ACGAAATACA AGGCAGAGAA CCGCTCGCCG
ACGCCGGTAG CGTATGTCAA CATTATCGAA GCAGCGAGCG AATTCGCGTT GAGCCACAGG
GTCGACGGGG ATCAAGGCGA CGGGCTGGGG TTCCAGGTTG CGTTGGCGGC GTGGGAGGAT
GCGAAGCGGG GAGGTGTTGA GCTGGGGCAG GAAGGTGTTG ATGCGATGAT GAATGTGAGT
GGATACCGGT TTTTTTGATT TGTGAAATGG TGTGGGCTAA TAGTTGTTCA GTTTGCGGTG
ATCTATCCCC AGCTGCTCTC TTCCCTCCTT CTTTACACCA AAACCCGCCG TCTCGCGACG
TACAATGCCA TGTCCAGAGT GGCCTCTTCC AGCTTTGATG TCGAGCAGAT GGTTTACCTT
TTGGAGGAGA TGTCCCAGCA AGGGTTCGTT CCCAACACTG CCACTTTGAA ACATACAGTC
CGCCAGGCAT GTGAATGGGG ATACCCCCGA TTGGCTCTTC AGATTGCGCA AAAGGCCGAG
GAAGAGTCTA GTTTTGGGTT CAGGCTTGAT CAGAGTGCGT GGGTTCAGAT CCTCATTGCG
AGTGCGGACA ATCACTATGT ATGTTCCTAC CCGTTTATTT GCTACGTCTC TTGTCGCTAA
ATATTTCAAA ATTGCAGTTG AACGGTGTCG AGACCGCATG GGAGCGTGTC AAGTCCAGCT
ACACCCCGGA CGAGGGCCTC ATTCTCTCCA TGCTCAACGC CGCCGGCAGA TGGGGTCGAC
CCGATTTTTC ATCCACCATC CTTGAACTCC TCCCCGGTCC GCCCCAAGAA CATCACCTCG
CCCCTCTCCT CGAAGCATTC TGCAACGCCG GCCAAGTGCC CAACGCTTTC CACGTCATCA
GCACCATCCG CTCCACCGGC CTCACCCCGA CCTTGTCCTC CATCCAGCCG ATCGTGAACG
CGTTGAAATC CGCAGAGGTC ATTGACCAGG CGTACTATAC TCTGGAGGAT ATGCACAAAT
CCGGCCAGGC GGTGGATATC ACAGCGTTGA ACGCTGTGAT TGCGGCTAGC AGTTCTATCG
GTGATCTCCA GCGTGCTCGG GCTACCCAGA GCGCGATCCC AGAATTCGGC ATGACGCCCA
ACATTGATAC ATACAACCTC GTCCTCCAAT GTTGCGTGAC CACCTCTCAC CGCCCATTAG
GCGATACCCT CCTCTCCGAA ATGGCTGCCC AGAATGTCCA GCCCAACGCT ACCACTTACG
ACCACCTCAT CCACCTCTGT CTCACCCAGC CTTCTTACGA AGACGCATTC TACTATCTCG
AAAAAATGAA AGCTGGCGGC TTCAAACCCG GCTACGCCGT CTACGCTTCC CTCGTGAAAA
AGTGTGTCAA GATGGGCGAT TCGAGGTGGA GGTTGGTAGT CGATGAGATG AAGGATGTGG
GGTACAAGAT TGAGGCCGAG TTGCAAGGGT TTATTAATAA TGGAGGAAGG GAGAGGGGAA
GACAGGCGGC GGGGCAGAGG AGGGCGAATG ATCAGATGGT GGGGAGTAAG AGACGGAGCT
GGATAAGGCA GGCGGCTGAG GAGGCTGTGT AGCGGTGTGG AGGGCTTTTT TCTGGTGGAG
ATTTATTTAG GAGCATGGTT TTTCTCCTTG CACCAACAGG GCTCATGCAT TGCATCATGA
TTCA
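
The sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that; `gc_fraction` is a hypothetical helper, and how the database itself derives its reported GC content is not stated in this record.

```python
def gc_fraction(blocks: str) -> float:
    """Fraction of G and C bases in a sequence given as whitespace-separated blocks."""
    # Remove spaces and line breaks, then normalise case.
    seq = "".join(blocks.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small illustrative input (not taken from the record): 6 of 10 bases are G or C.
example = "ATGC GCGC\nAT"
print(round(gc_fraction(example), 2))  # 0.6
```

Passing the full gene sequence block to the same function gives a value that can be compared with the GC content field in the Gene Information table.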
 
Protein sequence
MLRLCRTCRL GAYTPRRTLA TATATAPLPD LPPRRPQFQP QRPDRAGRER DRFRTPDSNR 
LRLANILQTL TKYKAENRSP TPVAYVNIIE AASEFALSHR VDGDQGDGLG FQVALAAWED
AKRGGVELGQ EGVDAMMNFA VIYPQLLSSL LLYTKTRRLA TYNAMSRVAS SSFDVEQMVY
LLEEMSQQGF VPNTATLKHT VRQACEWGYP RLALQIAQKA EEESSFGFRL DQSAWVQILI
ASADNHYLNG VETAWERVKS SYTPDEGLIL SMLNAAGRWG RPDFSSTILE LLPGPPQEHH
LAPLLEAFCN AGQVPNAFHV ISTIRSTGLT PTLSSIQPIV NALKSAEVID QAYYTLEDMH
KSGQAVDITA LNAVIAASSS IGDLQRARAT QSAIPEFGMT PNIDTYNLVL QCCVTTSHRP
LGDTLLSEMA AQNVQPNATT YDHLIHLCLT QPSYEDAFYY LEKMKAGGFK PGYAVYASLV
KKCVKMGDSR WRLVVDEMKD VGYKIEAELQ GFINNGGRER GRQAAGQRRA NDQMVGSKRR
SWIRQAAEEA V