Gene CNN00790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00790 
Symbol 
ID3255308 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp257357 
End bp259500 
Gene Length2144 bp 
Protein Length568 aa 
Translation table 
GC content52% 
IMG OID638254495 
Producthypothetical protein 
Protein accessionXP_568620 
Protein GI58262420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.988321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCCATCTCA AACAGAAAAT GAAAAGGCTT TAATCGACAG AATGAGCGTC CCTTCAACTT 
TTCTACCTCA TTTGCCGAGG TCACAATACA CTCTTCACCC ACTCAACCCC ACCGAACTAC
TCTCTTACAT CGCTGATCTC CGAGAGCTAT ACCTACCTCC GATTCATGGT GGATTCCATG
CCGCTGATGT ACTCGAAGAT GGGGCCGATT CCTCCAGTGA AGAGGGCAAT AAGAATGACA
GACGGGTGAA GAAGGAAATT AGAGAGATCA AGAGGAAGCG AAGATTCAGC GCTGGTTTAG
TCGAGACTAT GGAGGGCATG GGTCTCGGTC TCGATGTAGC ATTACCGGCA AAGACCGAGA
CTATCGTCGA AGACAAGATC AATGAAGAAA TGGAGGACAC CGAAGACACC GAAGAAGAGT
ATGAGGAGCT CGAGACGCCG CATTTGGAGC CCTTCGAGAG AGAATGGGCG GAGAGGTGGC
TCAGTGGAGT CGTGAGGAGG GCTCAAGGCT GGCTGGAAGA GAATGAGGGT GATGAGAAAA
CGGAAGGAAC CAAGGACATT GAAGCTATCT TGAGAGATGC CACGGCGGTT CTTGCAATGA
TGGCTGGTAC AAGTGGTACG TTCTGCCGCG TAATTGACTT CATGAAGGAG TACAGCTGAC
ATCGTTTTCA GCTGCGGGAT CTCTTACTCG CCATCTCATC TTCCCCATAG CGGACTACCT
CGGCCCAGCG CTCTCTAAAG TCCGTTCAAG GGTTACCCCA AACCCGATGC ACTCTCCCAG
CACCTCTACT TTTCTCGCTT CGTTCTCCAC CTCTCCCACT TCCCCCCTCA CTCTTCGCAC
GCGACTTCCC CCTTCGCCCA CGACATCTAC TAGTGCGAAC AGAGGCGCGA GACCGCAAAA
GGCCAATCGT TCGCTTTTAC CCATCTTGCT TCACGATGCA CCTATGAGCG ATCATCTCAG
TGTAGGTGTC CAAACATGGG GTAGCGCCAT CCTGCTAGGT CGACAAATCG CTCTTCACCC
GTCAGACTAT GGGCTTTTCC CTCCTTCCGG AGTCAACAGA GGTGTACGAG TCCTAGAACT
TGGAGCCGGG ACAGGTCTTC TTTCCATTCT TTCCCGAAAG CTTCTCGACC TCAATGCCAT
CGCATCGGAC ACCCATTCCG GTTTGGTGGT AGCCACAGAT TTCTTGCCCT CTGTATTGGA
CAATCTCAAA ATATGTGTCG ACCTCAATTT TCCTCCTGCA CTTACTTCGA ATGGAATTGA
GTCTATAACC GATATTGCTC GTAATGAAGG TATACATATT GCCAAGCTAG ATTGGACGAC
TTTCCCCGCT TTCATGGCCA AGGGAGGACA AGGCGACGAA GAACAGATGG GGGTATTTGC
TAGGGACGGA ACATTTGATT TGGTGTTAGC CAGTGATTGT GTGTATGATG AGACACATGC
GAAGTTGTTG AGGGAGGTCG CGGCGTGGGT GTTGAGGTTG CCTGAAGGTG AAAACGACCA
GGGTGGTACC TTCGTAAGTT ATCTTTACCG GAAATCCGTG GTTTTAACGC TGACTATACG
ACAGCACATC CTATCCCCCC TCCGCCCGAC ATTCGCTCCC GAGCTTGAAT CCATCGATCA
ACACTTCCCT CCTCTCTCCA CTTACACTCC ACTATCCGAT CGCTCCGCCG CCGCGGCGCT
CTCACCAGAT CCATCAGCTG TTGCACCGGA GCTCCGCGGA GAGGGTCTTG GTATATCAAA
AGGTCTAAAG CTCGGCACAA GAGGCGAGGG TAAACGAGGA GTCAAGGGTA GGAAGGGTGA
AGGAAGAATC GACGAGGCCC AAGGGTACTG GTGGTGGGAG GTCGGATGGG GATAGACCAT
TAGACACCAT GAAAAGGATT AGATTCCGGA TCTCTTTGCA TACGACGGGT ATTATACTGA
GATTTTGAGC TACATACAAC TGACAGTCAG GGTTCGAGAA CGGTACGAGG TGTGGAGCGT
CCTTGGCTAA CTAAGGGAGA TCGACGTTGG GCCACAGTGA ATGGAGCAGG AGCGAGAGAA
CAGGCATGAC GAGCGCTCCT GAGAGATTTC CTACTTGAGA ACGGGTCCTT TTGTTTCTTT
GGCGGTGACG AGAAGGGTGA ACTTTTGGAG CCGGCATGGT CTGT
 
Protein sequence
MSVPSTFLPH LPRSQYTLHP LNPTELLSYI ADLRELYLPP IHGGFHAADV LEDGADSSSE 
EGNKNDRRVK KEIREIKRKR RFSAGLVETM EGMGLGLDVA LPAKTETIVE DKINEEMEDT
EDTEEEYEEL ETPHLEPFER EWAERWLSGV VRRAQGWLEE NEGDEKTEGT KDIEAILRDA
TAVLAMMAGT SAAGSLTRHL IFPIADYLGP ALSKVRSRVT PNPMHSPSTS TFLASFSTSP
TSPLTLRTRL PPSPTTSTSA NRGARPQKAN RSLLPILLHD APMSDHLSVG VQTWGSAILL
GRQIALHPSD YGLFPPSGVN RGVRVLELGA GTGLLSILSR KLLDLNAIAS DTHSGLVVAT
DFLPSVLDNL KICVDLNFPP ALTSNGIESI TDIARNEGIH IAKLDWTTFP AFMAKGGQGD
EEQMGVFARD GTFDLVLASD CVYDETHAKL LREVAAWVLR LPEGENDQGG TFHILSPLRP
TFAPELESID QHFPPLSTYT PLSDRSAAAA LSPDPSAVAP ELRGEGLGIS KGLKLGTRGE
GKRGVKGRKG EGRIDEAQGY WWWEVGWG