Gene CNH03030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH03030 
Symbol 
ID3259185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp252199 
End bp254199 
Gene Length2001 bp 
Protein Length621 aa 
Translation table 
GC content54% 
IMG OID638258182 
Producthypothetical protein 
Protein accessionXP_572473 
Protein GI58270634 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCGCA CTTCTCGTCT CATCCCCTTG AAGAAGGACG ATGGCTCTAT CCGACCTATC 
GCTGTTGGTG AACTTATCTA TCGGCTATGT GCGAAAGCTC TCATCATCTC GCATTTCCAA
CCCGACTTCC TCCTCCCGTT CCAGCTCGGG GTCAAGTCAA TCGGTGGTGT AGAGCCGATC
GTGAGGCTGA CAGAGAGAGT CTTGGAGGGT TCTGCCGGCG CTGAGTTCTC CTTTTTAGCC
TCGCTCGATG CTTCTAACGC TTTCAACCGT GTAGATAGGG CCGAGATGGC AGCTGCGGTC
AAGACCCATG CGCCGACGCT TTGGAGGACT TGCAAATGGG CCTATGGCGA CTCGTCCGAC
CTTGTGTGTG GCGACAAAAT CCTTCAATCC TCTCAAGGTG TTCGACAGGG TGACCCCTTT
GGCCCTCTCT TCTTCTCAAT CACCCTCCGA CCAACCTTGC ACGCCCTCAG TCAATCGCTA
GGTCCGTCTA CGCAAGCGCT CGCTTATCTC GATGACATCT ACCTCTTCTC AAACGACTCG
CAAGTCCTCA GCAAAACTAC CCAATTCCTC GCCGACAAGC AGCACATCAT CAAGCTCAAC
GAAAAGAAAT GCAAGTTAAT CAGCTTCGAT GAGATCAGGC AGGAGGGCTT CAAGATGCTA
GGGACGATGG TAGGTGGTAA GGAGAAGCGG GCGGAGTTTC TGGAAGGCAG GATTCGGAAG
GAAATGGCAA AGGTGGGCAA GCTCAAGGAT CTTCCACATC AACACGCGCT CCTTCTATTA
CGGTTCTGCA TCCAGCAAAA TCTACGACAC CTACAGAGAA GCCTACGCTC CGACGACCTT
GTAGATCTAT GGGAAAGACT GGACACGATG CTGTGGGAGG AGGTGAAAAG GATGAGGATG
AGGCAGCGAG AGGATACAGT GGAAGAGGAG ACTCTAGGGA GATCGTTGAC GAAGCTACCA
GCGCGACTGG GCGGACTAGG TCTACTTTCC TTCAAAGATG TAGCCCCCCT TGCTTACCGC
TCGGCAGCCG AGGCCTCCGA CACTCTCCTC GATAACCTAG GTCTCCTTTC TTCGCCTGAG
GAACCTCCAA CTCCGGTCCC CCAACGAACT CGATGCGCAG AACTCTGGGA ATCGCAACAG
GAAGCCATTC TACGTAATCT CGGCGACACC GAACGCAAGC GACTCACCGA GAATGCCTCC
AGACTCGGCC GAAGTTGGTT ATCAGTCATC CCTTACCTTC AGCCCCTGCG CCTTTCCAAC
GTCGAGATTG CCTCGGGTCT CCACGACCGC ACCCTGGTCG GCTCCTCGAT ACCTGTCTGT
CGCTTCTGTG GGTCGGACTC ACCTTTGGGT CACGACGAGC TTTGCCGCGC CCGCAACCCC
TGGACCCAGC GCCGGCACAA TGCCATCAAC CGCGTCATCT ATCAACACCT CAAACAAATC
CAAGGTGCCA CGGTTGAGAT TGAGCCCCAC ACGCTGTCGG GACAAAGGAG AAACGACCTT
CGGGTCAGAG GTTCCAGCGC GTTGGCCTTC ACTGACTACG ACCTGAAGGT ATACTCCCTC
GGAGACCGAG ACGCGAGAAG CACCGTCACA CCCTGCGCCC CCAACGGCAA GCTAGCCGAC
TTCTGCTTGG ACCGGTGCGT GAACTGGCTC GACAAGGTGG GTCAGGTCGT CTCGAAGAAC
GCTCCGAAAG TCACTGGTGG GGTCTTTAAA CCGATCATCC TTTCCACTGG TGGCCTGATG
AGCAGGAGCA CAGCAGACGA ATGGAAGGAG TGGAGGGAGG CGATGCCGGT GGGGGGGTTC
GAGAAAATGG AGAAACGGAT TGGTGTCGAG CTAGTAAAGG CAAGGGCGAG GACGCTGGTC
TTGTGAGGAA GAGGAGGTTG GATTATTTTT TTTTCTTTTC TTTAATAAGT TGTTTATTTA
AGTAGTTTCT TTCATTCGGG TAACACACAC GACAACCCAA TAAATTAAAC AACGAAAAAA
TGCAACCTCT ATAACCCCCT A
 
Protein sequence
MLRTSRLIPL KKDDGSIRPI AVGELIYRLC AKALIISHFQ PDFLLPFQLG VKSIGGVEPI 
VRLTERVLEG SAGAEFSFLA SLDASNAFNR VDRAEMAAAV KTHAPTLWRT CKWAYGDSSD
LVCGDKILQS SQGVRQGDPF GPLFFSITLR PTLHALSQSL GPSTQALAYL DDIYLFSNDS
QVLSKTTQFL ADKQHIIKLN EKKCKLISFD EIRQEGFKML GTMVGGKEKR AEFLEGRIRK
EMAKVGKLKD LPHQHALLLL RFCIQQNLRH LQRSLRSDDL VDLWERLDTM LWEEVKRMRM
RQREDTVEEE TLGRSLTKLP ARLGGLGLLS FKDVAPLAYR SAAEASDTLL DNLGLLSSPE
EPPTPVPQRT RCAELWESQQ EAILRNLGDT ERKRLTENAS RLGRSWLSVI PYLQPLRLSN
VEIASGLHDR TLVGSSIPVC RFCGSDSPLG HDELCRARNP WTQRRHNAIN RVIYQHLKQI
QGATVEIEPH TLSGQRRNDL RVRGSSALAF TDYDLKVYSL GDRDARSTVT PCAPNGKLAD
FCLDRCVNWL DKVGQVVSKN APKVTGGVFK PIILSTGGLM SRSTADEWKE WREAMPVGGF
EKMEKRIGVE LVKARARTLV L