Gene CNG03750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG03750 
Symbol 
ID3258888 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp1051236 
End bp1053050 
Gene Length1815 bp 
Protein Length427 aa 
Translation table 
GC content47% 
IMG OID638257998 
Productconserved hypothetical protein 
Protein accessionXP_572080 
Protein GI58269848 
COG category[S] Function unknown 
COG ID[COG3268] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATACTTTTCA TTTATATACC ATGTAGCCAT GAGTCTATAT CATACTCATT CATCTCTGAC 
CCAACAACTC AGCTTATTGA ACGAAAATGG CCCAAACAAC AAAGCCGGTA CTCGTCATCT
ATGGCGCGAC AGCGTACACC GCTCAGCAGT TATTTACGTA CCTTGAGGAG CACCCTGAGG
CAGGAGACTT TGACTTTATC CTTGCTGGTC GTAACCAAAC CAAGCTTGAC AAGTTGAATG
GGAGTCTCAA GATACAAAGA GAGGTTATCG CCTGCGAGTT GAGTGACGAG GAAGGGGTTG
AGGCGATGGT CAAGAGGGGA AATGTCATAG TCAATTTCGC TGGTAAGTAA CTCTTGATTT
CACCTTATGC ATTCCAAAAT GTTGATCTAA TTGTTAGGTC CTTACCGATG GCACAATGCT
GAAGCCATAA TTCGGTTCGT TTATCTCTTA CCTTGGCCAA CACCACTTCA CAAAGACTGA
CCAATACCAA AGTGCATGTT CCAAGGCTGG AAAGCACTAC ATCGACCTTT GCGGCGAATC
TGCATGGCTG GCCAAAGACA TCATTCCAAA GTACCATTCG ATCGCCAGCT CCACGGGGGC
CTGTATTGTT CCTTCCTGTG GTTTTGATTC TGTTCCTTCG TGAGCAACTT TATCTTAGTT
GATTGGAATC AAGCTGATTG GTTGATCTAA GAGACCTGAT CGTGCATCTT GCTAATCAAA
CCCTCCAAAC GGTTAGACCC GGATCCACAT TGGCTGACTC GACCTCCATA TTCAAAGTGA
AAGGCACCAT CAGCGGTGGA ACTGTCCAGT CTATGATCAC CCTTACCGAG CTTCCCAAGG
AAGAGCGAAG GGCCGGTGAA TTCACCCTCT GTCCTGGTAG TAAGTCGATT CCCCTCTTCG
GCGACCTCAT CAAGCTCAAT TGTGACAGTC CAACTCCCGT CCACACCTCC TGCGCTTACC
TTCTCCCTTC CTTCTACCCC TCTTACTCCC GCTCGTTTTG CATCCTTTTT CTTCATGTAC
GTCTACAATC GAACTGTCGT CCGCCGTTCC CAGTTTCTCT CTGGCGCCTT GTCTACAAAG
TCCGGTGGTA AGGTTATGAA GTACGCTGAA GGCTTAGACA TTGGATATGG TAAATTCGGG
TCAGCCTTAG CAACCATCGG GATGATGGTT TTCGGAGGCC TGTTTTTCGG TTTTAAATGT
GTAAGTAAAG AATTTCCTGA CATGTGTAAC GTATCTAACT CATTGCCGCA GCTTAGGAAC
ATAATCCTTC GATACTTGCC CAAGCAGGGA GAAGGAGCTC CCCTGGAGTA GGTCATGCAC
CTGTGACAAC ATGCGACCTG TGAATGCTAA TAACTTACGA TCGTAGGCAG CTGAAAGCTG
GTCACTACCA AGTCACGAAC CTTTCCACTG AGGAATCCTC CGCTCCCGAC CACAAGCCTG
TGAAGATCCT CACAAGGTTC GACGGTGAAG GCGACCCTGG ATACCTTAAC ACTTGCTGTA
TGTCCTTTTC AACTTCTTAC TATCCGCATC GCTAACAACA TCTGACAGAC TTACTTGCTG
AGTCCGCCTT GGCCCTGGTT CTGCCTGCCC CCAAGGGCAC TTCCCGTCCA CCTCTAGCCA
AGGCTGGTGG CCTTTTAACC CCTGCGACAG CTATGGGTGA TGTTCTTATT GAGCGATTGA
GAAAGAGTGG CAAGTTCCAG ATTACCAGCG AGGTGTTGAG TGAGGAGAAG AAGAAGGATA
TCTAATTTCT CAATATTCTT TTGATGAGGA GTTTTAGGTT TTCCACATTC TATGTTTTAC
ATATGGTAAT AATAT
 
Protein sequence
MAQTTKPVLV IYGATAYTAQ QLFTYLEEHP EAGDFDFILA GRNQTKLDKL NGSLKIQREV 
IACELSDEEG VEAMVKRGNV IVNFAGPYRW HNAEAIIRAC SKAGKHYIDL CGESAWLAKD
IIPKYHSIAS STGACIVPSC GFDSVPSDLI VHLANQTLQT VRPGSTLADS TSIFKVKGTI
SGGTVQSMIT LTELPKEERR AGEFTLCPGI QLPSTPPALT FSLPSTPLTP ARFASFFFMY
VYNRTVVRRS QFLSGALSTK SGGKVMKYAE GLDIGYGKFG SALATIGMMV FGGLFFGFKC
LRNIILRYLP KQGEGAPLEQ LKAGHYQVTN LSTEESSAPD HKPVKILTRF DGEGDPGYLN
TCYLLAESAL ALVLPAPKGT SRPPLAKAGG LLTPATAMGD VLIERLRKSG KFQITSEVLS
EEKKKDI