Gene CNG00350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00350 
Symbol 
ID3258581 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp83220 
End bp84626 
Gene Length1407 bp 
Protein Length328 aa 
Translation table 
GC content50% 
IMG OID638257648 
Productopsin 1, putative 
Protein accessionXP_571773 
Protein GI58269234 
COG category[R] General function prediction only 
COG ID[COG5524] Bacteriorhodopsin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.534586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGTTATTCAG TGATCACTAG CCAGGATGGA TTACTTCGCA GAGTTCAGGC CTACCACTAC 
CATTTCTGTC AACCCTCCCG GGGGCACAAA CACCCACCAC CCGGGTCCTC ACCATTCTCA
CCATTTGCCC TTGCCCACCG CACCTTTCGT AAGCGCCACT CTGTACTGTC CTGACTGTGA
GAGCTTGTAC TGACTCTGCT TTGATTTTGC AGCCATCAGC TACCCCAAGG TTTTTACATG
CCACTCGTGT CGGCCATGTC TCAGTCTGGG TATTCACTGC ATTGTTCATC GCCGGTTTGG
TGGTTGCATT GGTCTTAACT AGTAGGACTC AGAAGAAGAA CCGTTTATTC CATGGGTGAG
TTCAATCTTT CTATATAACA AAGTGAAGCC ATTAACATGT TTGCTGTAGT ATCTCTGCCG
TCATCCTCAC CGTCTCCGCC TTAACCTACA TGTCCTTGGC GACCCATATC GGATCCACCT
TTGTTCCCAT TTACGGTCCT CCCGGTCACC ACGAACCTCT CGTTCACTTC TTCCGTCAAG
TATTCTCGAT CCGCTACATT GACGCTGCCA TCACTGGTCC TCTCACCATT CTTGCTCTTT
CTCGCTTGGC AGGAGTCAGT CCCGCTACAG CGTTGAGTGC CGCTCTCGCC CAGTTGGTTG
TAGTATACTC TGCTTGGGCT GCAAGTGTCG GTGGCGGTTG GCCGTGGGGT AAGCACGGGA
AAGGTGCGGG GACCAAGTGG GCTTGGTTCG CAGTTGCCGA TCTTGCATTC TTGGCCGTCT
GGACCGTGCT TCTCGCCAAA GGTCGAAAAG GTAAGCTCTT TCCCATTCTG CAGCAGCATT
GTCAATCTTT CACATAGACT GACATCCACT TAGCCTCCGT TCACAGAGCT CGCCCTACTC
AAGGCTTGTT CTACCTTCTC TCCTCCATGA TCATCTTGTA TGTCATCAAC ACTTTCCTTG
CAATCGCCAC TAATCAACCA TTCACAGGAT TCACATCGGC CAAGGCGTCA TCTGGATCCT
CACTGACGGC ATTAACCTCA TCAGTGTTAA CGCTGAGATC ATCAGCTACG GTATCATGGA
CGTTGCCGCC AAGATCGGTT TCACTCACCT TCTTTTGTTG CTTCACAAGA GCGATGAGGA
GGGTCCTTGG ACCTTGCCTG CTTGGTGGGC TGAAGACCCT GAAGGTGCCG GACCCGATGG
TCGAGGTATC TATGGGGCTG TTACCAGCGT TGGTTCTGAC TAGTCCATGA GTGGCAGTTG
GTGGCCGAAA CTGTGACGAA AGAAACCGGT GCTAGGAAAG AGAAGAGAAT ATGTATGATA
GTAGATGAAA ATGAGTGATA ATGTCGCAAT GTACACGAAC AAGAGATGAC TTATGTTCAG
CAATTATGCA GATGTTCCTA TCGTAGT
 
Protein sequence
MDYFAEFRPT TTISVNPPGG TNTHHPGPHH SHHLPLPTAP FPSATPRFLH ATRVGHVSVW 
VFTALFIAGL VVALVLTSRT QKKNRLFHGI SAVILTVSAL TYMSLATHIG STFVPIYGPP
GHHEPLVHFF RQVFSIRYID AAITGPLTIL ALSRLAGVSP ATALSAALAQ LVVVYSAWAA
SVGGGWPWGK HGKGAGTKWA WFAVADLAFL AVWTVLLAKG RKASVHRARP TQGLFYLLSS
MIILIHIGQG VIWILTDGIN LISVNAEIIS YGIMDVAAKI GFTHLLLLLH KSDEEGPWTL
PAWWAEDPEG AGPDGRGIYG AVTSVGSD