Gene CNM01870 details

Gene Information

Locus tag: CNM01870
Symbol:
ID: 3255155
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006682
Strand:
Start bp: 569193
End bp: 571206
Gene Length: 2014 bp
Protein Length: 478 aa
Translation table:
GC content: 47%
IMG OID: 638254341
Product: streptomycin biosynthesis protein StrI, putative
Protein accession: XP_568475
Protein GI: 58262130
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
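
The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record; the function name gene_length is only illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed for CNM01870.
print(gene_length(569193, 571206))  # 2014, matching "Gene Length"
```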


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATAACATATT CCCCTAGAAT AAGATAAGGC CAAACACAAC TATCGCCTTT AGTGATTAAA 
ATATCGAAAT CCAACGCGAC ATGACGGCTC AAGTTCGGAC ACTCCCCACC GAATCCACAT
CTCTCTTCTC TCCAGCTGCT GTACTCACCG AAATAGATGA ACCTGAGTTT AAGAGACCCA
AAGTCAGCAA CAGCGATTCT CTCCAGGATC TGGCGGAATC TAAGACTACA AAACATGTGA
CTGTAGCTGT CATTGGCGCT GGTCAACGAG GTCTGGTACG TTTACATTAA AGTCTCGGCG
TATAGGGCAA GAAGGTAAGA AACGCTGACT GGTTCGTTGT CGGTAAAGGT ATACACTTCA
TACGCTCTTG AGCATCCTGA GCTGGTCAAG GTTGTTGCCG TAGCTGAACC CCGTGCACAC
AGGCGAAAGG TCATGTCCCG TCTTCATTCG TGCGTCTCTC ATCTCTCATT TCTTTCCCTT
TCCGCTTTGA AGCTCACTCC TGCACCTATC AGAGTACCTC CAGAGAACCA GTACGCCAGC
TGGGAACCTC TCCTTGCCCG AGGCCGAATT GCCGACGCCC TACTCATTAC TGTCCTTGAC
GATCTCCACG CTGAGCTGGT AAGCGCTTTC GCACCCCTAG GCTACCACAT ACTATGCGAA
AAGCCTATGG CAACCTCGGT ACAAGATTGT GTGAAGATGG TGAAGGAGGT GGAACTGTCT
GGAGCAGGAA TTTTTGGGAT TGGTCATGTT TTGAGGTATT CGCCGTATAA TAGGGCTGTC
AAGGAGGTTA TTGATTCTGG AGTATTGGGG GAGATTGTGA ACATCCAAGT GAGTTGGGAG
ATTATATTAG ATGGGAATAT TTTGATGAGA ATGTGACTGT TATATCGATC CATTACAGCA
TATTGAGCCT GTAGGAAACG TATGTCGCCG AAATTGATCA CTTGTCTAGC TCATGACTGA
TACTTTCTGG TAGCAACACT TTGCTCATAG CTTTGTTCGA GGCAACTGGA AGAAAGAGTC
TGAATCCACT TTTGCTCTTA TGGCCAAAAG CTGCCAGTAA ATCCCATTCG CCTCTACATT
TACCATTAAG CAGATAGCTG ACATGGCATC TATAGCGACC TTGATATCCT CTCATTCTAT
CTCTCCGGTC TTGAACCTCG GAAAGTTCAT TCCTTCGGCT CTATACACCA CTTCAAGAAC
TCGAAGAAAC CAGCGGAAGC GGGGGATGCT AAAAGGTGTT TAGAATGCGC GTTCGAAAAG
GACTGTGTGT GGAGCGCAAA GAAAATCTAT ATTGATGGTT TAAAGGATGA GGGACACAAG
GTGAGACCTA TTTTGCGAGC AGCAAAGCCA GGATGTGGTA AGAGCAGAGT TGACAAAGGA
CGGAAAGTGG GCTCAACACA TTGTTGATGC GGATGTTCTC GATATCGAAA ACGTGACTGA
TGCGTTGAAG ACTGGTCCTT TTGGCGTTTG TGTCTACGAA GCAGGTAATG ATGTGGTGGA
CCATCAGGTA GTGAATATCG AATACGAAGG AGGAGTCACC GCGAGTATGA CTATGGTTGC
TTGTGAGTAA TCGTCTCCTT GGGATCCAGC CCTGAGTTCG TCTTACACGT TGATATGTAG
TTACCGAGGC CATTTGTGAT CGAGGTACTA GGATCCAGGG GACCAAAGGT GAACTGATCG
GCAATATGGC TTCCTTTGTA AGCATGTGCC CTCACTCCTG CCTCCTACCA TCATTAACAA
AACCTTTTTG CATTGAAAGA CTGTCTTCGA TTTCCTCACT CGTACCAAAA CTCAGCACAC
TCCCAAGTCG CTCCCAGGTA ACCATGGTGG AGGCGACGCA GGTCTTTCAG AGACATTTTT
TGAAGCTGTT AGCAAGTCTG ATCAGAGCGT ACTGGGTGTA ACACCGGAAG AAGTATTGAA
TTCGCATTTA CTTGCGTTTG CTGCTGAGCA GGCCAGAAAA GAGGGGAGGG TGGTAGATTT
CGCAGAGTTT AAAGATAAGG CTATGGCGTA TTAA
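
The gene sequence is printed in blocks of 10 bases, so it has to be cleaned before its length or GC content can be checked against the table. A minimal sketch follows, using only the first line of the sequence as sample input; clean_sequence and gc_fraction are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, used as a small sample.
first_line = "ATAACATATT CCCCTAGAAT AAGATAAGGC CAAACACAAC TATCGCCTTT AGTGATTAAA"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq), 1))
```

Applied to the full gene sequence, the cleaned length should be 2014 bases, and the GC percentage can be compared with the reported 47%.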
 
Protein sequence
MTAQVRTLPT ESTSLFSPAA VLTEIDEPEF KRPKVSNSDS LQDLAESKTT KHVTVAVIGA 
GQRGLVYTSY ALEHPELVKV VAVAEPRAHR RKVMSRLHSV PPENQYASWE PLLARGRIAD
ALLITVLDDL HAELVSAFAP LGYHILCEKP MATSVQDCVK MVKEVELSGA GIFGIGHVLR
YSPYNRAVKE VIDSGVLGEI VNIQHIEPVG NQHFAHSFVR GNWKKESEST FALMAKSCHD
LDILSFYLSG LEPRKVHSFG SIHHFKNSKK PAEAGDAKRC LECAFEKDCV WSAKKIYIDG
LKDEGHKDGK WAQHIVDADV LDIENVTDAL KTGPFGVCVY EAGNDVVDHQ VVNIEYEGGV
TASMTMVAFT EAICDRGTRI QGTKGELIGN MASFTVFDFL TRTKTQHTPK SLPGNHGGGD
AGLSETFFEA VSKSDQSVLG VTPEEVLNSH LLAFAAEQAR KEGRVVDFAE FKDKAMAY
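
Because the gene is marked as spliced, the gene length is larger than the length needed to encode the protein. The following sketch estimates the difference, assuming three nucleotides per amino acid plus one stop codon; the helper name coding_nt is illustrative only.

```python
def coding_nt(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_bp = 2014      # "Gene Length" from the record
protein_aa = 478    # "Protein Length" from the record

cds_bp = coding_nt(protein_aa)
noncoding_bp = gene_bp - cds_bp
print(cds_bp, noncoding_bp)  # 1437 577
```

The remaining 577 positions would correspond to introns and any untranslated sequence included in the gene coordinates.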