Gene CNI03410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI03410 
Symbol 
ID3259452 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp927039 
End bp929375 
Gene Length2337 bp 
Protein Length650 aa 
Translation table 
GC content49% 
IMG OID638258835 
Productiron hydrogenase, putative 
Protein accessionXP_572892 
Protein GI58271472 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.255006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCTTTTCT ACATACCCAT TCGTTCTTGA TTTTACAGCT CCATCTCTTA CTTTCGATAC 
CTCAGTGAGC CGTCCCCGGT AACGATTTCT ACGCTACAAT GGCTTTCTCA GGTGCTCTTG
TAAGTTTTTC GGAAGCTGCT GGTCATTGAC TTTCGCTGAC CCCCCAACAG ACTATCACCG
ACTTGGATGA CTTTTTAACG CCATCTCAAG CATGTATCAT TCCTGTTCGT AATAACAAGA
AGCCAGCGGA GGATGAAGGT CCCGTAAGTT ACCCGGAAAA ATTCAACATG TACTCTATCT
GACGCCATGA TCTACAGACC GAGATTCATA TCGACTCCAA CAACAATTAC TACGAGGTAT
CCACATATCC ATCGGTTGGT CACGACGATG ACATTGGAAA TTCCAAGAAG GCGCTCGAGA
AGGCAGAAAT TAATTTGAAC GACTGTTTGG CTTGCAGGTA TGATTCCTTC TCCTCCGAAG
TCTTATGTCT TGCTGATGTC AGTATGTAGC GGGTGTATCA CTTCCACTGA ATCGCTCTTG
ATCACTATGC AGTCTCAGAA CGAGATTCTC GAGTTTATCA AAACAAACCC CACAGTGGTA
GATCCTGAAT CCCCCTGCCA CAAGCCCCGT TTACCTATCC TTTCCATATC GCCTCAGACC
CTCGCTTCTT TATCGGCTGC TTATGCAACA GCATCCTCTC GACCTCCCAT TCCTCTCTTG
GTGCTTTTAC GGCGTATCCG TGCATTCCTC TCCCAGCCGG AGAAGGGATC TTGGAGAGTA
TGGGACACCA CTTTTGCCCG GCATATGAGC CTGAGAGAAT CTGTGGTGGA GTTCCATGAG
CGAAAGGACA AGAAGGAAAA GGGTAAGGCT GCGGAGATGC CAATGTTGGC GAGTGCTTGT
CCCGGATGGG TTTGTTACGC AGAGAAGGCA CAGGGTGATA TGTTGCCCTT GTTGAGCGCG
GCAAGGAGTA GCCAAGGCAT TATTGGAGCT TTAGCAAAGT CTTGGTATGG TCACAAATTG
CAGCACAAGT GAGTTCATCT CTTATCGTTT AGATGGAGCC GCCCGCTGAC GATTCGGACA
GACCCGATGA GATTTATCAT GTTACAGCTA TGCCTTGTTA TGACAAGAAG CTCGAAGCCT
CACGATCCGA CTTTTATTCT TCCCTCTATT CTACGCGCGA CGTCGACTGC GTTCTCACCA
CTGGCGAACT TGACCTTCTC CTTCAAGAAC TTGGATTCGA CCCCCACGTT CCCATCGCCA
ACGAGTCTAC CCCGTCTTAC TCGGCTACTG AAGACTCGCC ATTCCCTGAA CTCCTAACTC
ACGAAGGTTC AAGCTCCGGC TCCTATCTCC AGACCATCAT TCACGATGTT CAACGCTCTC
ATCCCAATCC TACTCGAATC ATTACTCGTG AAATCCGAGG TTCAACAGAC AATATTGAAT
ACCTTATTCA GGATACTATC ACTGGCCAAA TCGTTTTCAA GGGTGCCAAG GTTTATGGAT
TCCGTAATCT TCAAAACCTG GTCCGAAAAG TGGCAAAGGA GACGGGTATC GGTCGTTCCG
GTCGAGGTGC AGGTGCCGGC AAGTTAAGTG CTGCCGTCGC AGCCCGTCGG CGAAAAGCCA
AGACGGCAGC ACCAGCAGCA ACTTCTGCTA CAACTTCTGC CGAAGGGACA GATGTAGAGA
GCATTGCATC TCTCAGTCTA GTTAGCGGAG AGGATAAGAA GCTCGATTTC GTAGAGGTTA
TGGCTTGTCC TGGAGGATGC GTGAACGGAG GTGGTCAGAT GAAGCCAACT GTGCCTACAC
CCTCGGCTCC GGAAGCGATG GAAGTAGATG AAGAAGGGTA CCAGCGACCT CTTCCTGACG
ATGGTGTAGC CGTCGCTGTC AACGGGGGAA GTAATAATGT AGGTACTGGG ACGGTTGCAG
GGATGGAAGA AGGGATGCGG TGGTCAACGA AAGAATGGGT TGCCAAGGTA GAAGATATCT
ATTGGACAGG TCTACCTACC CCTCCGGCGT CACCCCCACT TACTGCTTCA AACGTCAATG
GCTTTGCGCC TCAAGTCAAG ACAAATGGCA CAACCAATGG CCAAGTCAAT AACGGCGTAG
ATCGTAATCT TCAATCTGAC CAGCTTGCCG AGGAAATCAT CCATGAAGTC TGTGGGGACG
ATGCGTCTAA GCGATGGGAT TTTATGAGGA CGAGGTTTAG AAAGGTGGAG AGCGATGTTT
TGTCATCAGG AGGGGTTACC CACGAGGCGG TCAAATGGTA GAAAGAGGAC TTGTAATAAT
CACATTCTGT ATCGGCATTT CTGGCTTCTC GAGGGCATGT GTGTCATATC TTGTACG
 
Protein sequence
MAFSGALTIT DLDDFLTPSQ ACIIPVRNNK KPAEDEGPTE IHIDSNNNYY EVSTYPSVGH 
DDDIGNSKKA LEKAEINLND CLACSGCITS TESLLITMQS QNEILEFIKT NPTVVDPESP
CHKPRLPILS ISPQTLASLS AAYATASSRP PIPLLVLLRR IRAFLSQPEK GSWRVWDTTF
ARHMSLRESV VEFHERKDKK EKGKAAEMPM LASACPGWVC YAEKAQGDML PLLSAARSSQ
GIIGALAKSW YGHKLQHKPD EIYHVTAMPC YDKKLEASRS DFYSSLYSTR DVDCVLTTGE
LDLLLQELGF DPHVPIANES TPSYSATEDS PFPELLTHEG SSSGSYLQTI IHDVQRSHPN
PTRIITREIR GSTDNIEYLI QDTITGQIVF KGAKVYGFRN LQNLVRKVAK ETGIGRSGRG
AGAGKLSAAV AARRRKAKTA APAATSATTS AEGTDVESIA SLSLVSGEDK KLDFVEVMAC
PGGCVNGGGQ MKPTVPTPSA PEAMEVDEEG YQRPLPDDGV AVAVNGGSNN VGTGTVAGME
EGMRWSTKEW VAKVEDIYWT GLPTPPASPP LTASNVNGFA PQVKTNGTTN GQVNNGVDRN
LQSDQLAEEI IHEVCGDDAS KRWDFMRTRF RKVESDVLSS GGVTHEAVKW