Gene CNF03830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03830 
Symbol 
ID3258219 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1121293 
End bp1123461 
Gene Length2169 bp 
Protein Length541 aa 
Translation table 
GC content46% 
IMG OID638257502 
Productepsilon DNA polymerase, putative 
Protein accessionXP_571676 
Protein GI58269040 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGCAGGCATA GCAGAGCGAG ATGGTGAATC AAATGCGTTC AGCGATCGTT AAGGTAAGAT 
TTTTCTCTAT CCTCTTCGGA GAAAGGACTG ATTTTGCATG ATTAGGTCTT TTCGACCAAG
CACTCCTTGA CATTGCCTGC TCCAGCCCTA CATTATATCG AAGAAGTTCT TATAGAGGTT
GGTAGATTCG AGACCTTTTA GCCGGCCTAT GGATCCAGCT AATGGCTCGG TCAGAATGAA
ATTCCCGAGG ACGAATGGAT GGTTGGTCTC GAATTTTGGG CGAAAGAGTA TTTGAAAGCT
GAAGGTACGT TCGTTTGCTG CAGAATACAG CATTACATAC TCACGTATTT CATTTTAGAT
TCATCATCAT TGGTGTCATT GCAAGCTTTA AAAAAGGCCT ACGAAAATCT TCAGCTTGGC
GTGAGCTCTA TACTGCCTGT ACCATCGTAA AATCGCAGAA AGCTGATATG CCCACAGACA
ACAGAAGACA CACAAGTAGC AGACCCTTCT GAAGTGAATG TTGAATCACA CTTCTCTGTC
GTGGACTCTT TTGATATGCC TGCCATGAGA TATGATCCTG TACGGTCAGG TTTTGTTCAG
TGAGTCACAT GATTCCAAGT TTAAAACCAA GACTGATCAA GATGTAGATC CAAGGCTCAA
CCGTCTGTCG CTGGTCAGGC CAGTTCAAGA TCGGCTTTCC TTCGTGAACG TTGGGCCATT
ATCAAAGAGG TCAGTTTTTC TGTGTTCATC CTCAAGCTGT CCTTACGTTG TGCAGATCAT
CCTTCGTAAC GAAAATTTCA CACCCCCTGC TATCGGTGGT CACGATCGTG CCAACTATCT
CAAATTGACG TCAATCCGCA ACCTCTTAGG CCGTGCTGGT CAGCTTTTTC TGTTATTCGG
AATGCTCGCG CGTAATGAAG AGGGCAAGTT GTGCCTTGAA GATGGAGAAA GTCGAGTTGT
TCTGGATATG GAAGATGCTG TTCCCGGTGA GGGGCTGTTC ACTGAGGGGT GTATGGTCTT
GATAGAAGGA GAGTACACAG TGGAGGAAAC GGTTCGCGTG TTGGCTATGG GACACCCTCC
GAGTGAAAGA AGAAATATCG CGAGGTCCTT ACATGGGCAC GTGGACTTCT TGGGAGGTGG
TGCTGTATCT CTGAAAGAAG AGGTGCGGTA ATTTGCGCTG CATTTTGGAA AAAGAACTAA
CTTGAAGTAG CAAAAGTACA ACCCCACAGT GCTTGCCAAC ACTCAGATAT CTTTTGTTAT
TCTGTCTGAT GTCTGGCTCG ATCATCCGAG AACTATGCCT GCCCTGCGCC AGATGTTTGA
AGGATATGCC AACACTGCCG AGTACCGACC GATGGTGTTT GTACTTTGCG GTAACTTCTG
TCAAGGCGGA TGGGAGGGCC AGGAAGGGCT CAAAAGATAT AGCCGAGGGT TTAATTCTCT
TGCAGAGCTT CTTCAATCCA TTCCCCTGCT TCATTCCTCA CATTTTGTCT TTGTTCCCGG
CCCTTCAGAC CCTTGGTCCA GCACTACCCT TCCTCGTCCC TCTCTTCCTT CAGCATTCAC
CACGCGTTTA TCAAACCGTA TACCGAACGC AAGATTTGTC AGCAACCCAT GTCGGCTGAA
GTACTTTGGA ATGGAGATTG TGATCTGTAG AGAGGATTTG ATGGGGAAGA TGATGCGAAA
CTTAGTTGTG GTCAAAGAGG GTGAGGAGAT GAACATGAAG CGATATGTGA GTATTCATAA
CGTCCTTGTC TAGTCATTAC TAATATACAG ATAGCTCGTT CAAACTATTT TGGACCAAGC
ACATCTCTCG CCTCTTCCTA TTTCTGTCCG CCCCACTCTC TGGGAATACG ATCACGCTTT
GCGCCTGTAC CCCATGCCTT CTGCCGTGGT CTTGGCAGAT AAGTACGAAC GATATGAGCT
CACTTACGAA GGGTGCCACG TTTTTAACCC GGGAAAGTTT GTTGGCGGAA TCGGAGAAGA
TGGGTGGGAG TTTGAATGGA GTATGTATTA TCCCGCTACA GGCAGAAGTG AGCGAAGGTG
AGTGTTATTG TCTAGTAGCT TTTAACCCCG AAACTGAATG TGCAACAGTG TCTTGACCAT
GGAATAATTG TTTCAACGTT GATTTGGGCC GCTGTTGTAC TATAAGATCG TCATGCATTA
TAACATATT
 
Protein sequence
MVNQMRSAIV KVFSTKHSLT LPAPALHYIE EVLIENEIPE DEWMVGLEFW AKEYLKAEDS 
SSLVSLQALK KAYENLQLGT TEDTQVADPS EVNVESHFSV VDSFDMPAMR YDPVRSGFVQ
SKAQPSVAGQ ASSRSAFLRE RWAIIKEIIL RNENFTPPAI GGHDRANYLK LTSIRNLLGR
AGQLFLLFGM LARNEEGKLC LEDGESRVVL DMEDAVPGEG LFTEGCMVLI EGEYTVEETV
RVLAMGHPPS ERRNIARSLH GHVDFLGGGA VSLKEEQKYN PTVLANTQIS FVILSDVWLD
HPRTMPALRQ MFEGYANTAE YRPMVFVLCG NFCQGGWEGQ EGLKRYSRGF NSLAELLQSI
PLLHSSHFVF VPGPSDPWSS TTLPRPSLPS AFTTRLSNRI PNARFVSNPC RLKYFGMEIV
ICREDLMGKM MRNLVVVKEG EEMNMKRYLV QTILDQAHLS PLPISVRPTL WEYDHALRLY
PMPSAVVLAD KYERYELTYE GCHVFNPGKF VGGIGEDGWE FEWSMYYPAT GRSERSVLTM
E