Gene CNF01040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01040 
Symbol 
ID3258208 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp317624 
End bp319510 
Gene Length1887 bp 
Protein Length471 aa 
Translation table 
GC content49% 
IMG OID638257227 
Productendopeptidase, putative 
Protein accessionXP_571253 
Protein GI58268194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTTT CCCTCGCTCT CCTACCGCTC ACCCTCGCAG CAACGCTGCC TACCGTCCCT 
TCTTCCTTTC ACCTACCATT GAAAGTGCTC GAATCTCGCC AGCACTCTGA TGATATCTCG
CAACGTCATG CATGGTTGAT CAGCCAGGCA AAGTCTATGC GGGCAAAGTA CGAGCCATAT
CTCAACGAGG AGGGTAAGGA GTTGTTAAAG AGGGATAGAA TGGAAAAAGA AGGACAGATG
AAAAGAGCAC AGCAATCTAT AGAGTGAGTG TTGAATGATT TCATGCTACT TACCTACTGA
CCCTACACTC TAGCTTGATC GATATTGGCC TTGACGGTTT ATATACTGCT CCTATAGAGA
TTGGGTGGGC TACGCTGAAA CTTTGCTTGA CTGTCAATTT GTCCGCTGAC TTTTGCAGCA
CTCCGCCTCA GCAGTTCATT GTTATCATGG ATACTGGATC ATCTGACCTG TGAGGAACTG
AATCCGACGC ACTCTACGTC GCTTCTTTTA TAGAACACTG ACGACGACGG CTTGTGTAGC
TGGGTGGCCG AGAAAGGATG TACAGCCGAT TTCTGTTCCA AGACCTATAC CTTTGATGCC
AGCAATTCAT TCACATTCGA AACTGAAAGA AAGCAATTCA GCATTGCTTA CGGATCCGGT
AATGCGGGAG GCTATTTGGC AAATGACACC GTCTCGACTG GGGGGTTCAC AGTCCGTGAA
CAAGCCTTTG GTATGTCCCG ACCCTGAAGT TAGAACAAGT ACGCTAAATA TATCTCTATA
GCTGTGGTGA CTCAAGCTAT TGACGGTCTG ATCGATTATC CCACATCAGG TCTTTTGGGT
CTTGCGTGGA ACACCATTGC GTCCGCTTAC TCAACCCCCT TCTGGCAAGC CTTGGCGTCT
TCTGGTGCTT GGGATTCACC AGAAATGGGG GTTTACCTCG CCAGATTCAA AGGAAATAGC
AGCGCCCAGA AGATTGAGTT TGATGGGGGG CATTTTATTT TTGGGTAAGG ACCAATTGTT
GTGAGAGAAG GATGGATGAT GCTAAAACTG CCTGATAGCG GGGTCGATTC AACCAAGTTT
GAGGGTGATT TGAATTATAT TCCAATCGGA CCGGGAGAGA GGGATTTTTG GAAACTCCCT
TTGGAGGGTA TGACAGTGGG TGGGAACCCT GTCGATGTCG TCAGTCTTTC CAAGGCTCGT
TCGCCATACC GCTCTAATCT TGCCCTCAGT CTAAAGGCCT CCTAATCGGC CGCAAGGCCC
CTGCTTGCGC TATTGATACT GGTACTACCC TCATCGGCGT GCCAACTGAC ATGGCCGCTG
CGATTTATGC CCAGATACCC GGCTCTTCCA AAGTGCCAAG CTCGGTTATG GGGCAAGGGG
GATATTATCA GTACCCGTGC GATACCAATG TTGACGTGAA ACTCAAGTTT GGCGGAGTAG
AGTATGGAAT CTCCAACACG GACATGAATT TTGGGACTTT TACCGACGAT GGCAAGACCT
GTATTGGAGC GTTCTTTGGG CAAGATGTGT AAGCTATTAT TACATTTTCT CCGCCTGTCG
CTTTAACAGG AAATAACACC ATGACGTGAA TGTAGGTCTC CGAGATCCCC AGTCCAGTGG
ATCGTCGGTG CCGCATTCCT CAAAAACGTA TACACTTCAT TCCGATATGA ACCCACAGCT
ATCGGGTTTG CACCTTTGAC CACTAACGTT ACCCTCCTTC ACAACAAAAG AATGCCCGTC
ACCTCGGAAG AAGCGACTAT TAATGGGACC ACTTCAAGCT CTTCCAGGGG GGCACAATTG
ACCGGCAGAG TTGATGTGCA GCAGAGGGGT GTATTGTGGG GCGTGCTGGG GCTTGGAGCT
ATGCTCGGTC TTGCGTTGGC CCTTTGA
 
Protein sequence
MLVSLALLPL TLAATLPTVP SSFHLPLKVL ESRQHSDDIS QRHAWLISQA KSMRAKYEPY 
LNEEGKELLK RDRMEKEGQM KRAQQSIDTP PQQFIVIMDT GSSDLWVAEK GCTADFCSKT
YTFDASNSFT FETERKQFSI AYGSGNAGGY LANDTVSTGG FTVREQAFAV VTQAIDGLID
YPTSGLLGLA WNTIASAYST PFWQALASSG AWDSPEMGVY LARFKGNSSA QKIEFDGGHF
IFGGVDSTKF EGDLNYIPIG PGERDFWKLP LEGMTSKGLL IGRKAPACAI DTGTTLIGVP
TDMAAAIYAQ IPGSSKVPSS VMGQGGYYQY PCDTNVDVKL KFGGVEYGIS NTDMNFGTFT
DDGKTCIGAF FGQDVSPRSP VQWIVGAAFL KNVYTSFRYE PTAIGFAPLT TNVTLLHNKR
MPVTSEEATI NGTTSSSSRG AQLTGRVDVQ QRGVLWGVLG LGAMLGLALA L