Gene CNB02140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB02140 
Symbol 
ID3255573 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp625418 
End bp627673 
Gene Length2256 bp 
Protein Length715 aa 
Translation table 
GC content49% 
IMG OID638254865 
Productmetallopeptidase, putative 
Protein accessionXP_569144 
Protein GI58263468 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0384968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTGA CAGATCGTAC TCTGATTCAG GCCTCAGCGA TCGTCCAACG CATCGTCGTG 
GCTCCCCAGG ATCCCACTGG TCGAGAACTA AGGCTTGTGG TGAAGAACCT AGATAGGCTG
AGCGATATAC TTTGCGGAGT GATTGATATG TGCGAACTCA TTCGGAATGT CCATCCCCAC
CAAGACTGGG TGAATCAGAG CGATCGGACC CACCAGATAC TATGTAGCTT CATGAACGAG
CTCAATGCAA CCCGGGGTCT TTACGAGGTG AGCCCTCAAG CCATATCGGT AAATTAGCGG
GCTGACTGTT TGACCCATCT TCAGTCACTT GCAAAGGCGA TTGCGCATCC TTTCAATGAC
CCATTGACTA CTTCAGAGCT TAGGGTTGCC CGTATTTTCC TCACAGATTT CGAGCGATCA
GGTATACATC TTCCGCCCTC TGTTCGCGAA AGGTTTGTGA AGCACTCGGA CGCTTTACTC
TTCCTCGGTC GTTCCTTCCT CTCTTCCGCA TCATCGGGCC CATCCACAGT TCCCCATATA
GAAATCCCCG ATCCTCATCG CTTACTTACG GGACTGGGTC GCCAGTTTGT TGATTCATTA
CCACGAACAG GTCGAAATGG ACAGGCTGTT ATCGAACCTG GAAGCTGGGA GGCACAAATG
ATCCTGAGGT ACGCAAGAGA GGGCCGGGCG CGAGAGCTGG TGTACGTCGG CGGAATGAGG
GCCGACAAAA AGAGAATTAG CGTGCTGGAA GCGATGTTGA AGGAAAGAGC TGAACTAGCC
AGCGTCCTTG GGAAGAACAA TTGGGCGGAG GTTGTTCTAG TCGATAAGAT GACAAAAACA
CCGGAAAATG TGATGCGCTT CTTGACCTCC CTCGCTCAGC ACCATCAACC CGTTGCTAGA
GCAGAAGTGG ATATGTTAAG AAGAATGAAA GCTACTGCTC TGACTGGAAA TTACTTTGAC
CCACGAAATT CTCGGACACG ACATCTTCCC CTGTTCCATG CCTGGGATAG GGATTATTAT
AGCGACAAGT ACCTTACATC CCTCATTCCT ACAGGTTCGC CGCCTTCTAT TTCTCCTTAT
CTTTCAACTG GCACAGTGAT GTCAGGCCTT TCCCGCATCT TCTCAAGACT TTACGGTATC
TCCTTCAAAC CAGCTGTCGT CTCACCTGGA GAAGTTTGGC ATCCTTCTGT CCGGCGGCTG
GATGTAGTGC ACGAAGAAGA AGGGCTCATT GGTGTCATAT ATTGTGACTT TTTTTCTCGC
ATTGGAAAAT CTTCTGGAGC AGCCCATTAC ACTGTGAGAT GCTCAAGAAG GGTGGATGAT
GACGATATAG ATGGTGATGG GTTACCAGAA GACTGGGATA AACCATATGG CCCTGGATTA
GAAGCTGATA AGGAGTCTTT ATCAGGCAAG CCAGGAAAAT ACCAACTGCC TATCATCGCA
TTGTCAATGG ATGTCGGTAC AGTGAATGAA GGAAGACCTG CGCTCTTGAA TTGGCAGGAA
TTGGAGACTT TGTTTCATGA AATGGGACAT GCAATCCATT GTCAGTTGCT ATGCTTAACC
AGTAAAAATA TAACATTGAT GTTCTTCTTA GCCATGATTG GTCGGACAGA GTACCACAAT
GTTTCTGGAA CAAGATGTGC CACGGATTTT GTGGAGCTTC CTTCAATACT GATGGAGCAT
TTTGTTTCAT CACCAGAAGT CCTCAGCACT TTGGCGTTCC ATCATGCCAC CGGCGAACCT
CTACCTATCC CCGTTATCGA GGCCCATCTA GCTCTCAATC AGTCCCTAAG CGCCCTCGAG
ACTCATGGAC AGATCGCAAT GGCTCTTTTG GATCAGAAAT ATCATACGCT ACGTCATGGA
CAGGATTCTT TTGATTCTAC TGCTATTTGG TTCCAACTTC AGCAAGAAAT AGGAGTCATC
CAACCAGTGC CCGGAACAGC TTGGCAAATG CAGTTTGGTC ATCTGTACGG ATATGGAGCG
ACGTATTACT CTTATCTATT TGACCGCGCC ATTGCGGGTA AGATATGGTC CACCTTGTTT
CATCGCTCGG GGACCTCCCA AGCTTATGAC CGAAAGGCTG AAGGAATACT GAGTAGGGAG
GGAGGAGAAT TGTTAAAAGA GAAAGTCCTA AAATGGGGTG GAGGTAGGGA TCCATGGGAG
ATGGTAGGCG ACGTGATTGG GGGCGTAGAA GGTGATGAGT TAAGTAAAGG AGATGAGAGG
GCATTGGCAC TGGTTGGAAG CTGGAGTGTC GTATGA
 
Protein sequence
MRLTDRTLIQ ASAIVQRIVV APQDPTGREL RLVVKNLDRL SDILCGVIDM CELIRNVHPH 
QDWVNQSDRT HQILCSFMNE LNATRGLYES LAKAIAHPFN DPLTTSELRV ARIFLTDFER
SGIHLPPSVR ERFVKHSDAL LFLGRSFLSS ASSGPSTVPH IEIPDPHRLL TGLGRQFVDS
LPRTGRNGQA VIEPGSWEAQ MILRYAREGR ARELVYVGGM RADKKRISVL EAMLKERAEL
ASVLGKNNWA EVVLVDKMTK TPENVMRFLT SLAQHHQPVA RAEVDMLRRM KATALTGNYF
DPRNSRTRHL PLFHAWDRDY YSDKYLTSLI PTGSPPSISP YLSTGTVMSG LSRIFSRLYG
ISFKPAVVSP GEVWHPSVRR LDVVHEEEGL IGVIYCDFFS RIGKSSGAAH YTVRCSRRVD
DDDIDGDGLP EDWDKPYGPG LEADKESLSG KPGKYQLPII ALSMDVGTVN EGRPALLNWQ
ELETLFHEMG HAIHSMIGRT EYHNVSGTRC ATDFVELPSI LMEHFVSSPE VLSTLAFHHA
TGEPLPIPVI EAHLALNQSL SALETHGQIA MALLDQKYHT LRHGQDSFDS TAIWFQLQQE
IGVIQPVPGT AWQMQFGHLY GYGATYYSYL FDRAIAGKIW STLFHRSGTS QAYDRKAEGI
LSREGGELLK EKVLKWGGGR DPWEMVGDVI GGVEGDELSK GDERALALVG SWSVV