Gene Information |
Locus tag | CNI02830 |
Symbol | |
ID | 3259562 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 771324 |
End bp | 774371 |
Gene Length | 3048 bp |
Protein Length | 808 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258774 |
Product | nuclear pore complex protein, putative |
Protein accession | XP_572655 |
Protein GI | 58270998 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
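The Start bp, End bp, Gene Length and GC content fields above are related by simple arithmetic. The sketch below shows one way to recompute them. It assumes 1-based inclusive coordinates, which is consistent with this record (774371 - 771324 + 1 = 3048) but is not stated by the source. The function names gene_length and gc_percent are hypothetical helpers for illustration only, and the database may compute or round GC content differently.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks and any other characters are ignored, so the
    10-base blocks used in the Sequence section can be pasted as-is.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Gene Information table of this record.
print(gene_length(771324, 774371))  # 3048
```

Passing the full Gene sequence field to gc_percent gives a value to compare with the listed GC content of 49%, subject to the rounding assumption noted above.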
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCATAACAA GAATCACAAT CATTCGGCAA ATTGTATCTG GAATCGAACC ATTTCTCAAC TCCAAATCAT TAGGAAATAG GACACAAGAT GGTAGCCGAG AGAACGCCCT ACTCATCTTT TGCGAGCCTG CTTGCGGCAT ATCAAGAACA ATACTCACAA GCTGGCCCTT CAACGCAGCT TGACTCGCCT ATAAGTCATG AGGCTGTTCT TGACGAGCAA AGCGGATTAA TTATTTCTCT CATGGAATCG CTCGAGGAAA CGTGAGTGGA TAATTATGCT ACTGCACTAT TAAAATACTA AGGTGGTCAC AGCATTCGGT CAAAAACACC GACAAACTGC TCGTCATTCG AGATGGACGG TGAGGAGCAG ATATCAGAAG ACGAATACAA GGCGTTATTG TCAGAACATC GAGCGTGGCA GCTTATTCGT GCGGTGTATG AGTAAGTATT GCTTAGTTTA TTTGATCTCA TGTTCTAATA AATGTAGCAA CCGGATACCC CGCACAGATC CAAACTTTGT TCCTCCATCT GCTGCACAGC AGATCATCGA AAATCCCTAT ACAAGTCCGG AAGATCTCGT GCAGACAATG GTCATTGAAG ACCCAGAATT ATCGCTTTGG GCTGTAAGTT CTTGAAGCAA GCGAAAGGTT AATTACTAAT TATATTAGAC ATTGGTCGAG CATCTCCAAA CTCGCCCTCT ATTAACTTCA CCGCCGCCTC TCGAAGCCAG ACATGGCTAT CTGCCATCCA CTGTCCGCCG TTCTAAATTC CATTCGACCA CAGATTCACC TTCACTCGAT CCCGATTTTA CCGTTCGTGA TCCTCACAGT TCTTCTCTCG CGGGCGAAGA CCAGACTTAC CAACTGCCCC TCCTCGAAAC ACTATACAAC CTCGTCCGAT ATGGAGAGCT TGAGTCAGCG ATCAAAGTAT GTGAACAGGG GGGTGAGCCA TGGAGAGGTG CGAGTTTGAT GGGTGTACGG CGGTGGACGA TGGGGGGCAT GAGTAAAGGA ACTGAGCCAA CCGTGATGAC TGGAAACCGA TACCGTGCCT TGTGGAAAAA GTCTTGCCGT ACGATTGCCA AGAACCACAC CCTCCTGCCT GCCGAGCGAA ATCTCTACGC TGCTATGATC TCGGATCTAC CTACGCTCTT ACCGGCATGT GAGTCGTGGG AAGACTATCT TTGGGCGCAC GTGCAGCATA GGATCGAAGC GAGATTGGAG AAGAGATGGC GTGAGCTTGG AGGATTTTGG GAAGGCGAAG GGGGGGTAGG AAAGGATGAC GTTGAAGAAG TGGAGATGGC CAAAGGGGGA CTTGAAGAGG TTTTTGCAAG TATGAGAAAT CTACAGAATG CCTCAATTTC GTAAGTCTGA ATCGTGCATG CGTTCGCCTG GGTTTCTGAC TGAGAAGGAC AGTATTACCA TGACCGAACC GTACCATGTC GCTCAGCAGA TGATCCTGCT CGATCGAACA GAAGCCCTCT TCCACGGATT TGCGGATCAA CTTCTGGAGC TCGAATCTGG GGTTTCTCCA GAGTGAGTGA ACTCCTATCA TCTGTCCCAC CATCAATTAA AAGCGCATAC GCTGAGGTCC TTCGTGAAAA GACTCATAGC ACCTCTTCTC CGCTTTTTCA CTCATCTTGC TCTCATCCTC CGTACCCTTT CTCAACCCGT TCCTGTTTCA GCGGCCAATG CTATCATTCA AGCCTATCTG CAAATTCTCG AGCGAGAAGG CAACGATAAG CTCGTCGCCA TGTATGCGGC TTGTCTGAGG GAAGGTAGTG GTGAAGAAAG TTATGCGCGT 
TTCTTATGGT GTATGTCCGA GGGTTTTAAT CCCTCGATCT GGGTTTCTCG TTCCTGTCTA TGATGCTGAC TTAAATACAG CGATGGACCC TTCTGCCGGC AGAGACTCCA GGTCAGAAGC GCTTTTGCGA GCCAAGAAGC ATAACCTCGA TGTGGCGCTC ATCGCTCGCG AAACTGTCCG TCTTTGTCTT GAAGAAGTGG TCGCTGTAGG TTCTTATATC TTTTTCTCTT GGCGCACAAT GATTAACCTA TCACCTTTGT ACAGGACCCT CCCAAGAGGC TTCTCGCTGA ACCCGATATT GTTCCCATTT CTATCGGTTT GACGGAACAT GATGTCGTGC TTATTAGATC TATCGAGTGG TTAATCATCT TGCCGGAAAC GGCGGATGAT GCGCTCGTGC GATCCTGCCA GCTCGTTCGA TACTTCTTGT GTATGTTCAT TCTTGTAACA ACACTTAAGC AAAACTACCT TAGCTGACTA AGAACATGTG TCTAGCCAAG GGTCAAGCTA ATGCCGCGCA ATCTCTTCTT CTCTCCCTTC CTTCCCTTCC CACTTCCTCC TCTTTAACCA ATCAAAGTCA CCTCCTCGAA CTCGCCTCCT ACAATCGGCT CTTCTCTCTC TTCTCGTCCC ACACTTACTT TGCCGACATC CTGTTCCGCC AACCTTCGCT TACTGCTAGC AAACTTGAAG TCCACGCTTG GAAGAAGGAT CTTGAGGCCT GCGTGGAAGA TGTCTGGAAG GGTACTGTGG GGTTAATCAA GGAACGGTGG CTGGATTTGC CTAGCTTGTC TCCCAAGGTT GAAAAGGACG GCAAGGGGTT GTACGAGTAC GAAGGGGAGG AGGAGAGGCA GAAGCAACTC AAGCTTATAC GACATATTTT CATCCCCGAC CTCGTCCTTC GCTTGCACAA CACTCTCATC GAACAATCTG TTTTATTCCC CTACTTTTTG CAGAGGGCCC TTGAACTGGC GAGCATAGTG GCGGATGGGA GATATAGAGT GTACGAGGGC TTTCTGCCAC TGGCGACGAT TGCTGGAGGT GCGGAAATGG TTGGGGGTGG CGAAAAGGGT GTCAGCCGGC TTGAGGTGTA CATGGACAAG ATCAGGGAGG TGGCACTTGA GGTGTTAAAG TCAGGAAATG GGAACGCTTT CAAGGTCAGG AGACTGGCAT GAGAGAAAGG ACAACGTATT TAAGTCGATT CCACATATGA CAATGCCAAG GATATAGACA ATGAAGACAG ATGCAAGA
|
Protein sequence | MVAERTPYSS FASLLAAYQE QYSQAGPSTQ LDSPISHEAV LDEQSGLIIS LMESLEETIR SKTPTNCSSF EMDGEEQISE DEYKALLSEH RAWQLIRAVY DNRIPRTDPN FVPPSAAQQI IENPYTSPED LVQTMVIEDP ELSLWATLVE HLQTRPLLTS PPPLEARHGY LPSTVRRSKF HSTTDSPSLD PDFTVRDPHS SSLAGEDQTY QLPLLETLYN LVRYGELESA IKVCEQGGEP WRGASLMGVR RWTMGGMSKG TEPTVMTGNR YRALWKKSCR TIAKNHTLLP AERNLYAAMI SDLPTLLPAC ESWEDYLWAH VQHRIEARLE KRWRELGGFW EGEGGVGKDD VEEVEMAKGG LEEVFASMRN LQNASISITM TEPYHVAQQM ILLDRTEALF HGFADQLLEL ESGVSPELIA PLLRFFTHLA LILRTLSQPV PVSAANAIIQ AYLQILEREG NDKLVAMYAA CLREGSGEES YARFLWSMDP SAGRDSRSEA LLRAKKHNLD VALIARETVR LCLEEVVADP PKRLLAEPDI VPISIGLTEH DVVLIRSIEW LIILPETADD ALVRSCQLVR YFLSKGQANA AQSLLLSLPS LPTSSSLTNQ SHLLELASYN RLFSLFSSHT YFADILFRQP SLTASKLEVH AWKKDLEACV EDVWKGTVGL IKERWLDLPS LSPKVEKDGK GLYEYEGEEE RQKQLKLIRH IFIPDLVLRL HNTLIEQSVL FPYFLQRALE LASIVADGRY RVYEGFLPLA TIAGGAEMVG GGEKGVSRLE VYMDKIREVA LEVLKSGNGN AFKVRRLA