Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00540 |
Symbol | |
ID | 3254488 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 178627 |
End bp | 180642 |
Gene Length | 2016 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 46% |
IMG OID | 638253542 |
Product | hypothetical protein |
Protein accession | XP_567617 |
Protein GI | 58260414 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCATTC CAGCAGAGGC ACAAGTAGCT TCTCCACCCA TTTTCACTGA AGACGAGGAA GTCGTCTTGG AAGAGGAGGA TGTCGAACAA CCTGATCTGC GTCGAATCGC CACACATCTA CATGACCCAG CCTCCACAGC TACGTTGAGC GATGATCAAG CAGGCACAAC TGCCGGACAG ACTGTCTTAT CGCATGATCT GGAGAAGGGG GAGGGTCGTA TGGTAGTGGA TTTCGCAGAA GGGCATTATG AAGACCCCAA GGAATGGTCG AAAGGAAAGA AATGGTAAGT TTGACACTTC ATCTCCTATG CAAGTTGCTG ATTGCAAGTA GGTTTGTCAC CATTGCAACC TCTATACTTT GTCTCACAGT CGCTCTTGGT TCCGCTATGC CCACTGGTGA TTTACCTGGA GCTGCTGAAA CTCTCCACGT TTCCAATGAA GCTATCTACC TTACTATTGC CCTTTTCGTC GTTGGTTTCG GTGTCGGTCC TCTTCTGTTC GCTCCATGTA AGTTCCTTCT TGACAGGTAA CGGATAGTAG CATTGACCTT TCACCAGTAT CTGAAGTCAT TGGACGGAAG ACAGTCTACT GCATCAGTAT TTTCTTTTAT TTCATCTTCA CCCTCCCGTC ATGTCTCGCG CCCAATATCG CCACAATGTT GGCTGGTCGT ATGGTGAGTC ATCGACCTGC AATCCCTCCA GGCCTTGCTG ATGAAGATTT TTAGATCGCC GGTATCGCCT CTTCGGCTCC CATGACCAAT GTGGGAGGTA CCATTGCTGA TATCTGGTCG GTTGAGGAAC GTGGTATTCC TATGGCTCTT TTCAGTGGTA TGATTTTGTG AGTTAAACGA AGCGGTCGAG AGAGACCCCC GCTGATGCCT TTTTTAGCAT GGGACCTTGT CTTGGACCAT TGTTTGGTGG TTGGATCGCT TACAAGACCG GACAATGGCG ATGGATTTAC TGGGTTTTGT TCATTTTTGT CGGAGTCGTC TTCCTCTTCA CGCTCGTTAT GCCTGAAACT CTCGCCCCTG TCCTCCTACG ACGGAAAGCC AAGAAACTAA ACAAGGAGAA CCACGTTGAC TCCTATGTTT CGAAACATGA TCTCCACCAC GTTCCCCTTT CCACCACTCT GAAAACTGCC ATGATTCGAC CATTCATTCT CATGTTCATG GAACCCATTA TCTTGTTCAT GAGTTTTTAC TTATCTTTCG TCTACGCTCT GCTCTATGCC ACTTTCTTCG CCTTCCCAAT TGCTTTCGAA GAAATTAGAG GGTGGAATAT GGGTACCACT GGCGTTAGTT TCGTATCTAT CATCGTAAGT TGTCTTTTTC CTGTCATTGT AACATTTGTG CTGATTTCAG CCAGATCGGT ATTGCAGCTG CCTTGCTCTG TATGCCCTTT CAAGAAAGAA TCTACAAAAA GGCTTGTCGA AATGGTCAAG TCCCTGAAGC GAGATTGTAC CCCATGTTAC TTGGTTGTGT GTAAGTACTC CTGTATGGGT GCTGTTCAAT CTCTAATGTA GCCATAGCAT CCTCCCAATT GCTCTTTTCA TCTTAGCTTT CACATCGTAC CCTGGAATCC ACTGGATTGG ACCTTGTGTC GCTGGTGTGC TTTTCGGATT TTCAATGGTT ATCATTTATA TCTCTGCCAA CAGTGTGAGT TTCCATTATT GTATGTCTTT TTTTTCCACA CTTTTACTAA CGTCAAACAG TATATTGTTG ATTCCTATGC TTCTTTCGCT GCGTCAGCCA TTGCTGCCAA GACACTGATG AGGTCTCTCA TCGGAGCCTC AGTTCCTCTT TGGATCACTC AGTTATTTGT GAGTTGTTAA TGACATTTCA CAGGCTGGAA TCTAACTAAC TCTTTCTTCA ATCAGCACAA CCTTGGGTTC CAATATGCTG GTCTCTTTTT AGCACTCATA TCTTGTGTTA TTATTCCCAT TCCTTGGGTC TTCTTCCTCA AGGGTGCAGC TGTCAGAAAG CGATCAAAGA GAGCCGAGAA GTCTGGTACC AATTAA
|
Protein sequence | MIIPAEAQVA SPPIFTEDEE VVLEEEDVEQ PDLRRIATHL HDPASTATLS DDQAGTTAGQ TVLSHDLEKG EGRMVVDFAE GHYEDPKEWS KGKKWFVTIA TSILCLTVAL GSAMPTGDLP GAAETLHVSN EAIYLTIALF VVGFGVGPLL FAPLSEVIGR KTVYCISIFF YFIFTLPSCL APNIATMLAG RMIAGIASSA PMTNVGGTIA DIWSVEERGI PMALFSGMIF MGPCLGPLFG GWIAYKTGQW RWIYWVLFIF VGVVFLFTLV MPETLAPVLL RRKAKKLNKE NHVDSYVSKH DLHHVPLSTT LKTAMIRPFI LMFMEPIILF MSFYLSFVYA LLYATFFAFP IAFEEIRGWN MGTTGVSFVS IIIGIAAALL CMPFQERIYK KACRNGQVPE ARLYPMLLGC VILPIALFIL AFTSYPGIHW IGPCVAGVLF GFSMVIIYIS ANSYIVDSYA SFAASAIAAK TLMRSLIGAS VPLWITQLFH NLGFQYAGLF LALISCVIIP IPWVFFLKGA AVRKRSKRAE KSGTN
|
| |