Gene Information |
Locus tag | CNC06390 |
Symbol | |
ID | 3256377 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1855736 |
End bp | 1857709 |
Gene Length | 1974 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255858 |
Product | conserved hypothetical protein |
Protein accession | XP_569895 |
Protein GI | 58265478 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
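The length fields in the record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the reported 1974 bp is consistent with that) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of the given length."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_bp = span_length(1855736, 1857709)  # Start bp and End bp from the record
cds_bp = coding_length(474)              # 474 aa plus one stop codon
noncoding_bp = gene_bp - cds_bp          # introns plus any untranslated sequence

print(gene_bp, cds_bp, noncoding_bp)     # 1974 1425 549
```

The 549 bp difference is consistent with the record marking the gene as spliced, although this arithmetic alone cannot separate intron sequence from untranslated sequence.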
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGAGTATTC TTTTCCCTCA GCCACTTAGC CACAAGTCCT TTTTTCCTAT CTCCATCTAA AAATATAGCT TATTGCTTGT ACTTGGGACC AGACTACCAC GATGGTACAG CCAAAATCGA TTGTTACCAC CGTGTTCATC GCTCTCGTCC TTGATCTCCT AGGTCCGTCA ACTTGAGGTC GAGTTCTTCA ATAATTCGGT TCACTAATAA CGCGTCTATT TATAGCATTC ACTATCCCGC TGCCCCTCTT CCCGCGCCTG ATAGAATGGT ACCTCTCAAA AGACACCTCT CCGGACTCGC TGATATCCCG CTCGCTTGCG TTTTGCAACT CGATTAGAGC GCCTCTCCAT GCATTACGGC CGACTTCCGC TGCCTTGTCC AGTAAGGATG ATGCGGGGAC CAAAAACTGG GATGTGGTTC TTCTCGGAGG TATGATGGGA AGCCTTTTCA GTTTCTGCCA ATGTATCATT AGTCCTTGGC TGGGTCGTTG TAAGTTCCAT CAGCAGACCG CAACTCAACT ACTGACTACT CACCATGTTT GTAGTGGCTG ACAAGTATGG AAGAAAGAAA GTTCTCATAG CCACTATGGT CGGAAATGTT GTCTCTGCCA GCATCTGGAT CAAATCTACC TCTTTTGTGG GTACAAATGA CACTCCATTC TCAAGCCCGA AAAGCGCATT CTGACTGATT GGCAGGAGTC ATATCTCTTG TCTCGTCTCG TAGGTGGGTT GAGCGAAGGC AATGTACAGC TGAGCACGTA AATACGGGTT TAGAAGCCAC CACTTCAAAA AAGCTGATAT AATCACAGGG CAATCATCAG TGATGTGACT ACATCTGCCA CCAGGTCCAA ATCTCTCGCC CTTGTTGGTA TTGCCTTTTC CATCTGTTTC ACTTTTGGGT AAGTTTCATA CGAGTTAGTG ACGCATTACA ACTATTAATA TCGGGCCAGA CCATCTCTAG GTGCCTACTT TGCAACTCGA CCTCTCCCGC TTGGGACCTC TGATGATAAA TTCAACGTTT ATGCGATGCC AGCCGCTATC TCCTTAGGAC TTCTACTTCT TGAAACCTTG TTCTTAGCCG CCAAGTTGCC AGAGACGAAA GGATATAAAG TCGAAGAGGT ATCTAATGCA AATCCCGAGC AACCATCCGT GCCCGAGCCC AAGGACATCT TTGAAGACAA AGAGCAGAAG GTGCAACGAT TGAAAGATAT GACCGGTCTC CATGGCTATT TCCTCTTGTT CTTCTCTGGA GTGAGTGCTT TCACCTCTCC AACTTTAATG ATCTTGCTGA ACTAATTCTA GGCAGAATTC ACATTGACTT TCCTGACCTA TGACATCTTC TCTGCGTCCA ATGCATACAA CGGCAAACTG CTCAGTTGTA AGTCGCTGTC ACCACTATCC TTTGTATTCT CCTTCTGAAC AATATGCAGA CATCGGTATC CTGGCAACCA TGATTCAAGC TCGTCACGTG CGTCCATCCA TGGCCAAAAC TGGTGAAATC CAAGTAGCTC TAGATGGTAT CGCCAGCTGC ATTCTCGGCG TCTTTCTTCT CCATCTAATC CCATACACTG TCTCCCTCGG CACTATCAAT CACATTCTCC TCTATGTCGC TGCTACATGC CTAGCTTACA CCAGCGCAAC GACCGTCACT GGGTTAACAG CTGCCGCTGC TGGATGCTGT GATGAGCGAT ACCCTGAGCT GCAGAGGGGG AGGGCTTTAG GCAAGTTCAG GTCGAGAGGA CAGTTAGGTA GGGCAGTGGG GCCGTTGTTG GCGTCAATGT TGTATTGGAT GGAAGGGCCT 
TCGGTGGCAT ATTTGACATT GGCTATGTGC TTGGGTGGTG TATTAATTTT GGCTCCCCGA GGCGGTGTGC AAAGGTATAG ATGGTGGGTC AAAGAGACAA AAGAATAGAA GGACAGGAAA AGCACGACGG CGACCAAGTT TGTTTAAAGA CATGTACATT ACGACATATC ATGA
|
Protein sequence | MVQPKSIVTT VFIALVLDLL AFTIPLPLFP RLIEWYLSKD TSPDSLISRS LAFCNSIRAP LHALRPTSAA LSSKDDAGTK NWDVVLLGGM MGSLFSFCQC IISPWLGRLA DKYGRKKVLI ATMVGNVVSA SIWIKSTSFE SYLLSRLVGG LSEGNVQLST AIISDVTTSA TRSKSLALVG IAFSICFTFG PSLGAYFATR PLPLGTSDDK FNVYAMPAAI SLGLLLLETL FLAAKLPETK GYKVEEVSNA NPEQPSVPEP KDIFEDKEQK VQRLKDMTGL HGYFLLFFSG AEFTLTFLTY DIFSASNAYN GKLLSYIGIL ATMIQARHVR PSMAKTGEIQ VALDGIASCI LGVFLLHLIP YTVSLGTINH ILLYVAATCL AYTSATTVTG LTAAAAGCCD ERYPELQRGR ALGKFRSRGQ LGRAVGPLLA SMLYWMEGPS VAYLTLAMCL GGVLILAPRG GVQRYRWWVK ETKE
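The record reports a GC content of 47% but does not state how it is computed. As a rough sketch, assuming it is the plain fraction of G and C bases over the whole gene sequence, a value of this kind could be computed from the space-grouped sequence text as follows. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Illustration on the first two 10-base groups of the gene sequence only
print(f"{gc_content('CAGAGTATTC TTTTCCCTCA'):.0%}")  # 40%
```

Passing the full gene sequence string instead of the two-group excerpt gives the figure to compare with the reported value.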