Gene Information |
Locus tag | CNG01170 |
Symbol | |
ID | 3258834 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 330643 |
End bp | 332691 |
Gene Length | 2049 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257734 |
Product | hypothetical protein |
Protein accession | XP_571790 |
Protein GI | 58269268 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
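The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, which agrees with the listed Gene Length. It also assumes the coding sequence ends in a single stop codon. The function names `span_length` and `coding_length` are hypothetical helpers.

```python
# Values copied from the Gene Information table above.
START_BP = 330643
END_BP = 332691
GENE_LENGTH_BP = 2049
PROTEIN_LENGTH_AA = 554


def span_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inclusive."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 2049, matches Gene Length
    print(coding_length(PROTEIN_LENGTH_AA))  # 1665
    # Difference between the genomic span and the coding length.
    print(span_length(START_BP, END_BP) - coding_length(PROTEIN_LENGTH_AA))  # 384
```

Under these assumptions the genomic span is 384 bp longer than the sequence needed to encode 554 residues, which is consistent with the gene being marked as spliced.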
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCGATTGCA TATAAAGTAA GGGTCGGGGT GTCAAGAGGA GAGTCCTCCA TTTGACCAAT TATTTTCTCT TAGTGAGGCT CATTTAGTCC CGCACCGAAC ATCATGTCGA ACAAAGCTCA GACCGCCCTC TTCTTCTCCG TACACCAGTT ATCCAACCAG CCACAGCACG AAGACCCCCA CGCATCTTAT CAACCGTCAA CTACGTCTAC TATAGCTCCA GAAATTGAAG AAAAGGCATT GTCTACTTCG AAGACGCCTT CTTTAAAGGT CGATCATTTT TCAAACGACG CCAAAAATGA GGAGAATGGG GGAGACATTG ACGGAGAGAA TGACTCACTC AGGACAAGCG GAGAAGGCCA ACTTGGAATT ACAGACGAGA AAGAAGACCA AATACCATCT CGCCATGGTG GATTAGGAAT TAAAGTCTCT CAAAGGAAGA AATGGGGACT TTTAGCTCTT TTCAGCCTGT CTCTTGTCAT TGATCGTGAG TCAGATCACA GAGTCTATTC CCGGCAGCCC TCCAACTGAT GGTTTCCATT GCAGAATGGT GTCTGGCTGC TTTCTATATC CTCACTCCCC CTATCACCGA CTCTATGCAA GTTCCCTTCG CCCAACAATC ATGGGTCATC ACCTCCTACA CAGTCACCTT CGCCGCTACT CTCCTGTTCT GGGGCCGAGT CTCCGACCTC TACTCTGCTG CACCTGTCTT CTCTTACGGT ATTGTCACCC TCGGGGTATT GAACTTGATC ATCTCCTTCC TGCCGGAGAG ATATTCTTTC TTCATTTTTC GGGCGCTGTC TGGAATAGCA GGTAGTTCTT CTGTGCCCTC TGCTTATAGG CTTATCATCG CTGTATTTGA GCCTCATGAG CTGAACAAGG CTTTTACTAT CTATGCTATG AGCGGCGCTC TTGCGAATTC TACAGGAAAT ATTATTGCTG GAATCATTAT GTTGATCCCT TCTGGCGGAC AAGGTGAAGC TTGGAGGTGG TTCTTCAGGA TCATATCGGC TATTGTATTG CCCGTGGGAG TGTGGTCAAT ATTTTGGATT CCAAGAAGCA GGGGTGAGAA TTCCGATGTA AACGATAAAT TGGCAAGAAT GGATCTTCCG GGATGCTTCA TGTGAGTCAA ATATTCGACG AATAGTTTAC TGAATGAATA CTTACGGAAA CATATGTGCA GGATGTTAGT GGCGATCGTT CTCTTGATCC TCTCTTTAAC TCTCGGCGCC TCAAACGGCT GGTCAACGCC CGGGTTCATC GCACCTCTCA TCATATCGGC CATTATTTTC CCAGCGTTCT TTGTCTGGGA ATCCCGCATC AAGTCCACCC ACGCACTCCT CCCACCATCA ATATGGCATT ATCATAATTT TACCCTTTGG GTCGTCTTTG CTCTCCTGGG CTATACCTGG TGGTCGGTTA ACTTTTTCGC ACTCATTGAG TATTGGTTGG AATATATGGG TGAAAAGGCG ATCATCGTCT CATTGAGAGT CTTGGCGGAA GGCGTAACTC CAATGGTAGT CACCATCGTC CTCACTAAAT GGGGGCGCTT GATGGAATTC CCCAGGATCT CGATCACATT TGGCGGTCTG CTGGGCATAG CGGCGTATAT CATGTTCATA TTCTCTGGCA CGCATGTTGG GAGAGATTAC TGGCGATACA TGTTTCCAGC CATGCTTTTT GGGGCAGCAG GGATGTGCAT TGTCTTTACC GCTACAAGGT GAGTTTCCAC CCAAAGATCT CAACGCTTCC TACTGATCTG ATTCAAACCA AGTGTTGGTG CGATGTGCGC TGTTCCTGCA 
AGCATTGGGG GCGTAGCGGG CGCTACTTTG CAAGTGTCTT TTCAAGTAGG AGCTGCTGTA TCCTTTGCGG TGCAAGCTGG ACTGTTTACC ATCAATGAAG GTGGGATATC CAATTTTGAC AATCTCAAAG CTTCATTCTA CTTTGAGTTG GGTTTCATTG CGTTATGGGT GATTGGTTTC TTGGTGTTCT ATAAACCAAA GAATACAGAG GTGTCCGGGG ATACAGAAAG AATTGCGGCT GGTCATTAG
|
Protein sequence | MSNKAQTALF FSVHQLSNQP QHEDPHASYQ PSTTSTIAPE IEEKALSTSK TPSLKVDHFS NDAKNEENGG DIDGENDSLR TSGEGQLGIT DEKEDQIPSR HGGLGIKVSQ RKKWGLLALF SLSLVIDQWC LAAFYILTPP ITDSMQVPFA QQSWVITSYT VTFAATLLFW GRVSDLYSAA PVFSYGIVTL GVLNLIISFL PERYSFFIFR ALSGIAGSSS VPSAYRLIIA VFEPHELNKA FTIYAMSGAL ANSTGNIIAG IIMLIPSGGQ GEAWRWFFRI ISAIVLPVGV WSIFWIPRSR GENSDVNDKL ARMDLPGCFM MLVAIVLLIL SLTLGASNGW STPGFIAPLI ISAIIFPAFF VWESRIKSTH ALLPPSIWHY HNFTLWVVFA LLGYTWWSVN FFALIEYWLE YMGEKAIIVS LRVLAEGVTP MVVTIVLTKW GRLMEFPRIS ITFGGLLGIA AYIMFIFSGT HVGRDYWRYM FPAMLFGAAG MCIVFTATSV GAMCAVPASI GGVAGATLQV SFQVGAAVSF AVQAGLFTIN EEVSGDTERI AAGH