Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE04840 |
Symbol | |
ID | 3257959 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 1352332 |
End bp | 1354371 |
Gene Length | 2040 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257068 |
Product | conserved hypothetical protein |
Protein accession | XP_571003 |
Protein GI | 58267694 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGACAACA ACAATCGTCG CTCTTCAAAA TGCCATATCA CATCGGGTCT CCGGGGGCTT TGAGAGATAA ACCTAAAATA TAGCGAGAAC GACGACATAA CCCCTTGCGT AGTACAGCAT GATATCGTCA CGCATGCGTT TCCCCAGCGA AAGCGATCAG GGCTCAAGCG ATTGATCTTA CGGCTACCCC CCGGTTTCTT CAATTCTCCC AGAAGATCCT GTCCCAAGAT GGCGGAGCCG CCTTATACTA CCTTCACCAA ATGGCAAAAG CTGTCGCTCG TCGTCGTTGC ATCTCTCTCA GCCTCCTTCA GTGGCTTTGC ATCTAATATA TATTTCCCTG CTATTCCAGT CATGGCTACC TCCCTTGGCA CCTCTATTGA AAACATTAAT TTGACGGTGA CGACGTATAT GATCTTCCAA GCTATCACTC CGACATTTTG GGGGTAAGAC GATTGCAATT CAAACGGAAC CTGCAGTGAC ATGCTAACGG GTCTTAGTGC CATTTCTGAC GCTTATGGCA GACGACTGGT CCTCATATCC ACACTTACGG TATTCTTCTC TGCTTGCATT GGCCTTGCGT TAGTCAAGCA CTATTACCAG CTTGTCATCC TGCGGTGTTT GCAAAGTACC GGAAGCGCAA GTACCATTGC AATCGGCTCA GGCATCATTG GAGATGTGGC AGACAGGAAA GAGCGGGGAA GCTATATGGG CTTCTTTCAA ACAGGTCTTC TACTTCCATT AGGTGAATAC AAGCGTCTGA GTCGCAATAG CTGCTTGGAA CTGATGCTAA CCATGAGCAG CTATTGGACC TGTTTTAGGC GGTATTTTTG CTCAAACATT AGGCTGGTGG GCCATCTTTT GGTTTTTTGT CATTTACGCA GGGATATTCC TTCTAATTCT GGCACTATTT CTGCCGGAAA CTTTGAGGCG TATAGTAGGC AACGGTGCTA TCTACCCTCC TGCTCGATCA CGAACGCCTT TGGAGCATTT CCTAGCTTCG AGGGACAAAA CCCTACCCCC TATAACTTCA GCAACACCCA TACGACCGGA CTGGATAGCC CCTTTGCGCA TTCTCTTTGT ACCTGACGTT TTTTTAACGC TTTCCTTTCT CTCTCTACAT TACGCAACAT GGCAGATGGC GATTACAGCC CAATCTTCTC TGTTCAAGAG CATTTACAAC CTGAACGAAA TTGAGATTGG TCTTACTTTT ATCGCCAACG GCTTTGGCTG TATGCTCGGT ACTCTTTCAA TTGGTCGGTT CCTCGACTAT GACTACCAGC ATTTCAAGAA AAAGTTTTCC GGACCTACAT CAGACTTCCC CATTGAGCAA GCCCGGCTTC GTACTGTCTG GTTCTGGTCT CCGTTCCAGT GCGCCGCAGT CCTATGGTTT GGCTGGACGT TGGATCAAAA GGTACACATG GCGTCCCCTA TTGTCGCATC TTTTGTCCTA GCATGGGCAG CGATGTCCAT CCAAGCCGTT ATTAGTACCT TTATCGTCGA CATATTCCCC AAATCAAGCG CATCTGCTAC AGCCGCTCTC AACCTCGCCA GATGTCTGAT GGGTGCCGGT GCAACAGCTT CGGTAGAACC GTCGATCAAT ACTTTGGGTG TTGGTTTCAC ATTTACACTA TGGGCATGTC TAATGGCATT GTCGTTGGCG TTTGTTGGAG TACAAATGCG CTTTGGCCCA GCATGGCGGA AGAGAAGAGA AAAAAGACTT GAAGAAGGGG AGAAGGGCTA GGTATCCCGG CCGTCATTAC ATAGAATCAA CGGTCAGAAC GGTATGTGCA AAAAAGGTTC AGGCATTAGG TAATCAACCT ATTAAACCTT TTGCCAGACC AACAACGATG CGCTTGATCG CAAGCTGTAT ACCATCTCAG CAGCTCAATC GGGCACGACC GCGAGCGAAG TGAAGCTGGC ATGATTTAAC TGATCGTATC ATAGGGTCCT CATGATGCCC AAGGAGGACA TTGCAGCTGT CAGAGGTTGT CGCTCAGGGA TTAGAAGGAG GGGATGTTTG AAGATTTCCA GGACATACCG
|
Protein sequence | MAEPPYTTFT KWQKLSLVVV ASLSASFSGF ASNIYFPAIP VMATSLGTSI ENINLTVTTY MIFQAITPTF WGAISDAYGR RLVLISTLTV FFSACIGLAL VKHYYQLVIL RCLQSTGSAS TIAIGSGIIG DVADRKERGS YMGFFQTGLL LPLAIGPVLG GIFAQTLGWW AIFWFFVIYA GIFLLILALF LPETLRRIVG NGAIYPPARS RTPLEHFLAS RDKTLPPITS ATPIRPDWIA PLRILFVPDV FLTLSFLSLH YATWQMAITA QSSLFKSIYN LNEIEIGLTF IANGFGCMLG TLSIGRFLDY DYQHFKKKFS GPTSDFPIEQ ARLRTVWFWS PFQCAAVLWF GWTLDQKVHM ASPIVASFVL AWAAMSIQAV ISTFIVDIFP KSSASATAAL NLARCLMGAG ATASVEPSIN TLGVGFTFTL WACLMALSLA FVGVQMRFGP AWRKRREKRL EEGEKG
|
| |