Gene CNC06390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC06390 
Symbol 
ID3256377 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1855736 
End bp1857709 
Gene Length1974 bp 
Protein Length474 aa 
Translation table 
GC content47% 
IMG OID638255858 
Productconserved hypothetical protein 
Protein accessionXP_569895 
Protein GI58265478 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGAGTATTC TTTTCCCTCA GCCACTTAGC CACAAGTCCT TTTTTCCTAT CTCCATCTAA 
AAATATAGCT TATTGCTTGT ACTTGGGACC AGACTACCAC GATGGTACAG CCAAAATCGA
TTGTTACCAC CGTGTTCATC GCTCTCGTCC TTGATCTCCT AGGTCCGTCA ACTTGAGGTC
GAGTTCTTCA ATAATTCGGT TCACTAATAA CGCGTCTATT TATAGCATTC ACTATCCCGC
TGCCCCTCTT CCCGCGCCTG ATAGAATGGT ACCTCTCAAA AGACACCTCT CCGGACTCGC
TGATATCCCG CTCGCTTGCG TTTTGCAACT CGATTAGAGC GCCTCTCCAT GCATTACGGC
CGACTTCCGC TGCCTTGTCC AGTAAGGATG ATGCGGGGAC CAAAAACTGG GATGTGGTTC
TTCTCGGAGG TATGATGGGA AGCCTTTTCA GTTTCTGCCA ATGTATCATT AGTCCTTGGC
TGGGTCGTTG TAAGTTCCAT CAGCAGACCG CAACTCAACT ACTGACTACT CACCATGTTT
GTAGTGGCTG ACAAGTATGG AAGAAAGAAA GTTCTCATAG CCACTATGGT CGGAAATGTT
GTCTCTGCCA GCATCTGGAT CAAATCTACC TCTTTTGTGG GTACAAATGA CACTCCATTC
TCAAGCCCGA AAAGCGCATT CTGACTGATT GGCAGGAGTC ATATCTCTTG TCTCGTCTCG
TAGGTGGGTT GAGCGAAGGC AATGTACAGC TGAGCACGTA AATACGGGTT TAGAAGCCAC
CACTTCAAAA AAGCTGATAT AATCACAGGG CAATCATCAG TGATGTGACT ACATCTGCCA
CCAGGTCCAA ATCTCTCGCC CTTGTTGGTA TTGCCTTTTC CATCTGTTTC ACTTTTGGGT
AAGTTTCATA CGAGTTAGTG ACGCATTACA ACTATTAATA TCGGGCCAGA CCATCTCTAG
GTGCCTACTT TGCAACTCGA CCTCTCCCGC TTGGGACCTC TGATGATAAA TTCAACGTTT
ATGCGATGCC AGCCGCTATC TCCTTAGGAC TTCTACTTCT TGAAACCTTG TTCTTAGCCG
CCAAGTTGCC AGAGACGAAA GGATATAAAG TCGAAGAGGT ATCTAATGCA AATCCCGAGC
AACCATCCGT GCCCGAGCCC AAGGACATCT TTGAAGACAA AGAGCAGAAG GTGCAACGAT
TGAAAGATAT GACCGGTCTC CATGGCTATT TCCTCTTGTT CTTCTCTGGA GTGAGTGCTT
TCACCTCTCC AACTTTAATG ATCTTGCTGA ACTAATTCTA GGCAGAATTC ACATTGACTT
TCCTGACCTA TGACATCTTC TCTGCGTCCA ATGCATACAA CGGCAAACTG CTCAGTTGTA
AGTCGCTGTC ACCACTATCC TTTGTATTCT CCTTCTGAAC AATATGCAGA CATCGGTATC
CTGGCAACCA TGATTCAAGC TCGTCACGTG CGTCCATCCA TGGCCAAAAC TGGTGAAATC
CAAGTAGCTC TAGATGGTAT CGCCAGCTGC ATTCTCGGCG TCTTTCTTCT CCATCTAATC
CCATACACTG TCTCCCTCGG CACTATCAAT CACATTCTCC TCTATGTCGC TGCTACATGC
CTAGCTTACA CCAGCGCAAC GACCGTCACT GGGTTAACAG CTGCCGCTGC TGGATGCTGT
GATGAGCGAT ACCCTGAGCT GCAGAGGGGG AGGGCTTTAG GCAAGTTCAG GTCGAGAGGA
CAGTTAGGTA GGGCAGTGGG GCCGTTGTTG GCGTCAATGT TGTATTGGAT GGAAGGGCCT
TCGGTGGCAT ATTTGACATT GGCTATGTGC TTGGGTGGTG TATTAATTTT GGCTCCCCGA
GGCGGTGTGC AAAGGTATAG ATGGTGGGTC AAAGAGACAA AAGAATAGAA GGACAGGAAA
AGCACGACGG CGACCAAGTT TGTTTAAAGA CATGTACATT ACGACATATC ATGA
 
Protein sequence
MVQPKSIVTT VFIALVLDLL AFTIPLPLFP RLIEWYLSKD TSPDSLISRS LAFCNSIRAP 
LHALRPTSAA LSSKDDAGTK NWDVVLLGGM MGSLFSFCQC IISPWLGRLA DKYGRKKVLI
ATMVGNVVSA SIWIKSTSFE SYLLSRLVGG LSEGNVQLST AIISDVTTSA TRSKSLALVG
IAFSICFTFG PSLGAYFATR PLPLGTSDDK FNVYAMPAAI SLGLLLLETL FLAAKLPETK
GYKVEEVSNA NPEQPSVPEP KDIFEDKEQK VQRLKDMTGL HGYFLLFFSG AEFTLTFLTY
DIFSASNAYN GKLLSYIGIL ATMIQARHVR PSMAKTGEIQ VALDGIASCI LGVFLLHLIP
YTVSLGTINH ILLYVAATCL AYTSATTVTG LTAAAAGCCD ERYPELQRGR ALGKFRSRGQ
LGRAVGPLLA SMLYWMEGPS VAYLTLAMCL GGVLILAPRG GVQRYRWWVK ETKE