Gene CNE04840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE04840 
Symbol 
ID3257959 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp1352332 
End bp1354371 
Gene Length2040 bp 
Protein Length466 aa 
Translation table 
GC content48% 
IMG OID638257068 
Productconserved hypothetical protein 
Protein accessionXP_571003 
Protein GI58267694 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GATGACAACA ACAATCGTCG CTCTTCAAAA TGCCATATCA CATCGGGTCT CCGGGGGCTT 
TGAGAGATAA ACCTAAAATA TAGCGAGAAC GACGACATAA CCCCTTGCGT AGTACAGCAT
GATATCGTCA CGCATGCGTT TCCCCAGCGA AAGCGATCAG GGCTCAAGCG ATTGATCTTA
CGGCTACCCC CCGGTTTCTT CAATTCTCCC AGAAGATCCT GTCCCAAGAT GGCGGAGCCG
CCTTATACTA CCTTCACCAA ATGGCAAAAG CTGTCGCTCG TCGTCGTTGC ATCTCTCTCA
GCCTCCTTCA GTGGCTTTGC ATCTAATATA TATTTCCCTG CTATTCCAGT CATGGCTACC
TCCCTTGGCA CCTCTATTGA AAACATTAAT TTGACGGTGA CGACGTATAT GATCTTCCAA
GCTATCACTC CGACATTTTG GGGGTAAGAC GATTGCAATT CAAACGGAAC CTGCAGTGAC
ATGCTAACGG GTCTTAGTGC CATTTCTGAC GCTTATGGCA GACGACTGGT CCTCATATCC
ACACTTACGG TATTCTTCTC TGCTTGCATT GGCCTTGCGT TAGTCAAGCA CTATTACCAG
CTTGTCATCC TGCGGTGTTT GCAAAGTACC GGAAGCGCAA GTACCATTGC AATCGGCTCA
GGCATCATTG GAGATGTGGC AGACAGGAAA GAGCGGGGAA GCTATATGGG CTTCTTTCAA
ACAGGTCTTC TACTTCCATT AGGTGAATAC AAGCGTCTGA GTCGCAATAG CTGCTTGGAA
CTGATGCTAA CCATGAGCAG CTATTGGACC TGTTTTAGGC GGTATTTTTG CTCAAACATT
AGGCTGGTGG GCCATCTTTT GGTTTTTTGT CATTTACGCA GGGATATTCC TTCTAATTCT
GGCACTATTT CTGCCGGAAA CTTTGAGGCG TATAGTAGGC AACGGTGCTA TCTACCCTCC
TGCTCGATCA CGAACGCCTT TGGAGCATTT CCTAGCTTCG AGGGACAAAA CCCTACCCCC
TATAACTTCA GCAACACCCA TACGACCGGA CTGGATAGCC CCTTTGCGCA TTCTCTTTGT
ACCTGACGTT TTTTTAACGC TTTCCTTTCT CTCTCTACAT TACGCAACAT GGCAGATGGC
GATTACAGCC CAATCTTCTC TGTTCAAGAG CATTTACAAC CTGAACGAAA TTGAGATTGG
TCTTACTTTT ATCGCCAACG GCTTTGGCTG TATGCTCGGT ACTCTTTCAA TTGGTCGGTT
CCTCGACTAT GACTACCAGC ATTTCAAGAA AAAGTTTTCC GGACCTACAT CAGACTTCCC
CATTGAGCAA GCCCGGCTTC GTACTGTCTG GTTCTGGTCT CCGTTCCAGT GCGCCGCAGT
CCTATGGTTT GGCTGGACGT TGGATCAAAA GGTACACATG GCGTCCCCTA TTGTCGCATC
TTTTGTCCTA GCATGGGCAG CGATGTCCAT CCAAGCCGTT ATTAGTACCT TTATCGTCGA
CATATTCCCC AAATCAAGCG CATCTGCTAC AGCCGCTCTC AACCTCGCCA GATGTCTGAT
GGGTGCCGGT GCAACAGCTT CGGTAGAACC GTCGATCAAT ACTTTGGGTG TTGGTTTCAC
ATTTACACTA TGGGCATGTC TAATGGCATT GTCGTTGGCG TTTGTTGGAG TACAAATGCG
CTTTGGCCCA GCATGGCGGA AGAGAAGAGA AAAAAGACTT GAAGAAGGGG AGAAGGGCTA
GGTATCCCGG CCGTCATTAC ATAGAATCAA CGGTCAGAAC GGTATGTGCA AAAAAGGTTC
AGGCATTAGG TAATCAACCT ATTAAACCTT TTGCCAGACC AACAACGATG CGCTTGATCG
CAAGCTGTAT ACCATCTCAG CAGCTCAATC GGGCACGACC GCGAGCGAAG TGAAGCTGGC
ATGATTTAAC TGATCGTATC ATAGGGTCCT CATGATGCCC AAGGAGGACA TTGCAGCTGT
CAGAGGTTGT CGCTCAGGGA TTAGAAGGAG GGGATGTTTG AAGATTTCCA GGACATACCG
 
Protein sequence
MAEPPYTTFT KWQKLSLVVV ASLSASFSGF ASNIYFPAIP VMATSLGTSI ENINLTVTTY 
MIFQAITPTF WGAISDAYGR RLVLISTLTV FFSACIGLAL VKHYYQLVIL RCLQSTGSAS
TIAIGSGIIG DVADRKERGS YMGFFQTGLL LPLAIGPVLG GIFAQTLGWW AIFWFFVIYA
GIFLLILALF LPETLRRIVG NGAIYPPARS RTPLEHFLAS RDKTLPPITS ATPIRPDWIA
PLRILFVPDV FLTLSFLSLH YATWQMAITA QSSLFKSIYN LNEIEIGLTF IANGFGCMLG
TLSIGRFLDY DYQHFKKKFS GPTSDFPIEQ ARLRTVWFWS PFQCAAVLWF GWTLDQKVHM
ASPIVASFVL AWAAMSIQAV ISTFIVDIFP KSSASATAAL NLARCLMGAG ATASVEPSIN
TLGVGFTFTL WACLMALSLA FVGVQMRFGP AWRKRREKRL EEGEKG