Gene CNG01170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG01170 
Symbol 
ID3258834 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp330643 
End bp332691 
Gene Length2049 bp 
Protein Length554 aa 
Translation table 
GC content47% 
IMG OID638257734 
Producthypothetical protein 
Protein accessionXP_571790 
Protein GI58269268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCGATTGCA TATAAAGTAA GGGTCGGGGT GTCAAGAGGA GAGTCCTCCA TTTGACCAAT 
TATTTTCTCT TAGTGAGGCT CATTTAGTCC CGCACCGAAC ATCATGTCGA ACAAAGCTCA
GACCGCCCTC TTCTTCTCCG TACACCAGTT ATCCAACCAG CCACAGCACG AAGACCCCCA
CGCATCTTAT CAACCGTCAA CTACGTCTAC TATAGCTCCA GAAATTGAAG AAAAGGCATT
GTCTACTTCG AAGACGCCTT CTTTAAAGGT CGATCATTTT TCAAACGACG CCAAAAATGA
GGAGAATGGG GGAGACATTG ACGGAGAGAA TGACTCACTC AGGACAAGCG GAGAAGGCCA
ACTTGGAATT ACAGACGAGA AAGAAGACCA AATACCATCT CGCCATGGTG GATTAGGAAT
TAAAGTCTCT CAAAGGAAGA AATGGGGACT TTTAGCTCTT TTCAGCCTGT CTCTTGTCAT
TGATCGTGAG TCAGATCACA GAGTCTATTC CCGGCAGCCC TCCAACTGAT GGTTTCCATT
GCAGAATGGT GTCTGGCTGC TTTCTATATC CTCACTCCCC CTATCACCGA CTCTATGCAA
GTTCCCTTCG CCCAACAATC ATGGGTCATC ACCTCCTACA CAGTCACCTT CGCCGCTACT
CTCCTGTTCT GGGGCCGAGT CTCCGACCTC TACTCTGCTG CACCTGTCTT CTCTTACGGT
ATTGTCACCC TCGGGGTATT GAACTTGATC ATCTCCTTCC TGCCGGAGAG ATATTCTTTC
TTCATTTTTC GGGCGCTGTC TGGAATAGCA GGTAGTTCTT CTGTGCCCTC TGCTTATAGG
CTTATCATCG CTGTATTTGA GCCTCATGAG CTGAACAAGG CTTTTACTAT CTATGCTATG
AGCGGCGCTC TTGCGAATTC TACAGGAAAT ATTATTGCTG GAATCATTAT GTTGATCCCT
TCTGGCGGAC AAGGTGAAGC TTGGAGGTGG TTCTTCAGGA TCATATCGGC TATTGTATTG
CCCGTGGGAG TGTGGTCAAT ATTTTGGATT CCAAGAAGCA GGGGTGAGAA TTCCGATGTA
AACGATAAAT TGGCAAGAAT GGATCTTCCG GGATGCTTCA TGTGAGTCAA ATATTCGACG
AATAGTTTAC TGAATGAATA CTTACGGAAA CATATGTGCA GGATGTTAGT GGCGATCGTT
CTCTTGATCC TCTCTTTAAC TCTCGGCGCC TCAAACGGCT GGTCAACGCC CGGGTTCATC
GCACCTCTCA TCATATCGGC CATTATTTTC CCAGCGTTCT TTGTCTGGGA ATCCCGCATC
AAGTCCACCC ACGCACTCCT CCCACCATCA ATATGGCATT ATCATAATTT TACCCTTTGG
GTCGTCTTTG CTCTCCTGGG CTATACCTGG TGGTCGGTTA ACTTTTTCGC ACTCATTGAG
TATTGGTTGG AATATATGGG TGAAAAGGCG ATCATCGTCT CATTGAGAGT CTTGGCGGAA
GGCGTAACTC CAATGGTAGT CACCATCGTC CTCACTAAAT GGGGGCGCTT GATGGAATTC
CCCAGGATCT CGATCACATT TGGCGGTCTG CTGGGCATAG CGGCGTATAT CATGTTCATA
TTCTCTGGCA CGCATGTTGG GAGAGATTAC TGGCGATACA TGTTTCCAGC CATGCTTTTT
GGGGCAGCAG GGATGTGCAT TGTCTTTACC GCTACAAGGT GAGTTTCCAC CCAAAGATCT
CAACGCTTCC TACTGATCTG ATTCAAACCA AGTGTTGGTG CGATGTGCGC TGTTCCTGCA
AGCATTGGGG GCGTAGCGGG CGCTACTTTG CAAGTGTCTT TTCAAGTAGG AGCTGCTGTA
TCCTTTGCGG TGCAAGCTGG ACTGTTTACC ATCAATGAAG GTGGGATATC CAATTTTGAC
AATCTCAAAG CTTCATTCTA CTTTGAGTTG GGTTTCATTG CGTTATGGGT GATTGGTTTC
TTGGTGTTCT ATAAACCAAA GAATACAGAG GTGTCCGGGG ATACAGAAAG AATTGCGGCT
GGTCATTAG
 
Protein sequence
MSNKAQTALF FSVHQLSNQP QHEDPHASYQ PSTTSTIAPE IEEKALSTSK TPSLKVDHFS 
NDAKNEENGG DIDGENDSLR TSGEGQLGIT DEKEDQIPSR HGGLGIKVSQ RKKWGLLALF
SLSLVIDQWC LAAFYILTPP ITDSMQVPFA QQSWVITSYT VTFAATLLFW GRVSDLYSAA
PVFSYGIVTL GVLNLIISFL PERYSFFIFR ALSGIAGSSS VPSAYRLIIA VFEPHELNKA
FTIYAMSGAL ANSTGNIIAG IIMLIPSGGQ GEAWRWFFRI ISAIVLPVGV WSIFWIPRSR
GENSDVNDKL ARMDLPGCFM MLVAIVLLIL SLTLGASNGW STPGFIAPLI ISAIIFPAFF
VWESRIKSTH ALLPPSIWHY HNFTLWVVFA LLGYTWWSVN FFALIEYWLE YMGEKAIIVS
LRVLAEGVTP MVVTIVLTKW GRLMEFPRIS ITFGGLLGIA AYIMFIFSGT HVGRDYWRYM
FPAMLFGAAG MCIVFTATSV GAMCAVPASI GGVAGATLQV SFQVGAAVSF AVQAGLFTIN
EEVSGDTERI AAGH