Gene CNM00630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM00630 
Symbol 
ID3255127 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp194567 
End bp196533 
Gene Length1967 bp 
Protein Length546 aa 
Translation table 
GC content49% 
IMG OID638254215 
Producttrehalose transport-related protein, putative 
Protein accessionXP_568347 
Protein GI58261874 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGTCAC ACAGCGACAA GACAACCCGT ATTGCCGGCA AACATGCCGA GCAGCTTTTC 
TATGCTGATC CAACGCTCGT GCAGAGTGCA CTCTCGGCGG CAGCTGCAGA GAAAGAACTG
GGCTTCAAAA AGACCTTCCG ACTTTACTAC AAAGCTGCTT TGTGGTCCAT GGCTCTTTCC
CTTGCTCTTG TAATGGAGGG CTATGATGTC GGCATCGTGA GTTATGATCG CTTGACCATT
GTATACTGTA CTGACAGCCT CACTCAGATC AACTCGTTCT GGGGTCAGGC TTCCTTCTTG
AACAAGTTCG GCAGTACCGC TGCGGATGGA ACGAAGTATA TCCCTGCCAA TTGGCAAGCT
GCGTTGAACA ACGCGACCTC GATCGGACAA ATGATCGGTC TAGCTATCAA CGGATGGGCA
CAACCCAAGT TTGGTTCCAA AAAAGTTTAC TTGACTGCCA TGGCCGTGAT GGTAGGGACT
ATCTTCCTTC CAGTGTTTTC GACTTCGTTG CCCATGCTTT TTGGAGGTGA AATCCTGTGT
GGTATCCCAT GGGGTATCTT CCGTAAGTAT ATACACTGAC ATGGGGTATT ATGTGCTCAT
CTAACCATCC ATAGAAACCC TCTCCACCGC TTACGCTGCC GAAATCTGTC CTCTTGCCAT
GCGAGGTTAC CTCACCGCAT TTGTCAACAT GTGCTGGGGC TTTGGACTCT TGCTCTCTGC
TGGGGTGGTT CGGGCCTCTC TCGAGCTAGA CAGTCAATGG GGCTGGAGAA TCCCTTTCAT
GATTCAATGG GTGTGGCCAG TACCATTGTT CGTCATTGCG TGTTTCGCTC CTGAAAGTGA
GTGCATTGCC ACCCCTGGAT CCATACGATC GCTCACAGGT TTCATGGAAC TGCAGGTCCT
TGGTACCTTG TCAAAGTCGG CCGCGAAGAC GACGCCCGAG CGACTACTAA GCGACTTGCA
CCATCTGAAT ACCTCACCGA TCAGCTCGTT GATCAACAAA TCGCCCTCAT GAAACATACC
ATCGAAATGG AGAAAGCAGA AACTCAAGGC GCTTCTTTCT TAGACTGTTT CAAAGGTTCC
AATCTGCGTC GAACTGAAAT TGTAAGTGTC CCATTACATC ACATTAAAGG AGCGTATCAA
CCCTTACACA AATCTAGGTC ATGCTGGTAT GGATTATCCA GTACTGGTCC GGCCAAAACA
TCATCAACTA CGCCACCCAG TAGTAAGTGT GATTCATACT GCGCATTTTG CAAACCCTGC
TGATCAATGA CACAGCCTGC AGACCGGCGG AATGACTGAA GATGGCGCTT TCAACATGTT
CCTCGGCATC ACATGCTGCT ACATCGTGGG CACTGCTGCG TGCTGGTACA CTATGCATCA
CTTCGGTCGA AGATCCATTT ACATCGCAGG AGTCGCCCTG ATGGTAGTCT GGCACGTCAT
CATTGGAAGT CTTGGAACTG TCAACTCGAC CAAAGCCACC CTTGCAATTG GCATTGTAAT
GGTCATCATC AACTTTTCCA CCAATGCAAC TTTCTTCCCC GTGACCTACA CGATTGCGGC
AGAGGTTCCC GCTACTCGTG TCAGAGCAAA GACCATGGTC CTCGGCAGAG GAGTCTACCT
CATCTCATCA ATCATTTGTA ATCAGATCAC CCCTCGAATG TTCTCCGTCT CAGAGTGGAA
CTGGGGTGCC AAGTCCAGCT TCTTCTGGAT GGGGTGTTGT CTCATATCCT TGGTGTACCT
ATGGTTCCGA CTTCCCGAGA CCAAAGGTAG AATGTTCAGC GAGTTGGATG TGCTGTTTGG
TGAGTGAGGC CTTTTCAAAA TATTGGGATA TGGCTGATCA CGTCACCGTT ACAGCAAATG
AAGTACCGGC TCGGCAATTC AAGCACACTA TCGTTGATGA GCTTTCCGCA GTGACTGAAG
AGAAATACAC TGAGAAGGGA GAGATAGAAT ATCAGGAGAA TGCGTAG
 
Protein sequence
MLSHSDKTTR IAGKHAEQLF YADPTLVQSA LSAAAAEKEL GFKKTFRLYY KAALWSMALS 
LALVMEGYDV GIINSFWGQA SFLNKFGSTA ADGTKYIPAN WQAALNNATS IGQMIGLAIN
GWAQPKFGSK KVYLTAMAVM VGTIFLPVFS TSLPMLFGGE ILCGIPWGIF QTLSTAYAAE
ICPLAMRGYL TAFVNMCWGF GLLLSAGVVR ASLELDSQWG WRIPFMIQWV WPVPLFVIAC
FAPESPWYLV KVGREDDARA TTKRLAPSEY LTDQLVDQQI ALMKHTIEME KAETQGASFL
DCFKGSNLRR TEIVMLVWII QYWSGQNIIN YATQYLQTGG MTEDGAFNMF LGITCCYIVG
TAACWYTMHH FGRRSIYIAG VALMVVWHVI IGSLGTVNST KATLAIGIVM VIINFSTNAT
FFPVTYTIAA EVPATRVRAK TMVLGRGVYL ISSIICNQIT PRMFSVSEWN WGAKSSFFWM
GCCLISLVYL WFRLPETKGR MFSELDVLFA NEVPARQFKH TIVDELSAVT EEKYTEKGEI
EYQENA