Gene CNF02120 details


Gene Information

Locus tag: CNF02120
Symbol:
ID: 3258197
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 619804
End bp: 622280
Gene Length: 2477 bp
Protein Length: 470 aa
Translation table:
GC content: 44%
IMG OID: 638257338
Product: expressed protein
Protein accession: XP_571560
Protein GI: 58268808
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
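
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention; this is the reading under which they reproduce the listed gene length). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_bp(protein_aa: int) -> int:
    """Coding bases for a protein: three per residue plus one stop codon."""
    return 3 * protein_aa + 3


length = gene_length(619804, 622280)
coding = min_coding_bp(470)

print(length)           # 2477
print(coding)           # 1413
print(length - coding)  # 1064
```

The 1064 bp that are not needed to encode 470 residues would be non-coding sequence, presumably introns and any untranslated flanks, which fits the record's "Is gene spliced: Yes" field.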


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCAAGTGCGT GCGAATACTA CGATGGGTCT AACACAGTCG ACAATGAGCT CCAACCAGCC 
AGGCAGAGGC TTCTTGAAAG TTAGCGGAAA GGATATAACC TTAGATGGAA AACCGATTAC
ATTGAGAGGT ACGAGCAAAA GTCAGGCTTG GACTTCATAC ACTGACTAGT CATTGTAATG
TAGGGACTGC AATTGGCGGC TGGTGTGGGT TGCAATGCTA TTTTTTCTGT GCTGACCCGT
GTGTTTACTC ATAATATTTG TATTGATTGG GTATCAGTGA ATATGGAGAA CTTCATCACC
GGCTATGCTG GACATGAACA TCAAGTTCGG CATGCTCTAA AGCAGGTATT AGGGACAGAG
AAATACAATT ACTTCTTTGA AAAGGTCAGT TGAAAGGAAT GAACAAACAT CTGACAACTT
CTACTGACTG CTTGACAGTT CCTTGAGTAT TTCTTCGCCG AAGATGATGC AAAATTCTTT
GCATCACTAG GATTGAACTG TATTCGCATT CCTGTGAGTT TTCAGTATCG GCCATAAGCA
GGTTACTTAC AATTGCTTGT TTATTGCAGG TAAATTATCA TCACTTTGAG GATGACATGA
ACCCACGAGT GTTCAAGAAA GACGGCTTGA AACATCTCGA TCGCGTGATT CAAATTGTAT
GTCGATCCGT GCAGGTTACT AAACCACTTT TGTTCACATT CATGCAGTGT GCCAAGTACG
GTATCTACAC TGTCATCGAT CTGCATGCAG CTCCCGGAGG TATGCACCCT CATGCGAAAC
AGATTGACAG GCTTGCTGAC TGTTTAAAGG ACAAAATTTC GACTGGCATT CAGACAATCC
AACTCACAAG GCGTTGTGTA AGCCCAAAAT CGCGACACGC GGTAATATTC GTTTGACGTT
TGTTTGGTAG TCTATGAGCA CAAGGATTTC CAAGATCGAA CAGTCTTCAT TTGGGAAAAC
CTAGCGCGTG TGAGTACGTC CAGCTGCGTC GTGTAACCTC TCGTTCTAAT TTTCGCCCAC
GACCAGCATT CTAAGGACAA TACTTGGGTT GCAGGTTATA ATCCCTTGAA TGAACCTTCT
GATGAGCAAC ACGTTCGCCT TGTGGCATTC TACAACAGGG TAGAAAAAGC AATCAGATCT
ATTGATAGCA ATCATATGCT CTTTTTAGAG TAAGTGCAAA GGATATGCGT GCTGCTTGTC
TGCCGGTTCC CTCTTAACCT TTTCAATCAT AGCGGAAAGT GAGGCTATTA CGATACATCG
CCACTTCAGA CGCATTGCTA ACCGATATGC TGGGAAAAGC ACTTTTGCAG CGGACTTTAG
CCGGTTTGGG AAGCCTCTCC ACAATTGCGT TTATGCTTGT CATGACTATT CCATGTGAGC
TCATATAATT AAATAAAGAT GCCTGTACTC ATTCTTTAGC TATGGGTTCC CAAATCCACC
CTCTCTATAT GAGGTCAGTC GACGGAAATA TTATCAGGAT CTCTATCATT GACAGTGCTT
TAGGGCTCAA AGGAACAAAT CCAATTCCAC ATTGATTCAT TCAATGGTAA AACCGAGTAT
ATGCGCAAGC ATGGGGTGAG TAGGTCGACT TGTTGGACAT GGTTGTATTG ACTAATTGTT
GTTCGGCGTC CAGAGTCCAG TATGGGGTAA GCAATAGTAA GGAATTTAAA AGGGTATTGA
CTAACCTCTG TGTCAGTTGG GGAATTCGGC CCTGTTTATC AAACATCTAA GGACGGATAT
CCTGATTGGA AACACATCAA TGACACCCGA TTTGATGTCC TTCAGCTTCA GCTTGATATC
TACGCCAAAG CTCGGGCTAG TTGGTCCATC TGGCTCTATA AAGATATTGG TTTCCAGGGT
ATGATTTACG CGGGTGAAGA TACTGCATAT GTAAAACTTC TCAAGGAATT CTTACACAAG
AAAAAGGCAC GTTCGACTAA TCCCCATCCA CTGCCCCGGC TGATACACCC CTTAGGTTGT
TGCCGCTGAT AAGTGGGGAG CGGATGATCG TGCAGTGCGA CCGTTGTTTA CACCCGTTGA
GTCATGGCTT CTCAAGACCG TACCATCAAT CTCGGACCGA TACCCACAAG ATTGGAGTGT
AGGCGAGCAC CTTTCTAGGC TAGTCAGAAA TATGCTCCTC AGTGAAGAGC TAGTCAAAGA
GTACGCAGAG CATTTTAGAG GGAAGAGTCT TGAAGAGTTG GATGAGCTAG CAAAGAGTTT
TAAATTCTGT AAGCCTTAAT TTGTGGGTAT TTTATGGATT GAAAGCTGAT GGATGCTTGG
CCGCGAAGCT AATTGTACTC AGAGGAAGAG GTTGAATGAT GTGCTCAAGT CAGATTCAGA
GCGTGGCACT GATGAGAAGA AGTCGTTGTG GCAAGCTGGT GAGAAGGTAT GACAGAAGAT
CAAGACTTTT GATTGCGAGC ATGTTATACA GTGAGAAAGA ATTGTGTCGA AACCAATAAA
TCAATGCAGA AGTGTTA
 
Protein sequence
MGLTQSTMSS NQPGRGFLKV SGKDITLDGK PITLRGTAIG GWLNMENFIT GYAGHEHQVR 
HALKQVLGTE KYNYFFEKFL EYFFAEDDAK FFASLGLNCI RIPVNYHHFE DDMNPRVFKK
DGLKHLDRVI QICAKYGIYT VIDLHAAPGG QNFDWHSDNP THKALFYEHK DFQDRTVFIW
ENLARHSKDN TWVAGYNPLN EPSDEQHVRL VAFYNRVEKA IRSIDSNHML FLDGNTFAAD
FSRFGKPLHN CVYACHDYSI YGFPNPPSLY ESPVWVGEFG PVYQTSKDGY PDWKHINDTR
FDVLQLQLDI YAKARASWSI WLYKDIGFQG MIYAGEDTAY VKLLKEFLHK KKVVAADKWG
ADDRAVRPLF TPVESWLLKT VPSISDRYPQ DWSVGEHLSR LVRNMLLSEE LVKEYAEHFR
GKSLEELDEL AKSFKFSNCT QRKRLNDVLK SDSERGTDEK KSLWQAGEKV
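
The sequences above are printed in space-separated groups of ten residues. A minimal sketch for stripping that grouping and computing a length and GC fraction follows. The function names are hypothetical, and the sketch makes no claim about how the database itself computed the GC content it reports.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record, used as a small example.
first_line = "GCAAGTGCGT GCGAATACTA CGATGGGTCT AACACAGTCG ACAATGAGCT CCAACCAGCC"
dna = clean_sequence(first_line)

print(len(dna))  # 60
print(round(gc_fraction(dna), 3))
```

Passing the full gene sequence block through `clean_sequence` should give a string whose length matches the listed gene length, which is a quick way to confirm that nothing was lost when copying the sequence.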