Gene CNL04420 details


Gene Information

Locus tag: CNL04420
Symbol:
ID: 3254885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 223860
End bp: 225123
Gene Length: 1264 bp
Protein Length: 273 aa
Translation table:
GC content: 50%
IMG OID: 638253913
Product: conserved hypothetical protein
Protein accession: XP_567992
Protein GI: 58261164
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5597] Alpha-N-acetylglucosamine transferase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACCGCGATG TCTCCTGAAA AGCTCACCGA GGAAAATATC CTAGATCGTG AAGAGACAAA 
CATCCCGTCT ACCTCTTCAT CCATCCTTCC TGCCCCTATC TCCTACCCTG CGACGGTGTC
CTACCGAATA GCGAACACAT GGTTGCGCCT GATTGCCAGC GGCATCATCG CCATACTACT
GATCCATATC CTCCACATCC AGCTATCTCC AATGGAGGCG GTAGAGCCGG CCCCGCCAAC
TCCGCCACCA GCGGCAGAAG CCTACGTGAC CTTCCTCGCG CATTCCGATG ATCCCAGACC
ATGGTATTTC AACGCCGTGC GCAGGCTCAT GTTTCAGCTC AAATATGACC CCCTTACACT
CGACCCACAC CCCAGAGATT TTGTGGTAAT CACCACACCA GGTGTCCCAG AATGGCAGCT
CGAGCAGCTT CGTGAAGAGG GAGCTATCAT TGCCTCCCGT CCTTTGATCG ACCACCTCCC
TCTTCCGGAA AAGGGAATCT CGCGCTACGC TGAAGTGTAC ACCAAGTTGT TCATTTTCAA
CCTTACAGAC TATGAGCGCG TTCTCTTTGT TGATGCTGAC CAGTTGATGG TGAAGCCGTT
GACTGGGATT TGGGATGATC CGAATGCCTG GCCGGAGAGC GGGATGGCTG CGTGTGGAGA
GAGTAAGAGT GCCTGGGACC ATCCGACGCC GATCGAGGAT CAAAATTATT TCAATAGTGG
TTTCATGTTG GCTAGGCCGG ATGAAAAGAC TTTCAACGAG TTGCTACAGG AGAAGGATTT
CGACCCATGG TTTCCTGAAC AGGTGAGATT GGTCAGGGTT TTATCATAAT ATCCGAGCTG
CCAGATCAAC TGCTGACAGC GTTATCGTAG AACTTGTTGA ATCATTACTT CCGGAGGGAT
GGGCCCAGAC CGTGGAGGCC TCTGAATCAT ATGTGGGTTG TACCTTGACA TGTTTCAATT
TGTCTACGTA TCACTTACAA GGGCTTTAGG TTTGTCACAA CCTTCCCAAG GAAAGTCGAC
CTCGAAGCTG GTATCCATGT GTAAGCCCAG ATTGCAAAAT CCATTCATCC CAAGCTAATT
TGGAATAGCC TCCATGACAA GATGTGGTTA CCCCATATTG ACAGGGAAGT CAAAGAAGTA
TGGCGACAAA AGCTTGGGCG AATGGAAGGC TATTGGTTGG CGATGGGCCG TGGGCCTGAG
GCTTGGAATT CTACTTCACT TACCTATATG TAGTAGATTT TGCCATTTAT TATATGTAGA
ATGG
 
Protein sequence
MSPEKLTEEN ILDREETNIP STSSSILPAP ISYPATVSYR IANTWLRLIA SGIIAILLIH 
ILHIQLSPME AVEPAPPTPP PAAEAYVTFL AHSDDPRPWY FNAVRRLMFQ LKYDPLTLDP
HPRDFVVITT PGVPEWQLEQ LREEGAIIAS RPLIDHLPLP EKGISRYAEV YTKLFIFNLT
DYERVLFVDA DQLMVKPLTG IWDDPNAWPE SGMAACGESK SAWDHPTPIE DQNYFNSGFM
LARPDEKTFN ELLQEKDFDP WFPEQVRLVR VLS