Gene CNB00920 details

Gene Information

Locus tag: CNB00920
Symbol:
ID: 3255720
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 257781
End bp: 258951
Gene Length: 1171 bp
Protein Length: 353 aa
Translation table:
GC content: 51%
IMG OID: 638254743
Product: hypothetical protein
Protein accession: XP_569002
Protein GI: 58263184
COG category: [R] General function prediction only
COG ID: [COG1100] GTPase SAR1 and related small G proteins
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
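
The length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based inclusive coordinates, and that the gene span runs from the start codon through the stop codon, so that any bases not accounted for by the protein are non-coding (presumably intronic, since the gene is marked as spliced). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting a stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
START_BP = 257781
END_BP = 258951
PROTEIN_AA = 353

length = gene_length(START_BP, END_BP)   # 1171, matches "Gene Length"
cds = coding_bases(PROTEIN_AA)           # 1062 = 3 * (353 + 1)
non_coding = length - cds                # 109 under the assumptions above

print(length, cds, non_coding)
```

Under these assumptions, about 109 of the 1171 bp do not code for protein, which is consistent with the "Is gene spliced" field. The record itself does not list exon boundaries, so this is only an estimate.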


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.286854
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCATC CAGGGATGGC CGATCCGGCT ATCTCTATCA AACCTCAAAG TGAAATATCC 
CCAGAGGCCA GTTCCCTCAC AGCTTTGCTC GCTCATCCGC TCCTCCAGGA TCCCAAGTTT
GTGGCTGCAG CTGGTGGACT GGCACTGTTA CTGCTTTTTC TTACCCGTGA GTGGGGAATG
CTATTCCGAT GCGATATGCG TCACTTATCT TTTGGAATAG TTTTCCGTCA GGGCAAGAAA
ACTCACAAGC GGAATGGTCC TGCAACTGTC CTTCTCGTCG GACCGTCCGA CGGAGGCAAG
ACTAGTTTGT TTACCAAGGT TATTCAGCGC GGTGGGCCTC CATGTAGTCT ATCAATACTA
ATGCTCTTTT GTAGCTAATT CATGACATCT ATCCCCAAAC CCACACCTCG ATTGTTCCCT
CTGACACCAC TTTCGATTTT GACTCGCCAT ATGAAGACGA CCAAAAGAAA CAGATCCGCT
TGATCGATAT CCCTGGACAT CCTAGACTGC GAGACGAAGT CAAGAAATAC ATTGCTGACT
CTGCGGGAGT TGTATTTGTG GTGGATATCC AAGGCATCGT CCGCAACGCG TCAGGCGTAG
CCGAGTGGGT GTTCTGTTCC TTCATCGTTA TGTTTCCGTG GTTAAAGATC TTTCCAGACA
ACTCCCTCCT ATTCTCACAG CACTTTCCAA TATTTCTTCT CGACTTCCTC CTTCGGCTCC
TCCTCCCAAA TTGCTCTTGC TCGCCCACAA GGCCGACCTT CTCGCTCGCC CCACGCCCTC
GCCCAGCCAC TGCCCTCCCG AAATCCCCTC TTCTACCCTC ACAACCTCCA CCGACCGTCT
CAAATCTATT CTCACCCGAG AAATGGACAG ACTCAAGTCT ACGCGTGCAG GAACAGGTGG
GAAGATTGAG GGGATCGGAA AAGTTGCTGG GACGTCAGGT GGTTTCTTCA GCAAGCTGTT
TGGCGGAGGA GCCGGGGATG TTGCGGGAGA AGATGAAGGT GATGACGATG AGAGCCTTAT
CTGGGGTGGG AAAGGGCCCT TTAAATGGGA AGATGTGGAG GGCGTTGAAG TTGAGTGGGG
AGCGAGCGGA TTAGGCTCGA CTAAAGGGAA GACAGAAGCA GAGAGTGGTA ATGGCTTGGA
TGAGCTGAAG GCATTTTTGT GGGACATCTA A
 
Protein sequence
MSHPGMADPA ISIKPQSEIS PEASSLTALL AHPLLQDPKF VAAAGGLALL LLFLTREWGM 
LFRCDMRHLS FGIVFRQGKK THKRNGPATV LLVGPSDGGK TSLFTKLIHD IYPQTHTSIV
PSDTTFDFDS PYEDDQKKQI RLIDIPGHPR LRDEVKKYIA DSAGVVFVVD IQGIVRNASG
VAEQLPPILT ALSNISSRLP PSAPPPKLLL LAHKADLLAR PTPSPSHCPP EIPSSTLTTS
TDRLKSILTR EMDRLKSTRA GTGGKIEGIG KVAGTSGGFF SKLFGGGAGD VAGEDEGDDD
ESLIWGGKGP FKWEDVEGVE VEWGASGLGS TKGKTEAESG NGLDELKAFL WDI
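
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before they can be compared with the length and GC content fields. The following is a minimal sketch of that cleanup, not the method used by the database. The helper names are hypothetical, and GC content is computed here as the fraction of G and C bases over the whole gene sequence, which may differ from how the reported value was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous A/C/G/T sequence)."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First display line of the gene sequence, copied from the record above.
first_line = "ATGTCTCATC CAGGGATGGC CGATCCGGCT ATCTCTATCA AACCTCAAAG TGAAATATCC"

seq = clean_sequence(first_line)
print(len(seq))                    # 60: six blocks of ten bases
print(round(gc_fraction(seq), 2))  # GC fraction of this excerpt only
```

Applied to the full gene sequence and the full protein sequence, `clean_sequence` should give lengths of 1171 and 353 if the record is internally consistent, and `gc_fraction` can then be compared with the reported GC content.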