Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00920 |
Symbol | |
ID | 3255720 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 257781 |
End bp | 258951 |
Gene Length | 1171 bp |
Protein Length | 353 aa |
Translation table | |
GC content | 51% |
IMG OID | 638254743 |
Product | hypothetical protein |
Protein accession | XP_569002 |
Protein GI | 58263184 |
COG category | [R] General function prediction only |
COG ID | [COG1100] GTPase SAR1 and related small G proteins |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.286854 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCATC CAGGGATGGC CGATCCGGCT ATCTCTATCA AACCTCAAAG TGAAATATCC CCAGAGGCCA GTTCCCTCAC AGCTTTGCTC GCTCATCCGC TCCTCCAGGA TCCCAAGTTT GTGGCTGCAG CTGGTGGACT GGCACTGTTA CTGCTTTTTC TTACCCGTGA GTGGGGAATG CTATTCCGAT GCGATATGCG TCACTTATCT TTTGGAATAG TTTTCCGTCA GGGCAAGAAA ACTCACAAGC GGAATGGTCC TGCAACTGTC CTTCTCGTCG GACCGTCCGA CGGAGGCAAG ACTAGTTTGT TTACCAAGGT TATTCAGCGC GGTGGGCCTC CATGTAGTCT ATCAATACTA ATGCTCTTTT GTAGCTAATT CATGACATCT ATCCCCAAAC CCACACCTCG ATTGTTCCCT CTGACACCAC TTTCGATTTT GACTCGCCAT ATGAAGACGA CCAAAAGAAA CAGATCCGCT TGATCGATAT CCCTGGACAT CCTAGACTGC GAGACGAAGT CAAGAAATAC ATTGCTGACT CTGCGGGAGT TGTATTTGTG GTGGATATCC AAGGCATCGT CCGCAACGCG TCAGGCGTAG CCGAGTGGGT GTTCTGTTCC TTCATCGTTA TGTTTCCGTG GTTAAAGATC TTTCCAGACA ACTCCCTCCT ATTCTCACAG CACTTTCCAA TATTTCTTCT CGACTTCCTC CTTCGGCTCC TCCTCCCAAA TTGCTCTTGC TCGCCCACAA GGCCGACCTT CTCGCTCGCC CCACGCCCTC GCCCAGCCAC TGCCCTCCCG AAATCCCCTC TTCTACCCTC ACAACCTCCA CCGACCGTCT CAAATCTATT CTCACCCGAG AAATGGACAG ACTCAAGTCT ACGCGTGCAG GAACAGGTGG GAAGATTGAG GGGATCGGAA AAGTTGCTGG GACGTCAGGT GGTTTCTTCA GCAAGCTGTT TGGCGGAGGA GCCGGGGATG TTGCGGGAGA AGATGAAGGT GATGACGATG AGAGCCTTAT CTGGGGTGGG AAAGGGCCCT TTAAATGGGA AGATGTGGAG GGCGTTGAAG TTGAGTGGGG AGCGAGCGGA TTAGGCTCGA CTAAAGGGAA GACAGAAGCA GAGAGTGGTA ATGGCTTGGA TGAGCTGAAG GCATTTTTGT GGGACATCTA A
|
Protein sequence | MSHPGMADPA ISIKPQSEIS PEASSLTALL AHPLLQDPKF VAAAGGLALL LLFLTREWGM LFRCDMRHLS FGIVFRQGKK THKRNGPATV LLVGPSDGGK TSLFTKLIHD IYPQTHTSIV PSDTTFDFDS PYEDDQKKQI RLIDIPGHPR LRDEVKKYIA DSAGVVFVVD IQGIVRNASG VAEQLPPILT ALSNISSRLP PSAPPPKLLL LAHKADLLAR PTPSPSHCPP EIPSSTLTTS TDRLKSILTR EMDRLKSTRA GTGGKIEGIG KVAGTSGGFF SKLFGGGAGD VAGEDEGDDD ESLIWGGKGP FKWEDVEGVE VEWGASGLGS TKGKTEAESG NGLDELKAFL WDI
|
| |