Gene Information |
Locus tag | CND04790 |
Symbol | |
ID | 3257385 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1316872 |
End bp | 1318837 |
Gene Length | 1966 bp |
Protein Length | 405 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256415 |
Product | aerobic respiration-related protein, putative |
Protein accession | XP_570415 |
Protein GI | 58266518 |
COG category | [R] General function prediction only |
COG ID | [COG1100] GTPase SAR1 and related small G proteins |
TIGRFAM ID | |
|
|
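The Start bp, End bp, and Gene Length fields above are consistent with 1-based, inclusive coordinates (1318837 - 1316872 + 1 = 1966). The sketch below shows that check. The function name `gene_length` is hypothetical and is not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the three values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates.

    Assumption: both endpoints are counted, which is what the record's
    Start bp / End bp / Gene Length values imply.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1316872, 1318837)
print(length)  # 1966, matching the Gene Length field
```

Under a 0-based, half-open convention the same span would instead be computed as `end - start`, so it helps to confirm which convention a given export uses before comparing lengths.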
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.850973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCTGTATAA CCACCCTCCA GTTACAGCAA CAGCATGTCC GCCCCTGCCA CATCTGAAGC TGGACCTTCC AGCACCAACC CCCCTTCCGT CGAGGGCAAG AAACAGCCTG TCGTCATCCT CTGTATTGGC ATGGCGGGAT CGGTGTGTTT TCGTTCTAGC TTTTCCTCTT TGGTCCCCTC CACTAACAGC CTTCATTAAA ATACAGGGCA AAACGACCCT CATGCAACGA CTTAACTCCC ACCTCCATTC CAAGAATACC CCCCCGTATA TTCTCAATCT TGACCCTGCC GTAACGCACA TGCCTTACTC AGCGAACATT GACATCCGGG ATACGGTCGA TTACAAAGAA GTCATGAAGC AGTACAAGCT CGGCCCCAAC GGTGGTATCT TAACAGCGTT AAATCTGTTT ACCACAAAAT TTGATCAGGT ATTGGGGTAT GTGGAGAAGC GGGCGGAGAC TGTTGAGTGA GTGCATATGT CCCAATCAGT TGAGCCTGTG AACTGACAGA GACACAGCTA TATCCTTGTG GACACCCCAG GACAGATCGA AATCTTCACC TGGTCCGCGT CTGGTGCTAT CATAACGGAT GCTATTGCTT CTTCTCTTCC CACCGTCGTT GCCTACATTG TCGACACACC GCGAACTGCG TCCCCCGTTA CCTTCATGAG TAACATGCTT TACGCTTGTT CTATACTTTA CAAAACGAAA TTACCATTCA TCATAGTATT CAACAAAATC GATGTGCAAC CTCATGAGTT TGCCCTGGAC TGGATGACAG ACTTTGAAAA ATATCAAGAA GCGTTAAACG ATAAGAGCAG AGACGAGCAT GGGGAAGGAA GTTATGTGAA CAGCCTAATG TCAAGTATGA ACCTTGTTCT GGAGGAGTTC TATAACAATT TGAGGGTAAG CCGGATCAGC TAATTGGGCT CTGGGAAAGA ACTGATCAGT TTCAGGCGGT GGGTGTGAGC GCGATGACCG GAGAGGGTAT GAAGGCATTT TTTAGTGCGG TAGAAGAGGC CAGAAAGGAA TATGAGACGT ATGTTGTATT ATTGTTTAGC TCTTCGATTG GCTAATATGA TTTAGCGATT ACAAACCTGA ACTAGATCGT TTGGCAGCTG AGCGGGCCGC ACAGACCGAA GCAGACAAAA AAGCTCAGCT TGAGCGTCTT ATGCGGGATA TGAATATTTC GGACTCTCCT CGGTCCGGAC CTGGTGGTAA CCCCTTTGGT CCTCATGCCC GTAATGACCG TGAAGATCGA TACTATGATG ACGAGGGCGA AGCGAGTGAC ATCGACGAGC AGGAGCAAGA AGCTATCAGG AGACAAATGG AGGAAGAAGA AGAAGATGCA GAAGCGGAAG AATTGGGCAA GTTGGACGTA GAAGAGCCAG AGATCGGAAG CTTGGCCGGT GGAGCTGCTG CTGCTGCTGC GAGTCGGGGC GTAACATGGC CTGCGCCCAG ATAGATGCAA TGTGTGTACA ATTAGATAGG CATGTATGTG CATAAAAGCA ACAGTCTAGA ACGATCTAAC TTGATAAATC TCTAGGATGG GTGATCTATT TAGTCAAGTT CAGCCAAGTT CAGCTCTGGC AAGACGTCAC CGCAAGTCTT CCAGCCCTCC CAACCCATCG CAACTTTCTC AAGCTTTTCG CCAGCCACCT TCAGATCATC GTTGACAACG TAAACATCAT ATTTGCCCTC CTTCGCGTAC CTCAGCTCCT CCTTGGCAGC ATCGAGCCTC TTTCGGATGG AGGCGTCGGT CTCGGTACCT CGCCCAGACA AACGGGATTT AAGTTGAGAA 
ATGGAAGGCG GAGAAAGGAA GAGAAACACC GGTTCGAGGG GAGGGGTTTG GAGCGGGGCC TTAGCCTTTA GCTGAAGGAC ACCTTGCAGT TCAATGTCGA GAATGCATCG GCGGGGATGA AGGGCGGTCA AGGCAGCAAA TGTAGTTCCG TAACTTGAAG GTCTTT
|
Protein sequence | MSAPATSEAG PSSTNPPSVE GKKQPVVILC IGMAGSGKTT LMQRLNSHLH SKNTPPYILN LDPAVTHMPY SANIDIRDTV DYKEVMKQYK LGPNGGILTA LNLFTTKFDQ VLGYVEKRAE TVDYILVDTP GQIEIFTWSA SGAIITDAIA SSLPTVVAYI VDTPRTASPV TFMSNMLYAC SILYKTKLPF IIVFNKIDVQ PHEFALDWMT DFEKYQEALN DKSRDEHGEG SYVNSLMSSM NLVLEEFYNN LRAVGVSAMT GEGMKAFFSA VEEARKEYET DYKPELDRLA AERAAQTEAD KKAQLERLMR DMNISDSPRS GPGGNPFGPH ARNDREDRYY DDEGEASDID EQEQEAIRRQ MEEEEEDAEA EELGKLDVEE PEIGSLAGGA AAAAASRGVT WPAPR
|
| |
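The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how a GC percentage like the GC content field could be recomputed from such text. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the field is the plain fraction of G and C bases. Whether the database computes it over exactly the displayed sequence is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Assumption: GC content is (G + C) / total length, with no special
    handling of ambiguous bases such as N.
    """
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the Gene sequence field, used only as a small demo.
demo = "ATCTGTATAA CCACCCTCCA"
print(len(clean_sequence(demo)))  # 20
print(gc_percent(demo))           # 45.0
```

To apply this to the full record, pass the complete Gene sequence field (joined across any line breaks) to `gc_percent` and compare the rounded result with the GC content field.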