Gene CNM01450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNM01450 
Symbol 
ID3255255 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006682 
Strand
Start bp422822 
End bp424575 
Gene Length1754 bp 
Protein Length425 aa 
Translation table 
GC content47% 
IMG OID638254298 
Productconserved hypothetical protein 
Protein accessionXP_568455 
Protein GI58262090 
COG category[L] Replication, recombination and repair 
COG ID[COG3145] Alkylated DNA repair protein 
TIGRFAM ID[TIGR00568] DNA alkylation damage repair protein AlkB 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTACCACGA CCAACACAGC TTGCTTTATA ATACATTTAA TACATCGCAA ATACCAAATT 
TCACTAGCAT GGAAAATGCC AGCTCCGCCT TAACCGCTTT CAGGCAAGCG GAGAAGCACT
TCAAGAATCG AGCTAATAAA GATATCTACC CATCACTTCG TCAATGGCAA GACCGTTTGA
TCGACTTGTC CCGGCCAGAT TCTCAAGAGG AGGATGAAAT ATGGGCCGCT GGGTGGTGGA
GTCCTGATCA TGACGTTGTG CCAGCAGCTA CAAGTGGCAG ACGGAGAAAG GGGGTGGAAA
AAAAGGATAA AGGGGAGAGA CCAGAGCTGG ATATCGCGAG TCTTGAATCT TTGTCTTTAC
ACGGTGGGAA GACTGGATAT ATCGTCGCTC CAGGTGCGTG TGCCCCTCCC CATCTGAGAG
ACCGCTGTAA CAACTTTTGT GACACAGGAT GTGTTCTTAT ACCTGGCTAC CTCACGGTCG
AACAGCAACT TTCCTTCCTG CATGATTCCC TTGCCCGATA CACTCTCCCA CCTAACCCTC
TCTCGCTTAG CACTCATTAC GATCTTCCTC CCAACCTTTT CTCCTTATTT GTCTCAAACC
CGGAAGCGAC CGTTCTTCCG AAACACATGA CTGGCACAGT CAACCCCGAA GCACTTGCCT
CCGCTTCCCA ACCAAAGAGC AGGAAATTGA ATGATACAGA ACCGGCATCA GTGATAGGGT
ATGAAGAGAT TGTAGCTCGA AATAAAGCTT GGACAGGGGA TTTGCCTAGC GACAAGCTGG
GAGCAAAAGA GGTGAGGAAG CTTTGGAAGG AAATTCGCTG GGCGAATCTG GGATGGGTAT
ATCAAGTAAG TTTCATTTTT GTTGTCAATA GTCTGCAATG TAACTTTGCT CATCATCATT
TGATTAATTA GTGGTCGACA AAATCGTATG ATTTCGCACC AGAAACCCCA ATACCTTTCC
CCGCTCCGCT CGCCGATCTT TGCTCCGAAG CAGTAGCATC AGTGCCGTGG GAGAATGTGT
TCTCTTCAGT ATCGGATCCA GACGCTTCAA CATATGGTTG GCAATCTTGG CCAAGAGATT
ACAGTACGTC TTCTCACATT AATTGAATCG CTTTGAAATC AAATCTGACT TGTGGTGATA
GAGCCTGATA CGGGCATTGT CAACTTTTAT CAGCTGAATG ATACACTCAT GGCACACGTC
GATCGTGCAG AGTGAGTTTG ACAAACCCCA AAGAGTCACA ACCATCTTGC TGACCCATAG
AAACAGACTA GATCCCGCTC GACCGCTGGT TTCAGTCTCG TAAGTATTCG GATCCATCCT
CTCTCCCACT CTCTTCTCGC TGGTCCTCGT ATAACTAACT CCAATCCAGT TTGGGGCACG
CTGCAATCCT TCTTTTGGGT TCTGACTCTC GTGATGAAGT CCCTAGACCG ATAATACTTC
GTTCCGGCGA TATGCTGATC ATGAGCGGTA AAGGCAGACA GTCTTATCAT GGTAAGCTAC
TTTTCTTAAA TACCTGTCTC CCCAGTCCTC TGGACCGTCC TGAAGGAAGA TGCTTATAAA
CGCTATGCTG GACAGGTGTA CCCCGTATCC TGGAAGGGAG CCTTCCATCA CATTTCTTGG
TACAGGAAAG TGACTCTGAG GAGATGAAGG CAGCGAAGAA TTGGATAAGT ACAGCTAGGA
TTAACATCAA TGCTAGACAA GTCTTTCCAC CAGGTTTCAA AAGAGTAAAT TGACTAGCAT
CACACATCCG AATA
 
Protein sequence
MENASSALTA FRQAEKHFKN RANKDIYPSL RQWQDRLIDL SRPDSQEEDE IWAAGWWSPD 
HDVVPAATSG RRRKGVEKKD KGERPELDIA SLESLSLHGG KTGYIVAPGC VLIPGYLTVE
QQLSFLHDSL ARYTLPPNPL SLSTHYDLPP NLFSLFVSNP EATVLPKHMT GTVNPEALAS
ASQPKSRKLN DTEPASVIGY EEIVARNKAW TGDLPSDKLG AKEVRKLWKE IRWANLGWVY
QWSTKSYDFA PETPIPFPAP LADLCSEAVA SVPWENVFSS VSDPDASTYG WQSWPRDYKP
DTGIVNFYQL NDTLMAHVDR AELDPARPLV SVSLGHAAIL LLGSDSRDEV PRPIILRSGD
MLIMSGKGRQ SYHGVPRILE GSLPSHFLVQ ESDSEEMKAA KNWISTARIN INARQVFPPG
FKRVN