Gene CNF03410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03410 
Symbol 
ID3258378 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp1005255 
End bp1007182 
Gene Length1928 bp 
Protein Length534 aa 
Translation table 
GC content49% 
IMG OID638257459 
Productanthranilate synthase, putative 
Protein accessionXP_571668 
Protein GI58269024 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAACTCGCA AAATGGACCC CTCAGTAAGC ATTTCAACAC CATTGCTGTT ACGTGGATGC 
TTACTCTTCG TCAGGTTTCG AAGCCTACGC CCTCTCTTGA GGAGCTTACA AACCTCTTCG
CCACGGCCTC CTCGTCTACC ACCACTCTCA CTTCTCGATC CGCCACCTTA TTTCCCACTC
CCAATGCCGA ACCCTCAAAA CCCGTCGAGC CCGCGAAGCC TAACCTCATT CCTATCTATG
TTGAAATTCC TGCCGATTTG CTCACCCCCG TTTCGGCCTA TTTAAAGATT GCAAAGGATG
AAAAGTACAG CTACTTATTG GAGAGTGTTG TTGGTGGAGA AAGCTTGGCC AGATATAGTT
TTGTCGGTTC TAGTGAGTCC ATCAACGTAG TTTGCCAACC ATTCAGACTG ACATACTTGT
AAGACCCTTT CAAAACCATC AAAACCGGCG CGGGAGAAGA AGTCGAGGGT GACCCCCTGG
AAGCTTTGGA GAAGGAACTT GAGCCCTACA GATTCGCTAA GATCCCTGAA ATCTCTGCCT
TCACTGGAGG TGCCGTTGGT TTTATTACCT ACGATGCTAT CAACCATTTT GAACCCGTCA
CCACTCCCGC CACACCTCTT CACAACCCTA TCCCTGGCAT GCCTGAGGCT TGTTTCATGC
TTTTCTCTAC CAATATCATC TTTGACCACA TCTACCAGAC AGTCAAGATA GTGTCTCATG
TCTACCTCCG CGACGGTACA CCCGCTTCCC AAATCCCTTC TCTTTACGAT GAAGCCTCAG
CCAGAATTGA GAGTGCCCGA CGTAAGCTCA TGAACCCCGA AACCCCCATG CCTCACCAAG
GGCCTATCAC TCTTGGTAAC CAGTCCGAGA GCAATGTTGG AAAGGCCGGA TACGAAGGTT
TCGTTACCAA GCTCAAGGAG CACATTGTTA AGGGCGATAT CATCCAGGCT GTGCCCTCGC
AAAGACTGAC TAGAGAGACT GCTTTGCATC CGTTCAATGT GTACAGGCAC TTGAGAAGAT
TGAACCCCAG TCCTTACATG TTCTACTTGG ACTGCGGAGA TGTTCGATTA GTGGGCGCAA
GTCCAGAGAC GCTGTGTAAG GTCGAGGGAA GAAAAGTGTA CAACCACGCT ATTGCAGGTA
CTGTTAAACG AGGGAAGACC GCGGAGGGTA TGTCCATTTC TTCAAGCGGT TGGATGTATA
GGCTGACATT GTATTGTAGA GGACGCCGTC CTTGGTGCCG GACTTCTTGC CTCTGACAAG
GACCGAGCGG AGCACATCAT GCTTGTCGAC CTTGCCAGAA ACGATGTCAA TAGAATTTGC
AAGCCCGAGA CCGTTAATGT TGACAACCTT ATGCAAGTCG AAAAGTTCAG TCACGTTATA
CACTTGACAA GTCAGATCAG CGGTATGCTG AGGGATGACC AATCTAGGTA AGCTCTCAAT
CTGCTTGGAC GAATTTGCAA GAAGACCGAC ATTACGCAGG TTCGACGCCT TCCGATCCAT
CTTCCCTGCC GGTACCGTCT CTGGCGCTCC CAAGATCAAG GCAGTCCAAC TCATTTCTGG
TCTTGAGAAG GAGCGACGTG GTGTCTACGC CGGTGCGGTT GGTCGATTCG ACTTTGACAG
GGACAATCTC GATACCTGTA TCGCCATCCG AACAATGACA TTTAAGGATG GAAAGGTGTT
CTTGCAGGCA GGTGGAGGTA TCGTCTTTGA CAGTGTGGAA GAAGATGAGT TTGTGGAGAC
CATTAACAAG TTGGGGGCGA ATGTCAAATG TATCGAAGAG GCTGAGAGTG AGTCGATTTT
TATCTTTTTG CTTTGTTCCG GATTTATCGT GGTGCTGATC AGACGTTGCA GAGTACTACG
CGAGGTTGCA AGGACAGAAC GTGTAAAAAT TTTTAGACAA TATAGAAGCC CATGAACCTG
TGCATTAC
 
Protein sequence
MDPSVSKPTP SLEELTNLFA TASSSTTTLT SRSATLFPTP NAEPSKPVEP AKPNLIPIYV 
EIPADLLTPV SAYLKIAKDE KYSYLLESVV GGESLARYSF VGSNPFKTIK TGAGEEVEGD
PLEALEKELE PYRFAKIPEI SAFTGGAVGF ITYDAINHFE PVTTPATPLH NPIPGMPEAC
FMLFSTNIIF DHIYQTVKIV SHVYLRDGTP ASQIPSLYDE ASARIESARR KLMNPETPMP
HQGPITLGNQ SESNVGKAGY EGFVTKLKEH IVKGDIIQAV PSQRLTRETA LHPFNVYRHL
RRLNPSPYMF YLDCGDVRLV GASPETLCKV EGRKVYNHAI AGTVKRGKTA EEDAVLGAGL
LASDKDRAEH IMLVDLARND VNRICKPETV NVDNLMQVEK FSHVIHLTSQ ISGMLRDDQS
RFDAFRSIFP AGTVSGAPKI KAVQLISGLE KERRGVYAGA VGRFDFDRDN LDTCIAIRTM
TFKDGKVFLQ AGGGIVFDSV EEDEFVETIN KLGANVKCIE EAEKYYARLQ GQNV