Gene CNG00550 details


Gene Information

Locus tag: CNG00550
Symbol:
ID: 3258521
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 145618
End bp: 147408
Gene Length: 1791 bp
Protein Length: 398 aa
Translation table:
GC content: 49%
IMG OID: 638257670
Product: arginase, putative
Protein accession: XP_571761
Protein GI: 58269210
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01229] arginase
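
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive, 1-based coordinates (an assumption, although it reproduces the reported 1791 bp). The function names are illustrative and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span whose start and end are both included (assumed)."""
    return end_bp - start_bp + 1


def min_coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus a single stop codon."""
    return 3 * (protein_length_aa + 1)


# Values copied from the Gene Information fields.
gene_length = span_length(145618, 147408)
coding = min_coding_bases(398)
# Bases in the gene span that do not encode the protein.
noncoding = gene_length - coding

print(gene_length, coding, noncoding)
```

The difference between the gene span and the coding bases is consistent with the gene being marked as spliced, since it would be made up of introns and any untranslated flanking sequence. The record itself does not give exon coordinates, so the split between the two cannot be derived from these fields alone.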


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTGCTCGTC ATCCAATATC AGGTATACAT TTCCACTTCT GAACAGCACC ACCACAATCG 
AAATCAACTC TAATAACTGT CATCACTATG CTCTCTCGCG TCTCTCCATC TCTTCGTCAA
GTCCTTGCTG CCCCATTGAG GGCATCAAGG ATGGGCACTT CCCCTATGCA AAAAGCGTTC
ACCCACTCAT CAGCACCTAG AAACCATCAT ATTACCGCTC AGCCTAGTAG TGAGAAGGGT
CTTACCTCCC CTACTTACAA TTACAAGTTT TTGAACGAGC CTGCTACTGC TTCTGTACGT
ATCGTTTGCT CCGTGCAAGT AGTCGAACCT AGCTCATAAA TAGTGATGAT AGCTTGTCGG
ATGCCCTTTT AGGTAAGCAC GCTATCTGCA CTTTCTATTC AACTTCTTTA GGGAGCTTAT
TTCGGTTTAA GCGGTGGACA AGGTCGAGCT GGTGTCGATC TCGCCCCAAA CAAGCTCGTC
TCTGCCGGTC TTGTAGAGCA AATCTCGGCC CTTGGCTGGA ACGTCCATTA CGAATCTCAC
CAAAACTTCC TCGACATCCC TTACAACCCC CTTCCTTCTT CTTCCCCCGT CACTTCAACC
GAAGGCCCCT CTACCCACAC TACGACCGCT CAGGGCGAAA AGATGGTCCA GAGGTTGCCG
GATCCTGATA TTGGAAGCAT GAAAAAACCT AGGTTAGTCA GTGCGGTGAA CGAAATGGTA
GCCAAGGAGG TTGGGGATAT CGCGGAGAAG GGGTGGTTGC CTGTGACTTT GGGAGGCGAC
CACAGTTTGG CGATGGGTAC TATTGCTGGT ACTAAGCGCA AGTACCCTAA CGCTGGTGTG
ATCTGGGTAG GCTTTCAAGA TCCGGTAAGT GAGGTGACAT ATGGCTGATG TCCTATATTT
AGGTTGATGC TCACGCCGAT ATCAACACCC CCTTGACCAC CGAGTCTGGC AATCTCCACG
GCTGCCCTGT TTCTTTCCTC TTGGGTTTGG ATGGCTGTGA CGTCGAGCCT TTCAACAAGT
GGCTTAAACC TTGCCTCAAG CCTGAAGATA TGTATGTCTT AATGTCCCCC ATTTTTACTT
CTTGAATAAT GACTGATTGG ACGCCATCTA CTACTTAGCG TCTACATCGG TCTCCGGGAC
ATTGACGACG CCGAGAAGAA GATTCTGAAG GAAAATAACA TCAAGACTTT CACTATGCAC
CACGTCGACA GGCACGGTAT TGGCAAAGTC ATGGAACTTG CGCTTCAGCA TATCAACCCC
AATGGTGACC GACCCCTTCA CCTCAGTTTC GACGTTGACG CTCTTGACCC CACAGTTGCT
CCTAGTACGT CCCTTCCCCA ACAGTATTTG GGTAGACGAG GAAGGGCTAA CGTGGATGAT
GAAGGCACAG GTACCCCCGT TCGAGGTGGT CTGACTTTCC GAGAAGGTGA GTCATTTTTG
GTCACCACAG AACGTGTCTG ACTCGTTGTT TAGGCCATTA CATTACCGAG GTTGTTGCTG
AGACCGGCTG CCTCGTCGCT TTGGACATCA TGGTACGTCT GCATGACTTC TGCGCTTCAG
ATGAGCTGAG ATACTCCACT CATTTATTAG GAGGTCAACC CTTCCCTGCT TGACCCCAGG
TCCGTCGAGA TGACTGTCGC TGCTGGCTGT TCCCTCACGC GGGCATCTTT GGGTGAGACT
TTGTTGTAAG AGAAGGGGGA TTGGTGTCTA TGGAGTAAGA AGACATTTAC ACCACACGTG
TAAAACTGTA TAATATATGG CTTTTAAAGT TTAACAGGAC AATACATAAT T
 
Protein sequence
MLSRVSPSLR QVLAAPLRAS RMGTSPMQKA FTHSSAPRNH HITAQPSSEK GLTSPTYNYK 
FLNEPATASL VGCPFSGGQG RAGVDLAPNK LVSAGLVEQI SALGWNVHYE SHQNFLDIPY
NPLPSSSPVT STEGPSTHTT TAQGEKMVQR LPDPDIGSMK KPRLVSAVNE MVAKEVGDIA
EKGWLPVTLG GDHSLAMGTI AGTKRKYPNA GVIWVDAHAD INTPLTTESG NLHGCPVSFL
LGLDGCDVEP FNKWLKPCLK PEDIVYIGLR DIDDAEKKIL KENNIKTFTM HHVDRHGIGK
VMELALQHIN PNGDRPLHLS FDVDALDPTV APSTGTPVRG GLTFREGHYI TEVVAETGCL
VALDIMEVNP SLLDPRSVEM TVAAGCSLTR ASLGETLL
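
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before lengths or base composition can be computed. The following is a minimal sketch of that clean-up, with hypothetical helper names. How the record rounds its GC content value is not stated, so the percentage here is returned unrounded.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, pasted exactly as it is displayed.
first_line = (
    "CTTGCTCGTC ATCCAATATC AGGTATACAT "
    "TTCCACTTCT GAACAGCACC ACCACAATCG "
)
joined = clean_sequence(first_line)
print(len(joined), gc_percent(joined))
```

Pasting the full gene sequence and the full protein sequence into `clean_sequence` gives strings whose lengths can be compared with the Gene Length and Protein Length fields.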