Gene Ndas_2228 details


Gene Information

Locus tag: Ndas_2228
Symbol:
ID: 9246078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2662372
End bp: 2663814
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Xanthine/uracil/vitamin C permease
Protein accession: YP_003680156
Protein GI: 297561182
COG category:
COG ID:
TIGRFAM ID:
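
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither assumption is stated in the record itself. The function name `check_cds_record` is hypothetical.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not from the record).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = gene_length_bp // 3          # whole codons, stop included
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values copied from the record for Ndas_2228.
result = check_cds_record(2662372, 2663814, 1443, 480)
print(result)
```

Under these assumptions all three checks pass for this record: the span is 1443 bp, which is 481 codons, and dropping the stop codon leaves 480 amino acids.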


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0279599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones field as listed: 
Fosmid unclonability p-value: 0.0422886
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGAC TCAAGGACTC CGGCCCGCAC ACGGGGGCGC GGAAGCCCAC GGCTCGATCC 
TGGCTCGACC GCTTCTTCTT CGTCAGCGAG CGCGGATCCA CCTTCGGGCG CGAGGTCCGC
GGCGGACTGA CCACCTTCAT GGCGATGGCC TACATCATCG TCCTGAACCC GATCATCCTC
AGCGGCGTCT CCGACGTGAA CGGCGACGTC CTGTCGGCGG GCCAGCTGAC CACCATGACC
GCCCTGTCCG CCGGGCTGGT CACGATCATG ATGGGGGTGG TCGGCCGGGC GCCGATCGCC
TGCGCCGCGG CCCTGGGGGT GATGGCGGTC GTGGCCTACC AGGCCGCGCC CGTGATGGCC
TGGCCCGAGG TGATGGGCCT GGTCGTGTGG CAGGGCGTGG CCATCATCCT CATGGTGGTG
ACCGGCGTGC GGACCGCGGT GATGAACGCG CTGCCGCACG ACCTCAAGAT GGCCATCGGC
GTGGGCATCG GCCTGTTCGT GGCCCTCATC GGGCTGGACA ACGCGGGCTT CGTCAGCGCC
GGGGAGGGCG GCGGCCTGCT CCAGATCGGC GCGGCCGGGG CCGGCGGCCA CCTCGACGGA
TGGCCGATCC TGGTCTTCGT GTGCGGCCTG GTGCTGGCCA GCGTCCTGCT GGTGCGCGGG
GTGCCCGGCG CGATCTTCTA CGGCATCGTC GGCGCGACGG TCCTGGCGAT CGCCGTGCAC
TACGCGGCGG GCCTGGACTC CCGGGACTGG GGCGGGGCCA GCCCCGAGCT GCCCGGCAAC
CCGTTCGCGG CCCCCGATTT CGGCCTGCTG CTGCGGGTGG ACATGTTCGG CGCCTGGACC
TCGGCCGGTG CGACCACGGC GGGCGTCATC CTGTTCACCC TGGTGCTGGC CGGGTTCTTC
GACGCCCTGG GCACCATCCT GGCCATCGGC ACCAAGGCCG ACATCGCCGA CGCCGACGGG
CACATGCCCC GGGTCAACCA GATCCTGGTG ACCGACGGCG CGGGCGCCGT GGCCGGGGGT
CTGACCAGCT CCTCGGCGAC GCTGGTGTTC GTGGAGTCCA CGGCGGGGGT GAGCGAGGGC
GCCCGGACCG GTCTGGCGAG CGTGGTGACG GGGCTGTTCT TCCTCGCGGC GATCTTCCTG
GCCCCGGTGT TCGGCGTGGT CCCGGCGCAG GCCGCGGCGG TGGCGATGGT GCTGGTGGGC
GCCATGATGA TGATGCACAT CAGGGAGATC GACTGGTCGG ACGTCGCCGT GGCGATCCCG
GCCTTCCTGA CCATCGCGAT GATGCCGTTC ACCTTCGACA TCGCCAGCGG GATCGGCATC
GGGATCATCT CCTACACGCT GGTCAGGTCG GCCCAGGGGC GCGTGCGCGA CGTGGGCTGG
CTGATGTGGG CGCTCTCGGC CGTGTTCGCG TTCCACTTCT CCATGCACGC GCTGGGGCTT
TGA
 
Protein sequence
MTRLKDSGPH TGARKPTARS WLDRFFFVSE RGSTFGREVR GGLTTFMAMA YIIVLNPIIL 
SGVSDVNGDV LSAGQLTTMT ALSAGLVTIM MGVVGRAPIA CAAALGVMAV VAYQAAPVMA
WPEVMGLVVW QGVAIILMVV TGVRTAVMNA LPHDLKMAIG VGIGLFVALI GLDNAGFVSA
GEGGGLLQIG AAGAGGHLDG WPILVFVCGL VLASVLLVRG VPGAIFYGIV GATVLAIAVH
YAAGLDSRDW GGASPELPGN PFAAPDFGLL LRVDMFGAWT SAGATTAGVI LFTLVLAGFF
DALGTILAIG TKADIADADG HMPRVNQILV TDGAGAVAGG LTSSSATLVF VESTAGVSEG
ARTGLASVVT GLFFLAAIFL APVFGVVPAQ AAAVAMVLVG AMMMMHIREI DWSDVAVAIP
AFLTIAMMPF TFDIASGIGI GIISYTLVRS AQGRVRDVGW LMWALSAVFA FHFSMHALGL
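
The sequences above are printed in space-separated blocks of ten characters, so they must be cleaned before use. The following is a minimal sketch of how one might strip the blocks, compute GC content, and translate the gene to compare it with the listed protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the code handles only unambiguous A/C/G/T bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCCGAC TCAAGGACTC CGGCCCGCAC ACGGGGGCGC GGAAGCCCAC GGCTCGATCC"
print(translate(first_line))  # MTRLKDSGPHTGARKPTARS
```

The translated prefix matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.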