Gene Information |
Locus tag | Ndas_2228 |
Symbol | |
ID | 9246078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2662372 |
End bp | 2663814 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Xanthine/uracil/vitamin C permease |
Protein accession | YP_003680156 |
Protein GI | 297561182 |
COG category | |
COG ID | |
TIGRFAM ID | |
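
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon. Both assumptions fit the reported numbers but are not stated in the record. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2662372, 2663814)
print(length)                  # 1443
print(protein_length(length))  # 480
```

Both values match the Gene Length (1443 bp) and Protein Length (480 aa) fields.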
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0279599 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0422886 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAC TCAAGGACTC CGGCCCGCAC ACGGGGGCGC GGAAGCCCAC GGCTCGATCC TGGCTCGACC GCTTCTTCTT CGTCAGCGAG CGCGGATCCA CCTTCGGGCG CGAGGTCCGC GGCGGACTGA CCACCTTCAT GGCGATGGCC TACATCATCG TCCTGAACCC GATCATCCTC AGCGGCGTCT CCGACGTGAA CGGCGACGTC CTGTCGGCGG GCCAGCTGAC CACCATGACC GCCCTGTCCG CCGGGCTGGT CACGATCATG ATGGGGGTGG TCGGCCGGGC GCCGATCGCC TGCGCCGCGG CCCTGGGGGT GATGGCGGTC GTGGCCTACC AGGCCGCGCC CGTGATGGCC TGGCCCGAGG TGATGGGCCT GGTCGTGTGG CAGGGCGTGG CCATCATCCT CATGGTGGTG ACCGGCGTGC GGACCGCGGT GATGAACGCG CTGCCGCACG ACCTCAAGAT GGCCATCGGC GTGGGCATCG GCCTGTTCGT GGCCCTCATC GGGCTGGACA ACGCGGGCTT CGTCAGCGCC GGGGAGGGCG GCGGCCTGCT CCAGATCGGC GCGGCCGGGG CCGGCGGCCA CCTCGACGGA TGGCCGATCC TGGTCTTCGT GTGCGGCCTG GTGCTGGCCA GCGTCCTGCT GGTGCGCGGG GTGCCCGGCG CGATCTTCTA CGGCATCGTC GGCGCGACGG TCCTGGCGAT CGCCGTGCAC TACGCGGCGG GCCTGGACTC CCGGGACTGG GGCGGGGCCA GCCCCGAGCT GCCCGGCAAC CCGTTCGCGG CCCCCGATTT CGGCCTGCTG CTGCGGGTGG ACATGTTCGG CGCCTGGACC TCGGCCGGTG CGACCACGGC GGGCGTCATC CTGTTCACCC TGGTGCTGGC CGGGTTCTTC GACGCCCTGG GCACCATCCT GGCCATCGGC ACCAAGGCCG ACATCGCCGA CGCCGACGGG CACATGCCCC GGGTCAACCA GATCCTGGTG ACCGACGGCG CGGGCGCCGT GGCCGGGGGT CTGACCAGCT CCTCGGCGAC GCTGGTGTTC GTGGAGTCCA CGGCGGGGGT GAGCGAGGGC GCCCGGACCG GTCTGGCGAG CGTGGTGACG GGGCTGTTCT TCCTCGCGGC GATCTTCCTG GCCCCGGTGT TCGGCGTGGT CCCGGCGCAG GCCGCGGCGG TGGCGATGGT GCTGGTGGGC GCCATGATGA TGATGCACAT CAGGGAGATC GACTGGTCGG ACGTCGCCGT GGCGATCCCG GCCTTCCTGA CCATCGCGAT GATGCCGTTC ACCTTCGACA TCGCCAGCGG GATCGGCATC GGGATCATCT CCTACACGCT GGTCAGGTCG GCCCAGGGGC GCGTGCGCGA CGTGGGCTGG CTGATGTGGG CGCTCTCGGC CGTGTTCGCG TTCCACTTCT CCATGCACGC GCTGGGGCTT TGA
|
Protein sequence | MTRLKDSGPH TGARKPTARS WLDRFFFVSE RGSTFGREVR GGLTTFMAMA YIIVLNPIIL SGVSDVNGDV LSAGQLTTMT ALSAGLVTIM MGVVGRAPIA CAAALGVMAV VAYQAAPVMA WPEVMGLVVW QGVAIILMVV TGVRTAVMNA LPHDLKMAIG VGIGLFVALI GLDNAGFVSA GEGGGLLQIG AAGAGGHLDG WPILVFVCGL VLASVLLVRG VPGAIFYGIV GATVLAIAVH YAAGLDSRDW GGASPELPGN PFAAPDFGLL LRVDMFGAWT SAGATTAGVI LFTLVLAGFF DALGTILAIG TKADIADADG HMPRVNQILV TDGAGAVAGG LTSSSATLVF VESTAGVSEG ARTGLASVVT GLFFLAAIFL APVFGVVPAQ AAAVAMVLVG AMMMMHIREI DWSDVAVAIP AFLTIAMMPF TFDIASGIGI GIISYTLVRS AQGRVRDVGW LMWALSAVFA FHFSMHALGL
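
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and the demo string is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a short demo.
demo = clean_sequence("ATGACCCGAC TCAAGGACTC")
print(len(demo))               # 20
print(demo.startswith("ATG"))  # True
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence gives a value that can be compared with the reported GC content of 71%, and `len` of the cleaned sequences can be compared with the Gene Length and Protein Length fields.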