Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0796 |
Symbol | |
ID | 9244641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 977928 |
End bp | 979178 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | UDP-glucuronosyl/UDP-glucosyltransferase |
Protein accession | YP_003678746 |
Protein GI | 297559772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.702052 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGTTA TATTGTTCGC CCCCGAGACA TTCAATCTCG CGGAGACAAC CCGGTCCATC GAGGTGGCGA AACACCTCCG TGAAACATAT GAATGCGTCT TTTCCGGGTA TTCCGAAAGA TACTCCGGCC TCATTGAAGA GGCCGGGTTC ACGTTTCACC GCCTGGCCCC CGCGCTGACC GACGAGGACG CGGACCAGCT GATCCGCGTC GACCAGGGGA AGGCGGTCCG GCACCCCTTC ACCGCCGCGA TGCTCCGCAC CCGCGTCGCC AGTGAACTCG CCCTCATCGG CACGCTCCGC CCGGCGGCGG TCGTGATCGG CACGACGCTC AGCCAGTTCG TCTCCGCCCG TGCCGCCGGC GTGCCGCTCG TCTACGTCAA GCCCTTCGCC TACAGCTGGC CCCACATCCT CCAGACGCGG TCGCTGCCGC TCGCCGAAGG GGACGGCCCG CTCCCCCGGG CGGTCAACAC CGGCGCCGCC GCGCTCCTGC GGGAGGCCGC CCGGGTGACC ACCTACAAGC CCGCGGCCTT CCGGGCAGTC GCGCGCGAGC ACGGGGTCAG GCTGCCCGGT CGCACCATCC AGGCGCTCGA CGCCGACCTC AACCTGATCA CCTCCCTCTC CTGCTACCTG CGCCCCTACC GGATGCCCGC GAACTACCGC CTGGTCGGCC CGGTCTTCGC CCGGATCGAC CGGGAGATCC CTCCGGACGT GGTCCGCGTC GCCGAGGCCT CGGCGCGGGC GAAGCGCCCG GTGGTCTACT TCGCCATGGG CAGTTCCGGG AACCGGGAAG TGGTGCTCCG GGTCCTCACC GAGCTCTCCC GGATGCCCGT CACCGTCATC GCCCCCGTCG CCTCCTACCT GGAGGAGAGC GACCTCCCCC AGGTCGCGGA CAACATCCAC GTGCGCGACC TCCTCCCCGC ACACCTGCTC GGCGACCTCA TCGACGCCTC CGTGATCCAC GGGGGCGAGG GGACCGTGCA GACCGCGGTC ACGACCGGAA AACCCTTCGT CGGAATCGGC CTGCAGATGG AGCAGCGGTG GAACGTGGCG GACTGCGTCC GCTTCGGAAA CGCCGTCGCC GTCTCCCCCA AGGACGTTTC CGGAGCCTCT TTCCGCAATG CCGTGGAGAA GGTCCTCACG GACCCCCGCA CCCGTTCCCG CGCACGCACC CTCCGCGAAC TCCTCTCCGG AGTCGACGGG GCGGCGTCAG CAGCGGAACA CATTCACGAA CACGTGTCTC AGAAGCCGTG A
|
Protein sequence | MRVILFAPET FNLAETTRSI EVAKHLRETY ECVFSGYSER YSGLIEEAGF TFHRLAPALT DEDADQLIRV DQGKAVRHPF TAAMLRTRVA SELALIGTLR PAAVVIGTTL SQFVSARAAG VPLVYVKPFA YSWPHILQTR SLPLAEGDGP LPRAVNTGAA ALLREAARVT TYKPAAFRAV AREHGVRLPG RTIQALDADL NLITSLSCYL RPYRMPANYR LVGPVFARID REIPPDVVRV AEASARAKRP VVYFAMGSSG NREVVLRVLT ELSRMPVTVI APVASYLEES DLPQVADNIH VRDLLPAHLL GDLIDASVIH GGEGTVQTAV TTGKPFVGIG LQMEQRWNVA DCVRFGNAVA VSPKDVSGAS FRNAVEKVLT DPRTRSRART LRELLSGVDG AASAAEHIHE HVSQKP
|
| |