Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3851 |
Symbol | |
ID | 9247722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4622261 |
End bp | 4623988 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003681754 |
Protein GI | 297562780 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0448869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.314973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCTGGC GACACCTGCG GGCCGAACCG GCCCGGCTGC CGCTGCTGGC GCTGCGGCTG ACGCCCGGTC CGCCCCGGCG CGCGGTGCGC GCCCTGGCCT CCCGGGCGGG CGGCCGGGCC CGCGCCTACG CCCTGTGGGA CCAGGGCCGG CGCGCGGACG CCCGCGAGGC GGTGCTCGCC GCGGCGAGCG GTGCCTCCCC CCGCCGGGTG GGCCGCCTGG TGGCGTCCTG CCTGGCGGCG GGAGACGCGG CCACCGCCCG GGAGCTGGTC GAACAGCTTC CGGAGGGCGC TCTTCGGGAG GCCGCGGAGC ATCGGACGGC CCTGGCCACG GGACGCGCTG TCGCCGCGCC CCCGTCCACG TCGCCTGACC GTCACGCCAT CACGTTAACC CGCGATCCAC GAACGCCGCC CGACGCCACG GTGAACGCTC GGCGACCGGA GGCGTCGGAG GGTGAACGGT GGCGTGTGGC CGGGGACCCG GCGTCACCCG GAGCGGGCCT GCGGGTGCTG CACCTGGTGA CCAACGCGCT GCCGCACACC AACGCGGGCT ACACCCAGCG CACCCACAAG ATCGCGGTCG CCCAGCGCGA GGCCGGAATG GACGTGCACG TGGTCACCCG GGCGGGGTAC CCCCTGGTCA AGGGGGTTCC CGACCCGCGC ACGCTGGTGC GGGTGGACGG CATCCCCTAC CACCGCCTCC TCCCCTGGAC GGCGCCCGCC GACGCCGCCC AGGAGCTGGC CGCGGGGGTG CGGCTGGGGT CGGAGCTGGT GGAGGCGCTG CGGCCCGACG TGCTGCACGC CGCGAGCAAC CACCACAACG CCCGCCTGGC CCTGGAGCTG GGCCGCAGGT TCGGCCTGCC GGTGGTGTAC GAGGTGCGGG GCTTCCTGGA GGAGTCGTGG CTCTCGCGCG ACCCCTCGCG CAGCGTGGAC GACGCCTTCT ACCGGGCCGA GCGCGCGAGC GAGACCGAGT GCATGCTGGC CGCCGACCTT GTGGTGACCC TGGGCGAGGC GATGCGCGCC GACATCGAGG CGCGCGGCGT GCCGCGCGAG CGCCTGCTGG TGGTGCCCAA CGCGGTGGAC GCCTCCTTCC TGGCCCCGCT GCCGCCGGGC GCGGGGGTGC GCGCCGAGCT GGGGATCGGC GGCGAGGACT TCGTGGTGGG CACCACCACC AGCTGCTTCG GCTACGAGGG CCTGGACACG CTGCTGGAGG CGGTGGCCCT GATGCGCGAA CGCGGCGAGG CGGCGCACGC CCTGGTGGTG GGGGACGGCC CCGAGCTGCC CGCGCTGCGC TCCCTGGCCG ACTCGCTGGG TCTGGAGGGG GCCGCGCACT TCACCGGCCG CGTCCCGGCC GCGCGGGTGC GCGACCACCA CGCCGCGCTG GACGTGTTCG CGGTGCCCAG GCGCGACGAG CGGGTGTGCC GTCTGGTCAC CCCGCTCAAA CCCGTGGAGG CCATGGCGGG CGGGCTTCCG GTGGTGGCCA GTGATCTCCC CGCGTTGCGA GAGATCGTGG AACCGGGAGT GACAGGAGAG TTAATTCCGG CAGGCGAATC GGCGACCCTA GCCGATGTGC TGACAAAACT CGCTTACAGT CGTGAAAAGC GGATCTCCTA CGGCAGTGCG GGTCGCGATC TCGTCGGCGA CCGCACCTGG GCCGAGGCCG CATACCGCTA CAACCAGGCG TATCGGGTTC AGATTCGCGA AATGACCGAA CCAGGCCAGA ACCCGTAA
|
Protein sequence | MAWRHLRAEP ARLPLLALRL TPGPPRRAVR ALASRAGGRA RAYALWDQGR RADAREAVLA AASGASPRRV GRLVASCLAA GDAATARELV EQLPEGALRE AAEHRTALAT GRAVAAPPST SPDRHAITLT RDPRTPPDAT VNARRPEASE GERWRVAGDP ASPGAGLRVL HLVTNALPHT NAGYTQRTHK IAVAQREAGM DVHVVTRAGY PLVKGVPDPR TLVRVDGIPY HRLLPWTAPA DAAQELAAGV RLGSELVEAL RPDVLHAASN HHNARLALEL GRRFGLPVVY EVRGFLEESW LSRDPSRSVD DAFYRAERAS ETECMLAADL VVTLGEAMRA DIEARGVPRE RLLVVPNAVD ASFLAPLPPG AGVRAELGIG GEDFVVGTTT SCFGYEGLDT LLEAVALMRE RGEAAHALVV GDGPELPALR SLADSLGLEG AAHFTGRVPA ARVRDHHAAL DVFAVPRRDE RVCRLVTPLK PVEAMAGGLP VVASDLPALR EIVEPGVTGE LIPAGESATL ADVLTKLAYS REKRISYGSA GRDLVGDRTW AEAAYRYNQA YRVQIREMTE PGQNP
|
| |