Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3848 |
Symbol | |
ID | 9247719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4616624 |
End bp | 4618879 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003681751 |
Protein GI | 297562777 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTA CGGGCAACGA GAAAGCATCC GAAGGGAAGA ACGTGAGCAC ACCGCCTCCG CGCGCACTCC TGTACGGGGA CGTGGACCTC AACATCATCG ACGGTTCCGC GATCTGGGCG CAGTCGATGG CGCAGGCCCT GGCCGCCGCC GGATGCGAGG TGACCCTGCT GCTGAAGGCG CCCGTGCGCA CCGACCGGCT CACCGAGCCC CTCACCAGGG TGCCCGGGGT CCGGCTGCTG CGGCCCTACG AGGACAAGGC GCTGCCCGAC CTGGGGCCGA GGGGGCTCAC CCCGGAGCAG GCCGTCACCC TGATGACCAG GCACCACGAG CGCAAGCCCT TCGACCTCGT CGTCGTGCGC GGGCGCCGCC TGGCCGGGCT GGCCGCGCAG GAGGAGGCGC TGGCGGGCCG CCTGTGGACC TACCTGACCG ACTTCCCGCA CAGCGTGGGC GAGCTGTCCG CGACCGCCAC CGCCGAACTC ACCGAGATCG CGCTGGCCTC GCGGTTCCTG CTCTGCCAGA CCGAGGAGCT GCGCGCCTTC CTGGAGTCGA CGGTGCCCGC CGCCTGCGGC CGGTCGGTCC TGTTCCCGCC CGTGGTGGTC GTCCCCGAGG ACGTGCGCGC CGACGGCGGG GCCGGCGGCC GCGCCCGCCT GGCCTACACC GGCAAGTTCG CGCCCCGCTG GAACACCCTG GAGATGACCG AGCTGCCCGC CGAGCTGGCG CGGCGCGGGG TGGACGCCGA ACTCGTGATG ATCGGCGACA AGATCCACGC CGAACCCGCC GACTGGGCCA AGAACATGCG CAGGGCCCTG GAGGGCACCC CGAACGTGGA CTGGCGCGGC GGGATGTCGC GCGCCGAGGC GCTCCGCCAG GCCGCCGAGT GCGGCTTCGG GCTGTCCTGG CGCGACCCGT CCATGGACGC CAGCCTGGAG CTGTCCACCA AGGTGCTGGA GCTGGGCGCG CTCGGACTGC CGGTGGTGCT CAACCGCACG CCCATGCACG AGGCCATGCT GGGCGCGGAC TACCCGCTCT TCGCCGGGAC CGACGTCGCC TCGGTCGCCG ACGTGGTGGC CCGCGCCCAC GGCGACCGCG CGGTCTACGC GGACGCCGCC GCGCGCTGCC GCGACGCCGC GGCCGACCAC ACGCTGGAGC GGGCCGCCGA GCGGCTGCGC GGCTACCTCG CCGACGCCCT GCCGCCCACC CCCGAGGGCG CCGACCCCGA GCGGCCGCTC AAGGTGGTCA TCGCCGGGCA CGACATGAAG TTCTTCACCC GCCTGGCCGA GTACCTGGAC TCGCTGCCCG GTCTCGACGT GCGCATGGAC GAGTGGGAGG GGCTGAGCAC CCACGACCAG TACCGCTCCC GGGAGCTGGC CGCCTGGGCC GACGTGGTGA TCTGCGAGTG GTGCGGGCCC AACGCGCTGT TCTACTCCAA GTGGAAGCGC CCCGACCAGC GGCTCATCGT GCGGCTGCAC CGCTTCGAGC TCTACGCGGA GTGGCCCCGC AAGCTCGACA TCGACAAGGT CGACGCGGTG GTGTGCGTGA GCCCCCACTA CGCCGACCTG ACCCGCGAGA TCACCGGGTG GCCCGCCGGG AAGGTGGTCG TGGTCCCCAA CTGGGTGGAC GACGAGCAGC TCGGCCGCCC CAAGCTCCCC GGGGCCGAGT ACTCCCTGGG CATGGTCGGC ATCGCGCCCT CGCGCAAGCG GCTGGACCGG GGCCTGGACG TCATCGCCGA GCTGCGGCGC ATGGACCCGC GGTACACGCT GTCGGTCAAG ACCAAGCAGC CGTGGGAGTA CTGGTGGATC TGGAACCGGC CGGAGGAGCG CGCCTACTTC GAGCGCGTCT ACCGGCGGAT CCAGCGCGAC GAGCGCCTCG CCTCCGGGGT GGTGTTCGAC CCCTTCGGGC CGGACGTGGC CACCTGGCTG CGGCGGGTGG GGTTCATGCT CTCCACCAGC GACGACGAGT CCTTCCACCT GGCGCCCGCC GAGTGCGCCG CCTCGGGCGG CGTGCCCGCC CTGCTGCCGT GGCCGGGCGC GGACACCATC TACGACCCGC ACTGGATCCA CGACGACGCC GTGGCGATGG CCGAGGCCAT CCACGCGACC GTCAGCGAGG GGCGGTTCTC CTCGGAGGCC GCGCGCGCAC GCGAGGAGGT CACCACCGCC TACGGCCTGT CCCGGGTGCG GTCGCTGTGG AGCGACCTGG TGGTCCGCGG CAGGGCGCCC CAGGCGGAGC ACTCCGCCGC CACAGCAGGC GCCTGA
|
Protein sequence | MSRTGNEKAS EGKNVSTPPP RALLYGDVDL NIIDGSAIWA QSMAQALAAA GCEVTLLLKA PVRTDRLTEP LTRVPGVRLL RPYEDKALPD LGPRGLTPEQ AVTLMTRHHE RKPFDLVVVR GRRLAGLAAQ EEALAGRLWT YLTDFPHSVG ELSATATAEL TEIALASRFL LCQTEELRAF LESTVPAACG RSVLFPPVVV VPEDVRADGG AGGRARLAYT GKFAPRWNTL EMTELPAELA RRGVDAELVM IGDKIHAEPA DWAKNMRRAL EGTPNVDWRG GMSRAEALRQ AAECGFGLSW RDPSMDASLE LSTKVLELGA LGLPVVLNRT PMHEAMLGAD YPLFAGTDVA SVADVVARAH GDRAVYADAA ARCRDAAADH TLERAAERLR GYLADALPPT PEGADPERPL KVVIAGHDMK FFTRLAEYLD SLPGLDVRMD EWEGLSTHDQ YRSRELAAWA DVVICEWCGP NALFYSKWKR PDQRLIVRLH RFELYAEWPR KLDIDKVDAV VCVSPHYADL TREITGWPAG KVVVVPNWVD DEQLGRPKLP GAEYSLGMVG IAPSRKRLDR GLDVIAELRR MDPRYTLSVK TKQPWEYWWI WNRPEERAYF ERVYRRIQRD ERLASGVVFD PFGPDVATWL RRVGFMLSTS DDESFHLAPA ECAASGGVPA LLPWPGADTI YDPHWIHDDA VAMAEAIHAT VSEGRFSSEA ARAREEVTTA YGLSRVRSLW SDLVVRGRAP QAEHSAATAG A
|
| |