Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3362 |
Symbol | |
ID | 8448977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3700090 |
End bp | 3701367 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 645042439 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003202679 |
Protein GI | 258653523 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000110111 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0175779 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCATC CGCCCCCGCC GGTCGGCGTC GTCGTGCCGT ACTACGACGA CCAACCCCGG CTGACGCTGC TGCTGCGGGC CCTGGCCCAG CAACGCACGA CCGTGCCGTT CGAGATCGTC GTGGCCGACG ACGGATCGCC GGCGCGGCCG GTCATCCCGC CGGGACTGGG CCGGCCCGTC ACCGTGGTCA GCCAGCCGGA CCAAGGATTT CGGGCCGCCG CCGCCCGCAA CCTGGGCGCG ACGCACACGG CCGCCGACCT GCTGCTGTTC CTGGACGGGG ACACCCTGCC CACCCCCGGG TACGTGCAGG CCATGGTCGA CCGGCTGTGC GCGCTGGACG ACGGGTCCGG GGCGCTGGTC GTCGGCCGGC GCCGGCACGC CGATCTCGGG TCGGTCCCGG CCGACGAGGT GCTCCGCTTC CTGTGCGGTG ACACCGTTTC GGGCATCACC GTGCTGGACG AGCCGTCCTG GCTGCGGGAC GGGTACGCGC GCACCGACGA CCTGGGCCGG GCCGGGGAGG AGGACTTCCG GCTGATCATC TCCGCGGTGC TGGGGGTCGA CCGGCGGTTG TGGTCGGCGA TCGGTGGGTT CGACGCCTCC TTCGTCGGGT ATGGCGGCGA GGACTGGGAT CTGGGCTGGC GGGCCTGGCT GGCCGGCGCC CGGTGGGCCC ATGAGCCGGC CGCCCTGGCC TGGCACGACG GCCCGGATGC GGCCGGCCGG GCCGCGTCGT CGTCCGGGTC CGCCGCGAAC ACCGGCAAGA ACACCGAATC CCTGCGCCTG GCCCGGACGA TTCCGCTGCC GTCGGTGCGG GGTCGGGCGC TGGTGCTCGC CCAGCCCGAC ATCGTCGTGC GGTACCTGGG TCCGACGACC GGCACCGCCG CGGACGCCGC GGTGGTGGCC GGCGTCGTCG ACCTGCTGGC CGGGGTCGAT GCCGGCGTCT GGTTCCCGCG GTGCTCGGCC GACGGTCCGG ACCGACTGCC GCCGCTGCTG GCCCAGGACC CGCGGGTGCG GGCCGGGCAG GTGCCGGCGG ATGTGCTGCG GCGGGCCCGG TTTCAGGTGT CGGTGACCCG GCCGCTGACG TTGTCGGCGC CGCTGGACCG GTGCTGCGCG GCCGGAGAAT GGGAGGTGCC CGGCCGGGTG AGCATCCGCC GGACGCGGTC GCTGAACCGG GGCGAGCCAC CCCCGCCCCG CCGGCCGATC GCGCGGCCGG GCCCCGACCC GCTGGTCCGT CCGATCCCGG ATGACGTGTC GCTGGAGCGT TGGTGGGGCG GTTGGTGA
|
Protein sequence | MTHPPPPVGV VVPYYDDQPR LTLLLRALAQ QRTTVPFEIV VADDGSPARP VIPPGLGRPV TVVSQPDQGF RAAAARNLGA THTAADLLLF LDGDTLPTPG YVQAMVDRLC ALDDGSGALV VGRRRHADLG SVPADEVLRF LCGDTVSGIT VLDEPSWLRD GYARTDDLGR AGEEDFRLII SAVLGVDRRL WSAIGGFDAS FVGYGGEDWD LGWRAWLAGA RWAHEPAALA WHDGPDAAGR AASSSGSAAN TGKNTESLRL ARTIPLPSVR GRALVLAQPD IVVRYLGPTT GTAADAAVVA GVVDLLAGVD AGVWFPRCSA DGPDRLPPLL AQDPRVRAGQ VPADVLRRAR FQVSVTRPLT LSAPLDRCCA AGEWEVPGRV SIRRTRSLNR GEPPPPRRPI ARPGPDPLVR PIPDDVSLER WWGGW
|
| |