Gene Information |
Locus tag | Ndas_4636 |
Symbol | |
ID | 9248517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5506846 |
End bp | 5508096 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | phosphoribosylamine/glycine ligase |
Protein accession | YP_003682528 |
Protein GI | 297563554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
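The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(5506846, 5508096)
print(gene_len, protein_len)  # 1251 416
```

The result matches the Gene Length (1251 bp) and Protein Length (416 aa) fields.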
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.728715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.385617 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGCCC TCGTTCTCGG CGGCGGAGGC CGCGAGCACG CTCTGGTCCG CGCCCTGTCC CTGGACCCGG GTGTCACCAG CATCCACAGC GCCCCGGGCA ACCCCGGCAT CTCGGAGCTG GCCGAGAACC ACGTGCTCAA CGTGACCGAC GGCCTGGCCG TCACCGAGCT GGCCGCGCGC ATCCGCGCCG AGCTGGTCGT CATCGGACCG GAGGCCCCGC TGGTCTCCGG TGTGGCGGAC GCCCTGCGCG ACCGGGGCAT CCCGGTGTTC GGCCCCGACC AGGAGGCCGC ACGCCTGGAG GGCTCCAAGG CCTTCGCCAA GGAGGTCATG GAGGCCGCCG GGGTGCCCAC CGCCAAGGCG CGGGTGTGCA GGACCGCCAG CCAGGTGTCC GAGGCCCTCG ACGAGTTCGG CACGCCCTAC GTGGTCAAGA ACGACGGCCT GGCCGCGGGC AAGGGCGTCG TGGTGACCGA GGACCGCGCC CTCGCCGAGC AGCACGCCCG GGAGTGCGGC CGCGTGGTCA TCGAGGAGTT CCTCGACGGC CCCGAGGTGT CCCTCTTCGT GCTGAGCGAC GGGCTGCACG CCCTGCCGCT GCTGCCCGCC CAGGACTTCA AGCGCGCCTA CGACGGCGAC CAGGGCCCCA ACACGGGCGG CATGGGCGCG TACGCGCCGC TGCCGTGGGC CCCGGCGGGC CTGGTGGACG AGGTGATGGA GTCGGTCGTG CGGCCGACCC TGGTGGAGAT GAACCGGCGC GGTAAGCGCT ACCAGGGCCT GCTGTACGTG GGGCTGGCGC TCACGTCGCG GGGTCCGCGC GTGGTGGAGT TCAACGCCCG GTTCGGCGAC CCGGAGACCC AGGTGGTCCT GGACAGGCTG GCCACCCCGA TCGGCGCCGT CCTCCAGGCC ACCGACACCG GCGGCCTGGG GGGCATCGGC TCCCTCCAGT GGAAGTCGGG CGCCGCGGTC ACCGTGGTGG TCGCCGCCGA GAACTACCCG GGCGACCCGG TCAAGGGCGA CGTCATCGGC GGCCTGGACC AGGCCAACGC GATGGAGGGC GCGTACGTGC TGCACGCGGG CACCGACTGG GAGGGCTCGG GCGGCGTCAA GGCGAGCGGA GGCCGGGTGC TCAACGTGGT CGGCACCGGG ATCGACCTGC GCCAGGCGCG CGAGCGGGCC TACGAGGCCG TGGCGCGCAT CGAGCTGCGC GGCTCGTTCC ACCGCACCGA CATCGCCGAG CGCGCCGCGG CCGAACTGTA G
|
Protein sequence | MKALVLGGGG REHALVRALS LDPGVTSIHS APGNPGISEL AENHVLNVTD GLAVTELAAR IRAELVVIGP EAPLVSGVAD ALRDRGIPVF GPDQEAARLE GSKAFAKEVM EAAGVPTAKA RVCRTASQVS EALDEFGTPY VVKNDGLAAG KGVVVTEDRA LAEQHARECG RVVIEEFLDG PEVSLFVLSD GLHALPLLPA QDFKRAYDGD QGPNTGGMGA YAPLPWAPAG LVDEVMESVV RPTLVEMNRR GKRYQGLLYV GLALTSRGPR VVEFNARFGD PETQVVLDRL ATPIGAVLQA TDTGGLGGIG SLQWKSGAAV TVVVAAENYP GDPVKGDVIG GLDQANAMEG AYVLHAGTDW EGSGGVKASG GRVLNVVGTG IDLRQARERA YEAVARIELR GSFHRTDIAE RAAAEL
|
| |
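The gene and protein sequences above can be cross-checked by translating the gene sequence under translation table 11 and by computing its GC fraction. The following is a minimal sketch in plain Python, assuming the sequence is pasted with the same 10-base spacing shown above; the helper names `clean`, `gc_content`, and `translate_cds` are hypothetical. Table 11 uses the same codon-to-amino-acid assignments as the standard code, and the initiating codon (GTG in this record) is read as methionine.

```python
from itertools import product

# Codon order T, C, A, G with the standard amino-acid string,
# which translation table 11 shares with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, assuming its first codon is a valid start codon."""
    seq = clean(seq)
    residues = ["M"]  # the start codon is read as Met at initiation
    for i in range(3, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
prefix = "GTGAAAGCCC TCGTTCTCGG CGGCGGAGGC"
print(translate_cds(prefix))  # MKALVLGGGG
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_content` allows the same comparison against the complete protein sequence and the GC content field.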