Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4596 |
Symbol | |
ID | 9248477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5450705 |
End bp | 5451877 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | glutamate--cysteine ligase GCS2 |
Protein accession | YP_003682489 |
Protein GI | 297563515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0023216 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAACC GCTCCACGGC AGCACACGAC ACGCGGCGGC CGGTGTCCGC CGGCACCCGC ACCGGCAACC GCACCGTGCC CCCGACCTTC GGCGTCGAGG AGGAGTTCTT CGTCGTCGAC CCCGGCACCC GCCTCATCCT GTCGCGGGCG CCCGAGGTGC TCGCCAACAC CCGCGTCCTG GGCGACCACC TGTGCGGTGA GTTCTCCCGC GCGCAGGTGG AGGCCAACAG CCCGGTGTGC GACGACGCGG GCGAGGCCCT GCGCTTCCTG CGCGCGGCGC GCGGGGAGCT GGACCGCGCC GCGGGCGCGG CCGGGCTCGC CGTCGTCGCC TCCGGCACCG CCGTCCTGGG CGATCCGGCC TCGGTGGGGA CCAGTGAGGG GCGCCGCTAC GCCGACATCA CCGCGCACTT CGGCGCGCTG CGCGAGTCGC ACGTGGTGTG CGGCTGCCAC GTGCACGTGG GCGTCCCCGA CCGGGAGACC GCCGTGGCCG TGGGCAACCA CCTGCGCCGG TGGCTGCCGT TCCTGGTGGC GCTGTCGGCG AACTCGCCGT TCCACGCCGG ACGCGACACC GGCTACTCCA GCTGGCGCAC GGTCGCCTGG AACCGGCTGC CCTCGGCCGG CCCTCCCCCG TTCCTGCGCT CCCTGGCCGA GCACGAGCAG GCGGTGCGGG CGCTCTCCGA CTCCGGGGCG ATCCTGGACC GGCGGATGGT CTACTGGGAC GTCCGCCTCT CCGACCACCT GCCCACGCTG GAGATCCGGG TGAGCGACGT GGCCGCCACC GCCGAGGAGG CGCTGCTGCT GGCCCTGCTG GTCCGCGGCC TGACCGGTCG CGCGCTCGCC GACGTGCTCT ACGGGGTCCC CGCGCCCGCC ATCCCCGACC AGGCGCTGCG GGCCGCCGTG TGGCGGGCCG CCCGGGACGG GTTGGAGGGC GTGGTGCCCG ACCCGCTGAC CGGTGAGGCG CTGCCCGGGC ACGCGGCCGC CGAACGGCTG CTGCACGCCG CGATGCCCGG CCTGCTGGCC AACGGCGACG CCGACCTGGC CTCCTCGCTG CTCGACCGGG TGCGGGCGGC CGGGAGCGGG GCCGCCCGCC AGCGCGCGGT GTACGCCCGG CGGGGCAGAC TCGCCGACGT GGTGGACCAC CTCGTGGTCC AGACGAGGGA GGGCCTCGTC TGA
|
Protein sequence | MENRSTAAHD TRRPVSAGTR TGNRTVPPTF GVEEEFFVVD PGTRLILSRA PEVLANTRVL GDHLCGEFSR AQVEANSPVC DDAGEALRFL RAARGELDRA AGAAGLAVVA SGTAVLGDPA SVGTSEGRRY ADITAHFGAL RESHVVCGCH VHVGVPDRET AVAVGNHLRR WLPFLVALSA NSPFHAGRDT GYSSWRTVAW NRLPSAGPPP FLRSLAEHEQ AVRALSDSGA ILDRRMVYWD VRLSDHLPTL EIRVSDVAAT AEEALLLALL VRGLTGRALA DVLYGVPAPA IPDQALRAAV WRAARDGLEG VVPDPLTGEA LPGHAAAERL LHAAMPGLLA NGDADLASSL LDRVRAAGSG AARQRAVYAR RGRLADVVDH LVVQTREGLV
|
| |