Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4213 |
Symbol | |
ID | 9248087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5030581 |
End bp | 5031741 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | arginine biosynthesis bifunctional protein ArgJ |
Protein accession | YP_003682111 |
Protein GI | 297563137 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCC AGACCCCCGT GCCCGCAGGT TTCCGCGGCG TGGCCGGAAA CATCGGGATC AAGGACCCCA ACGACGACTT CTTCGCGGTG GTGTCGGAGG TCCCCGCCCG CGTGTCCGCC GTGTTCACCC GGTCGCGGTT CGCCGGACCC AGCGTGCTGC TCAGCCGCCG TTCGGCGGCC GACGGCAGCG CGCGGGGGGT CGCGGTGATC GCCCGCAACG CCAACGTGGC CACGGGCAGG GTCGGCGACA GCCACGCACA CGAACTCCAG GAGCGCGTGG CTCGGGCCGC GGGGGTGCCG CCCGAGGAGG TGCTGGTCGC CTCCACCGGC GTGATCGGCC GCCCGTACCC GATCGAGAAG GTGCGCGCCT ACCTGGACGC GCTGCCCGAG GAGTTCCCCC CGGCGGACCT GGACCGCTCG GCCGCGGCGA TGATGACCAC CGACACCCGG CCCAAGACCG CCTCGGCGAC GGTGGGCGGG GCGACGCTGA CGGGCATCGC CAAGGGAGTC GGCATGATCG AGCCGAACAT GGCGACCATG CTGGCCTGGT TCTTCACCGA CGCCGAACTG GAGCAGCCGG TGCTGGACGA GGTGTTCCGT CGGGTGGTGG ACCGCACGTT CAACGCGCTG AGCATCGACA CCGACACCTC CACCAGCGAC TCGGCGGCGG TGTTCGCCAA CGGGCTGGCC GGGCCGGTGG ACGTGGCGGA GTTCGAGGCG GCGCTGCATG AGGTGGCCCT GAAACTGGTG CGGATGATCG CCTCCGACGG CGAGGGCGCC AGCAAGCTGA TCGAGGTGCG CGTGACCGGC GCGCGCGACG ACGCCCAGGC CAAGCGGGTG GCCAAGGCCG TGGTGAACTC CCCGCTGGTG AAGACCGCCG TGCACGGCGC GGACCCCAAC TGGGGCCGGG TGACGATGGC GGTCGGCAAG TGCGAGGAGG AGACCGACAT CCTCCCCGAC AACGTGCGGA TCTCCTTCGG TGACGTGGAG ACCTACCCGG CGGAGGCCAC GGACGAGGTG CTGGAGCGCG CCGCCCAGCA CATGAAGGGC GACGAGGTGG TGATCGGCGT CGGCCTGGGC ATCGCCGACG GCGCCTTCAC GGTCTACGGG TGCGACCTGA CCGAGGGGTA CATCCGGATC AACGCCGACT ACACGACCTG A
|
Protein sequence | MSSQTPVPAG FRGVAGNIGI KDPNDDFFAV VSEVPARVSA VFTRSRFAGP SVLLSRRSAA DGSARGVAVI ARNANVATGR VGDSHAHELQ ERVARAAGVP PEEVLVASTG VIGRPYPIEK VRAYLDALPE EFPPADLDRS AAAMMTTDTR PKTASATVGG ATLTGIAKGV GMIEPNMATM LAWFFTDAEL EQPVLDEVFR RVVDRTFNAL SIDTDTSTSD SAAVFANGLA GPVDVAEFEA ALHEVALKLV RMIASDGEGA SKLIEVRVTG ARDDAQAKRV AKAVVNSPLV KTAVHGADPN WGRVTMAVGK CEEETDILPD NVRISFGDVE TYPAEATDEV LERAAQHMKG DEVVIGVGLG IADGAFTVYG CDLTEGYIRI NADYTT
|
| |