Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3943 |
Symbol | |
ID | 9247814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4714548 |
End bp | 4716317 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Carbamoyl-phosphate synthase L chain ATP-binding protein |
Protein accession | YP_003681846 |
Protein GI | 297562872 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.705932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTAAAG TACTGATCGC CAACCGCGGC GAGATCGCCG TGCGCATCGC CCGCGCCTGC CGCGACGCCG GACTCGCCAG CGTCGCCGTC TACGCCGAGC CCGACCTGGA CGCCCTGCAC GTCAAGGTCG CCGACGAGGC CCACTCCCTG GGCGGGAGCA CCCCGGCCGA CTCCTACCTG GACATCGCCA AACTCCTGGC GGTGGCCGAG GCCTCCGGTG CGGACGCCGT CCACCCCGGC TACGGGTTCC TGGCCGAGAA CGCCGACTTC GCCCAGGCCG TCATCGACGC CGGGCTGACC TGGATCGGCC CGCCCCCCTC GGCGATCACC GCCCTGGGCG ACAAGGTCCA GGCCCGCCAC ATCGCCCAGA AGGTCGGCGC CCCGCTGGTG GCGGGCACCG CCGACCCCGT GGAGTCGGCC GAGGAGGTCG TGGCCTTCGC CGAGGAGCAC GGCCTGCCCA TCGCGATCAA GGCCGCCTTC GGCGGCGGCG GGCGCGGCCT CAAGGTCGCC CACACCCTGG ACGAGGTCGC CGACGCCTAC GAGTCCGCGG TGCGCGAGGC GGTCACCGCC TTCGGCCGGG GCGAGTGCTT CGTGGAGCGC TACCTCGACC GGCCCCGGCA CGTGGAGACC CAGTGCCTGG CCGACACCCA CGGCAACGTC GTGGTGGTCT CCACCCGCGA CTGCTCGCTC CAGCGGCGCC ACCAGAAGCT GGTGGAGGAG GCGCCCGCGC CGTTCCTGTC CGCGGAGCAG ATGGAGCGGC TGCACAGCTC CTCCAAGGCC ATCCTCGCCG AGGCCGGGTA CACCGGCGCG GGCACCTGCG AGTTCCTGGT GGGCGTGGAC GGCACCATCT CCTTCCTGGA GGTCAACACC CGTCTCCAGG TCGAACACCC CGTGACCGAG GAGGTCACCG GGATCGACCT GGTGCGGGAG ATGTTCCGCA TCGCCGACGG CGAGGAGCTG GGCTACGGCG ACCCCGAGGT GCGGGGGCAC TCCTTCGAGT TCCGCATCAA CGCCGAGGAC GCCGGGCGGG GCTTCATGCC CGCGCCGGGC ACCATCACCG AGCTGAACCT GCCCGGCGGC CCGGGCGTGC GCGTGGACAC CGGCTGCGAG GCCGGTTTCA CGGTGCCCCA GGCCTTCGAC TCGATGGTCG CCAAGCTCGT GGTGACGGGC CGCACCCGCC AGGAGGCGCT CCAGCGCTCG CGGCGGGCCC TGGCCGAGTT CACGGTCGGC GGGATGCCCA CGGTGATCCC GTTCCACCAG GCCGTGGTGA GCGACCCGGC CTTCGCACCG GCCGACCCCG AGCAGCCGTT CGGGGTGTAC ACCCGGTGGA TCGAGACCGA GTTCGAGAAC ACCATCGAGC CGTGGTCGGG CACTCCGGGC GAGCCCGTCG AGGCCGAGCG CGAGAGGGTC ACCGTCGAGG TGGGCGGCAA GCGTGTGGAG GTGATCCTGC CCGCCGGGCT GGGCGCCTCG GCCGCCGCCC CCGCCGGTGG CGGTCAGAGC CGCAGGAAGC GGACCGCGCG CAAGGGCGGC GGGAACACCG TGGCGGCGGG CGGCGACTCG CTGGTCTCGC CGATGCAGGG CACCGTGGTC AAGCTGGTGG CCGAGGAGGG CCAGCAGGTC GCCGAGGGCG ACACCGTGGT CGTCATCGAG GCGATGAAGA TGGAGCAGCC GCTCAACGCG CACAAGGCCG GGACGGTGAC GGGCCTGAGG ATCGCCGCGG GCGAGACCGT GGGCAACGGC GCGGTGGTCT GCGAGATCAA GGACGCCTAG
|
Protein sequence | MRKVLIANRG EIAVRIARAC RDAGLASVAV YAEPDLDALH VKVADEAHSL GGSTPADSYL DIAKLLAVAE ASGADAVHPG YGFLAENADF AQAVIDAGLT WIGPPPSAIT ALGDKVQARH IAQKVGAPLV AGTADPVESA EEVVAFAEEH GLPIAIKAAF GGGGRGLKVA HTLDEVADAY ESAVREAVTA FGRGECFVER YLDRPRHVET QCLADTHGNV VVVSTRDCSL QRRHQKLVEE APAPFLSAEQ MERLHSSSKA ILAEAGYTGA GTCEFLVGVD GTISFLEVNT RLQVEHPVTE EVTGIDLVRE MFRIADGEEL GYGDPEVRGH SFEFRINAED AGRGFMPAPG TITELNLPGG PGVRVDTGCE AGFTVPQAFD SMVAKLVVTG RTRQEALQRS RRALAEFTVG GMPTVIPFHQ AVVSDPAFAP ADPEQPFGVY TRWIETEFEN TIEPWSGTPG EPVEAERERV TVEVGGKRVE VILPAGLGAS AAAPAGGGQS RRKRTARKGG GNTVAAGGDS LVSPMQGTVV KLVAEEGQQV AEGDTVVVIE AMKMEQPLNA HKAGTVTGLR IAAGETVGNG AVVCEIKDA
|
| |