Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3078 |
Symbol | |
ID | 8448692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3394258 |
End bp | 3395865 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645042160 |
Product | anthranilate synthase component I |
Protein accession | YP_003202401 |
Protein GI | 258653245 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00101032 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000123729 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGGCA TGTCCGTCAG CACCCCCGCC CCGCCGCGCG CGGCCCAGCC CGGGCTCGGG GAGATCAGCC CCAGCCGGGA GGAGTTCCGC GAGCTCGCCC GGGATCGGCG GGTGATCCCG GTGACCCGGC GCCTGCTCGC CGACACCATC ACCCCGGTCA GCCTGTACGC GACCCTGGCC GGCGACCGGC CCGGCACCTT CCTGCTGGAA TCGGCCGAGA ACGGGCGGTC CTGGTCGCGC TGGTCGTTCG TCGGGGTGTC CGCCCCGGCC GTGCTGACCG AACGCGACGG GCAGGCGACC TGGCTGGGCA CCCCGCCGGC CGGCCTGCCG ACCAGCGGCG ACCCGCTGCG GGTCCTGGAC GAGTCGCTGC GCTTCCTGCA CACCGAGCCG CTGGCCGGGC TGCCCCCGCT GACCGGCGGC CTGGTCGGCT ACCTCGGCTA CGACGTGGTC CGCCGCTGGG AGAAGATCGA CACCGCGGCC AGCGCGACCC GCCCGCCGGC GCCGGCCGAC CCGGAGATTC CCGAGCTGGT CATGCTGCTG GCCACCGACC TGGCCGCCCT GGACCACCAC GCCGGCACCG TCACCCTGAT CGCCAACGCG GTGAACTGGG ACGGCACCGA CGCCCGGGTC GACCAGACCT ATGACCACGC GGTGGCCCGG CTGCACGAGA TGAGCCGCAC GCTGGCCCAG CCGCGGTCGC TGCCGGCCGC CCACTTCACC GCCCGCACCC CGCCGGTGCG CCGGCGCACC GAGTCCGCCG AGTACCAAGC CAACGTGGAC GTGGCCAAGG AGCACATCCG GGCCGGGGAC GCCTTCCAGA TCGTGCTGTC GCAACGGTTC GACGTGCCCA CCGAGGCCGA CCCGCTGGAC ATCTACCGGG TGCTGCGGGC CACCAACCCG AGCCCGTACA TGTACCTGCT GCGCCTGCCC ACCCCCGACG GCGGCTCCTT CTCGGTGGTC GGCTCCTCGC CCGAGGCGCT GGTCACCGTG CGCGAGGGCC TGGTGACGAT GCACCCGATC GCCGGCACCC GGCCCCGCGG GCACACCGAG GAGGACGACG TCTGGCTGGC CAAGGACCTG TTGGCCGACG AGAAGGAACG CAGCGAGCAC GTGATGCTGG TCGACCTGGG CCGCAACGAC CTGGGCCGGG TCTGCGCCCC GGGCACGGTC AAGGTGGTCG ACTTCTTCAC CATCGAGCGG TACAGCCACG TCATGCACAT CGTCTCGACG GTCACCGGGC AGCTGGCCGC CGACCGCACC GCCTACGACG CACTGGCCGC CTGCTTCCCC GCGGGCACCC TGTCCGGGGC GCCCAAGCCG CGGGCCATGC AGATCATCAA CGAGCTCGAA CCGCTGCGCC GCGGCGTGTA CGGGGGAGTC GTGGGCTACC TGGACTTCGC CGGGGACGCC GACACCGCGA TCACCATCCG TACCGCGCTG GTGGTCGACG GCACCGCCTA CGTGCAGGCC GGCGCCGGGG TGGTGGCCGA CTCGGTACCC GAGAACGAGG ACGCGGAGTG CCGGAACAAG GCCGCCGCCG TCATCGCCGC CGTCGGCGCC GCCGCGACCA TGCAGGTGGT CGGGGCCACG CAGGTGATCG GTGACTGA
|
Protein sequence | MTGMSVSTPA PPRAAQPGLG EISPSREEFR ELARDRRVIP VTRRLLADTI TPVSLYATLA GDRPGTFLLE SAENGRSWSR WSFVGVSAPA VLTERDGQAT WLGTPPAGLP TSGDPLRVLD ESLRFLHTEP LAGLPPLTGG LVGYLGYDVV RRWEKIDTAA SATRPPAPAD PEIPELVMLL ATDLAALDHH AGTVTLIANA VNWDGTDARV DQTYDHAVAR LHEMSRTLAQ PRSLPAAHFT ARTPPVRRRT ESAEYQANVD VAKEHIRAGD AFQIVLSQRF DVPTEADPLD IYRVLRATNP SPYMYLLRLP TPDGGSFSVV GSSPEALVTV REGLVTMHPI AGTRPRGHTE EDDVWLAKDL LADEKERSEH VMLVDLGRND LGRVCAPGTV KVVDFFTIER YSHVMHIVST VTGQLAADRT AYDALAACFP AGTLSGAPKP RAMQIINELE PLRRGVYGGV VGYLDFAGDA DTAITIRTAL VVDGTAYVQA GAGVVADSVP ENEDAECRNK AAAVIAAVGA AATMQVVGAT QVIGD
|
| |