Gene Information |
Locus tag | Noca_4527 |
Symbol | |
ID | 4597046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4788425 |
End bp | 4789738 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639779138 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_925711 |
Protein GI | 119718746 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
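
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length includes the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    counted in the gene length but not in the protein length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Noca_4527.
start_bp, end_bp = 4788425, 4789738

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 1314
print(expected_protein_length_aa(length))  # 437
```

Under those assumptions both results agree with the Gene Length (1314 bp) and Protein Length (437 aa) fields.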
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGACC CCGCCCTCGA CCCCGCCCTC GATCCCGCAC GCGACCCGGT GGCGTTCTTC CGCGGCGTGG CGGCGGCGTA CCCGCGCTGC TTCTGGCTCG ACGGTGGCGG TGCCCGGGAG TGGTCGGGTC GTCGGTCGAT GGTCGGCTGG CTGGGCGAGG ACGACGTGTC GCTGACCTAC TCCGCGGCCC GCCGGGAGGT GCGCCGGCAC GTCGGCGGCA GCTGCGAGGT GGTCGGCGAC GACGTGTTCG TCGTACTCGA GGCCGAGCTG GCCGCCGGCG CGCCCGATGA CCACTGGGTG GGCTACCTCG GCTACGCCTG CCGCCCGGAC CTGCCCGCCT CGACCGGTCC CGGCCTGCCC GACGCCGTGT GGATGCGGCC GGCGGGCGTC CGGTTCTTCG ACCACGGACT GGGCGGGCAA TCCCGGAACT TCCTGGGGGA AGTTCCGGCC CAACAGTTCC GGTTTCCCCG GTTCACCGGG GAAACCGACC CGGCCCCGCC CGCCTACGCC ACCGCGTTCG AGGAGGTGCA GGAGCAGCTG CGGGCGGGGA ACAGCTACGA GGTCAACCTG ACCTACCGGC TGGCGCATCG CAGCGGGGTG GACCCGGTGA CGGCGTACCT GAGGCTGCGC GAGCTCAACC CGGCGCCGTA CGCCGGGTTC CTCCAGCACG ATGTCCGCGA CGTGCCGGAC GCCCGGGCCT GGCTGCTCAG CTCCAGCCCG GAGCGCTACG CGCTGGTGAC CGCCGACCGG AGCATCGAGA CCAAGCCGAT CAAGGGCACC ACGCTGCGCG GCGCGACCCC CGCCGAGGAC GAGGCCAGCC GGCACCGGCT CGCGACCGAC TCGAAGTTCC GCGCCGAGAA CCTGATGATC GTCGACCTGC TCCGCAACGA CCTCTCGATG GTGTGCCGCC CGGGGACCGT GAGCGTGCCG GCGCTGATGG ACGTCGAGTC CTACGCGACC GTGCACCAGC TGGTCAGCAC CGTCCGCGGC GAGCTGCGCG ACGACGTCAG CACGGTGCAG GCGCTGCGCG CGCTGTTCCC GGCCGGCTCG ATGACCGGCG CGCCGAAGCT GCGCACCATG CAGGTGATCG AGCAGGTCGA GGCCACCGAG CGCGGCCCGT ACGCCGGCGC CTTCGGCTGG GTCTGCGCCG ACGGCCGCGC CGACCTCGGC GTGGTCATCC GCAGCCTCGC CAGCACCGGC GACGGCGCCT ACCTGCTCGG CACCGGCGGC GGGATCACGG TCCGCTCCGA GGTCGCCGAG GAGTACGCCG AGTCCCGCTG GAAGGCCGAC CGGCTGCTGG CTGCGCTCGG CTGA
|
Protein sequence | MSDPALDPAL DPARDPVAFF RGVAAAYPRC FWLDGGGARE WSGRRSMVGW LGEDDVSLTY SAARREVRRH VGGSCEVVGD DVFVVLEAEL AAGAPDDHWV GYLGYACRPD LPASTGPGLP DAVWMRPAGV RFFDHGLGGQ SRNFLGEVPA QQFRFPRFTG ETDPAPPAYA TAFEEVQEQL RAGNSYEVNL TYRLAHRSGV DPVTAYLRLR ELNPAPYAGF LQHDVRDVPD ARAWLLSSSP ERYALVTADR SIETKPIKGT TLRGATPAED EASRHRLATD SKFRAENLMI VDLLRNDLSM VCRPGTVSVP ALMDVESYAT VHQLVSTVRG ELRDDVSTVQ ALRALFPAGS MTGAPKLRTM QVIEQVEATE RGPYAGAFGW VCADGRADLG VVIRSLASTG DGAYLLGTGG GITVRSEVAE EYAESRWKAD RLLAALG
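
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with GTG and ends with TGA, although the gene lies on the minus strand), and it treats only ATG, GTG and TTG as start codons, which is a simplification of translation table 11. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids as the standard code; it differs in permitted starts.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplifying assumption: only these alternative starts are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.
    A recognised start codon in first position is read as methionine."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate_cds("GTGAGCGACC CCGCCCTCGA CCCCGCCCTC"))  # MSDPALDPAL
```

The first ten codons translate to MSDPALDPAL, matching the start of the protein sequence. The leading GTG is read as methionine because it is the start codon. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field.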