Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1684 |
Symbol | |
ID | 4445790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1876496 |
End bp | 1878082 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689505 |
Product | anthranilate synthase, component I |
Protein accession | YP_831178 |
Protein GI | 116670245 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.102471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGACC TTGGAATCAT CAGCCCGGGC CTCGAGGAAT TCCGGGAGCT CGCCAGCCAC AGCCGTGTCA TCCCCGTCCG GCTGAAGGTC CTGGCCGACG CCGAGACCCC CATCGGGCTC TACCGCAAGC TGGCGAACGG CCAGCCCGGC ACCTTCCTGA TGGAGTCCGC GGCCGTGGGC GGTTCGTGGT CCAGGTATTC CTTTATCGGC GCCAAGTCCC GCGCCACCCT GACCACCAAG GACGGGCAGG CGCACTGGCT GGGAAAGCCG CCTGCCGGCG TGCCGGTCGA CGGGAACCCC GTGGATGCCA TCCGTGACAC CGTGGAAGCC CTGCGCACCG ACCGCTTCGA AGGCCTGCCG CCGTTCACCT CGGGCCTGGT CGGGTTCCTC GGCTGGGAAA CCGTACGGCA TTGGGAAAAA CTCACCAGCC CGCCCGAGGA CGACCTGGAA CTTCCGGAAC TGGCCCTGAA CCTGGTCACG GACATGGCAG TGCACGACAA CATGGACGGC ACTGTCCTGT TGATCGCGAA CGCCATCAAC TTCGACAACA GCACGGAACG CGTGGATGAG GCATGGCACG ATGCCGTGGC CCGGGTCAAG GCGCTGCTGG CCAAGGTCAG CACTCCCGTG GAACAGCCGA TTTCGGTGCT GGAACCGGCC GCCCTGGACT TTGCCTCCAG TGTCCAGGAA CGCTGGAATG AGCCCGACTA CCTGGCTGCC CTGGACCGCG GCAAGGAAGC GATCGTTGAC GGGGAAGTGT TCCAGGTGGT CATCTCCCGC CGCTTCGAAA TGGAGTGCGG TGCCGATCCG CTGGATGTGT ACCGGGTGTT GCGGAATACC AACCCCAGCC CGTACATGTA CATCTTCAGC CTCGAAGACG CCGCCGGCCG GCAGTACTCG ATTGTGGGTT CTTCCCCTGA GGCCCTTGTG ACGGTCACCG GCGAGGAGGT CATCACCCAC CCCATCGCCG GATCACGTCC CCGGGGCAAG ACCGTGGAGG CGGACAAGAC CTTTGCCGAG GAGCTCCTTG CCGACCAGAA GGAACGCGCC GAGCACCTCA TGCTGGTGGA CCTGTCCCGC AACGACCTCT CCAAGGTGTG CGTAGCCGGC ACGGTGGATG TCACGCAGTT CATGGAGGTG GAGCGCTTCA GCCACATCAT GCACCTGGTG TCCACGGTGG TGGGCAAACT CGCCCCGACC GCCAAAGCCT ATGACGTGCT GAAGGCAACG TTCCCGGCCG GTACTCTCTC CGGCGCCCCG AAACCCCGTG CCCTGCGGCT CCTCGACGAG TTGGAACCGC ACCGCCGCGG CATCTACGGC GGCGTAGTGG GCTACCTGGA CTTTGCCGGC GACATGGACA TGGCCATCGC CATCCGTTCT GCGCTGCTCC GCGACGGCCG CGCCTACGTC CAGGCCGGCG GCGGCATCGT TGCCGACTCG GTGAACCCGA CGGAAGCCCT GGAAACGGTG AACAAGGCTG CCGCGCCGCT GCGGGCCGTC CACACGGCGG GGTCACTGCA CAACATCACG GCCGATTCCG TTGCGGAGCC GGGCGATGCT GCCGGCACAG ACACAGCGGC CCGTTGA
|
Protein sequence | MQDLGIISPG LEEFRELASH SRVIPVRLKV LADAETPIGL YRKLANGQPG TFLMESAAVG GSWSRYSFIG AKSRATLTTK DGQAHWLGKP PAGVPVDGNP VDAIRDTVEA LRTDRFEGLP PFTSGLVGFL GWETVRHWEK LTSPPEDDLE LPELALNLVT DMAVHDNMDG TVLLIANAIN FDNSTERVDE AWHDAVARVK ALLAKVSTPV EQPISVLEPA ALDFASSVQE RWNEPDYLAA LDRGKEAIVD GEVFQVVISR RFEMECGADP LDVYRVLRNT NPSPYMYIFS LEDAAGRQYS IVGSSPEALV TVTGEEVITH PIAGSRPRGK TVEADKTFAE ELLADQKERA EHLMLVDLSR NDLSKVCVAG TVDVTQFMEV ERFSHIMHLV STVVGKLAPT AKAYDVLKAT FPAGTLSGAP KPRALRLLDE LEPHRRGIYG GVVGYLDFAG DMDMAIAIRS ALLRDGRAYV QAGGGIVADS VNPTEALETV NKAAAPLRAV HTAGSLHNIT ADSVAEPGDA AGTDTAAR
|
| |