Gene Arth_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3848 
Symbol 
ID4447600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4328121 
End bp4330169 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content70% 
IMG OID639691672 
Productanthranilate synthase, component II / anthranilate synthase, component I 
Protein accessionYP_833323 
Protein GI116672390 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I
[COG0572] Uridine kinase 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCTG CACCCGTGAT CATTGCCGTC GACGGCCGGT CCGGCGCAGG AAAAACCACC 
CTCGCAATCG AGCTTGCCGC GCGGCTGCGG GAGCATCACA AGGTGTCGCT GTTCCATCTG
GAGGACGTCT ATCCCGGCTG GAACGGCCTG GCGGCGGGCA TCGAACGTTA TGTCACCACC
GTGCTGGCCC CGCTGAGCCG CGGCGAAGCT GCCGAATGGG TCAGCTGGGA CTGGGAGAAC
CACTACGACG GCCAGTCGCG CACAACACTT CCGGCAGAGA TCGTCATCGT GGAGGGTGTG
GGCGCCGCTG CGGCATCCGC CCGCCCGCTG CTGGACGCCG TGGTCTGGGC GGAGTCCTCC
GACGTCGACC GCCGCACCCG CGCCCTGGCA CGCGACGGCA GCACTTACGA ACCGTTCTGG
GACCAGTGGG CAGCTCAGGA GGATGAGTGG CTGGCAACGG ACAGGGTGCA GGACCATGCC
GACATCCGGG TGCTCAATCT GGCCGACGGC ACCGCGCCGG ATGATGTCCT CCGGGCCCTC
CAGTACCTTC CCGCCCTTGC GGCCGTGCTG CTGCCCGAGC TCTCGGCCCG CCGCGGCCTT
CAGCTCCATG CCGAACGGCT GGACTGCGTT CCCGACGCCG CCCGGCTCTT CGGTGCACTC
TACGGCGACT CACCGAATGC CGTGTGGCTG GATTCCTCGC TCAGGACGGA CGGAGCGGCC
GCCGAGCGCA GCCGCTTCAG CATCATGGCG GACGACGGCG GCAGCTTCGG CCAGGACGTC
CGGCACCGTT CCGGCGTGAC CCGGCTGACC ACCGGCACCG CCACGGTTGC CATCGCCGGA
CCGTTCTTCC GCTGGCTCGA TGCCGTCTGG GGCCGCCGCG CCCTCCGCGC GCCGGAGGGC
TACCCCTGCG AATTCACCCT CGGCTGGCTG GGCTACCTGG GCTACGAACT CAAGCGCGAA
ACCGGCGGCA GCGACGTTCC GTCCGGAATC CCCGATGCCG CCATGATCTT CGCCGGCCGG
GCAATGGTCC TGGACCATCG GGATGGCAGC GTGTGGCTCC TGGCCCTGGA GGCTCCCGAT
GCCGCGGAGT GGCTGGCATT CGCCCGCTCA GCCGTCGCGG CGGCCGCCGG CCCCTCGGAA
CCGGGCGCCA GCGCCGCCGT CGTACCCGCA ACAGCGCCCG CCCCCGCATT CACGGGCCGT
GACACCGAGC AGGCCTACAA GCTCAAGATC GCCGACGCGC AGCGAGAAAT CGCCGAGGGA
AACACCTACG AAGTGTGCCT CACCACGGCC GTTAGCGCAG CCGTGCCGGC CGGTGCGGAC
GGATTGGATC CCTGGCGCAC CTACCTGGCG CTGCGCCGGA AAAACCCTGC ACCGTTCGCC
AGCTATCTGC GGTTCGGCGG CCTCACCGTC GCCAGCACGT CGCCGGAGCG TTTCCTGCGG
ATAGCGGCCG ACGGCGGCAT GCGCGCCGAG CCGATCAAGG GCACCCGCCC GCGGGCGGCG
GAACCGGTCC GGGACCGGCA GCTGCGCGAG GACCTGGAGT CCTCCCCCAA GGACCGGGCC
GAGAACATCA TGATCGTGGA CCTGCTCCGC AACGACCTCA GCCACTTCGC CGTGGCAGGG
TCTGTCACTG TCAGCAGGCT CTGCGCGATC GAAAGCTATG CCACGGTCCA CCAGATGGTC
AGCACCATCG ATGCGAAGCT GAGGCCGGGG CTGCCGCGCG CCGAGGCCGT CGCCGCATGC
TTTCCCGCGG GCTCCATGAC GGGGGCGCCG AAGATCAGCA CCATGGCCAT CCTGGACCGT
CTGGAAGGGG CGCCGCGCGG GGTCTACTCG GGCGCCATCG GCTATTTCTC CCTGAGCGGG
GCGATGGACA ACGCAGTGGC CATCCGGACC CTGGTGATCC GCGCCGACGG TGCCGGCGGC
ACTGAACTGA GCCTCGGCGT CGGCGGTGCC ATCACCGCGG ATTCCTCACC GCAGGAAGAG
TACGACGAAA TCCGGACCAA GGCGTACGGG GTCCTGTCCG CGCTGGGCGC GGACTTCCCG
GAAAGCTGA
 
Protein sequence
MTPAPVIIAV DGRSGAGKTT LAIELAARLR EHHKVSLFHL EDVYPGWNGL AAGIERYVTT 
VLAPLSRGEA AEWVSWDWEN HYDGQSRTTL PAEIVIVEGV GAAAASARPL LDAVVWAESS
DVDRRTRALA RDGSTYEPFW DQWAAQEDEW LATDRVQDHA DIRVLNLADG TAPDDVLRAL
QYLPALAAVL LPELSARRGL QLHAERLDCV PDAARLFGAL YGDSPNAVWL DSSLRTDGAA
AERSRFSIMA DDGGSFGQDV RHRSGVTRLT TGTATVAIAG PFFRWLDAVW GRRALRAPEG
YPCEFTLGWL GYLGYELKRE TGGSDVPSGI PDAAMIFAGR AMVLDHRDGS VWLLALEAPD
AAEWLAFARS AVAAAAGPSE PGASAAVVPA TAPAPAFTGR DTEQAYKLKI ADAQREIAEG
NTYEVCLTTA VSAAVPAGAD GLDPWRTYLA LRRKNPAPFA SYLRFGGLTV ASTSPERFLR
IAADGGMRAE PIKGTRPRAA EPVRDRQLRE DLESSPKDRA ENIMIVDLLR NDLSHFAVAG
SVTVSRLCAI ESYATVHQMV STIDAKLRPG LPRAEAVAAC FPAGSMTGAP KISTMAILDR
LEGAPRGVYS GAIGYFSLSG AMDNAVAIRT LVIRADGAGG TELSLGVGGA ITADSSPQEE
YDEIRTKAYG VLSALGADFP ES