Gene Arth_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1684 
Symbol 
ID4445790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1876496 
End bp1878082 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content66% 
IMG OID639689505 
Productanthranilate synthase, component I 
Protein accessionYP_831178 
Protein GI116670245 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.102471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGACC TTGGAATCAT CAGCCCGGGC CTCGAGGAAT TCCGGGAGCT CGCCAGCCAC 
AGCCGTGTCA TCCCCGTCCG GCTGAAGGTC CTGGCCGACG CCGAGACCCC CATCGGGCTC
TACCGCAAGC TGGCGAACGG CCAGCCCGGC ACCTTCCTGA TGGAGTCCGC GGCCGTGGGC
GGTTCGTGGT CCAGGTATTC CTTTATCGGC GCCAAGTCCC GCGCCACCCT GACCACCAAG
GACGGGCAGG CGCACTGGCT GGGAAAGCCG CCTGCCGGCG TGCCGGTCGA CGGGAACCCC
GTGGATGCCA TCCGTGACAC CGTGGAAGCC CTGCGCACCG ACCGCTTCGA AGGCCTGCCG
CCGTTCACCT CGGGCCTGGT CGGGTTCCTC GGCTGGGAAA CCGTACGGCA TTGGGAAAAA
CTCACCAGCC CGCCCGAGGA CGACCTGGAA CTTCCGGAAC TGGCCCTGAA CCTGGTCACG
GACATGGCAG TGCACGACAA CATGGACGGC ACTGTCCTGT TGATCGCGAA CGCCATCAAC
TTCGACAACA GCACGGAACG CGTGGATGAG GCATGGCACG ATGCCGTGGC CCGGGTCAAG
GCGCTGCTGG CCAAGGTCAG CACTCCCGTG GAACAGCCGA TTTCGGTGCT GGAACCGGCC
GCCCTGGACT TTGCCTCCAG TGTCCAGGAA CGCTGGAATG AGCCCGACTA CCTGGCTGCC
CTGGACCGCG GCAAGGAAGC GATCGTTGAC GGGGAAGTGT TCCAGGTGGT CATCTCCCGC
CGCTTCGAAA TGGAGTGCGG TGCCGATCCG CTGGATGTGT ACCGGGTGTT GCGGAATACC
AACCCCAGCC CGTACATGTA CATCTTCAGC CTCGAAGACG CCGCCGGCCG GCAGTACTCG
ATTGTGGGTT CTTCCCCTGA GGCCCTTGTG ACGGTCACCG GCGAGGAGGT CATCACCCAC
CCCATCGCCG GATCACGTCC CCGGGGCAAG ACCGTGGAGG CGGACAAGAC CTTTGCCGAG
GAGCTCCTTG CCGACCAGAA GGAACGCGCC GAGCACCTCA TGCTGGTGGA CCTGTCCCGC
AACGACCTCT CCAAGGTGTG CGTAGCCGGC ACGGTGGATG TCACGCAGTT CATGGAGGTG
GAGCGCTTCA GCCACATCAT GCACCTGGTG TCCACGGTGG TGGGCAAACT CGCCCCGACC
GCCAAAGCCT ATGACGTGCT GAAGGCAACG TTCCCGGCCG GTACTCTCTC CGGCGCCCCG
AAACCCCGTG CCCTGCGGCT CCTCGACGAG TTGGAACCGC ACCGCCGCGG CATCTACGGC
GGCGTAGTGG GCTACCTGGA CTTTGCCGGC GACATGGACA TGGCCATCGC CATCCGTTCT
GCGCTGCTCC GCGACGGCCG CGCCTACGTC CAGGCCGGCG GCGGCATCGT TGCCGACTCG
GTGAACCCGA CGGAAGCCCT GGAAACGGTG AACAAGGCTG CCGCGCCGCT GCGGGCCGTC
CACACGGCGG GGTCACTGCA CAACATCACG GCCGATTCCG TTGCGGAGCC GGGCGATGCT
GCCGGCACAG ACACAGCGGC CCGTTGA
 
Protein sequence
MQDLGIISPG LEEFRELASH SRVIPVRLKV LADAETPIGL YRKLANGQPG TFLMESAAVG 
GSWSRYSFIG AKSRATLTTK DGQAHWLGKP PAGVPVDGNP VDAIRDTVEA LRTDRFEGLP
PFTSGLVGFL GWETVRHWEK LTSPPEDDLE LPELALNLVT DMAVHDNMDG TVLLIANAIN
FDNSTERVDE AWHDAVARVK ALLAKVSTPV EQPISVLEPA ALDFASSVQE RWNEPDYLAA
LDRGKEAIVD GEVFQVVISR RFEMECGADP LDVYRVLRNT NPSPYMYIFS LEDAAGRQYS
IVGSSPEALV TVTGEEVITH PIAGSRPRGK TVEADKTFAE ELLADQKERA EHLMLVDLSR
NDLSKVCVAG TVDVTQFMEV ERFSHIMHLV STVVGKLAPT AKAYDVLKAT FPAGTLSGAP
KPRALRLLDE LEPHRRGIYG GVVGYLDFAG DMDMAIAIRS ALLRDGRAYV QAGGGIVADS
VNPTEALETV NKAAAPLRAV HTAGSLHNIT ADSVAEPGDA AGTDTAAR