Gene Achl_3635 details

Gene Information

Locus tag: Achl_3635
Symbol:
ID: 7295116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 4041101
End bp: 4043260
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 70%
IMG OID: 643592041
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_002489680
Protein GI: 220914371
COG category: [E] Amino acid transport and metabolism
    [F] Nucleotide transport and metabolism
    [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
    [COG0572] Uridine kinase
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
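
The coordinate, gene-length, and protein-length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4041101
end_bp = 4043260
gene_length_bp = 2160
protein_length_aa = 719

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence of N bp holds N / 3 codons; the last one is assumed
# to be the stop codon, which does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.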


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACACCT GGCGCGGCAG GCGCTCTGAA AACGGTACGC TCTACTCCAT GAGCCCAGCC 
CCCGTCATCA TCGCCCTTGA CGGACGCTCG GGCGCAGGCA AGACCACCCT CGCCGTGGAA
CTGGCCGCGC GCCTGCGGGC GCGCCATAAA GTGTCGCTGT TCCACCTCGA GGACATCTAT
CCGGGCTGGA ACGGCCTGGC CGCCGGCATC GAACGCTACG TCAGCACCGT CCTGGGCCCG
CTGAGCCGCG GCGAGGCCGC CACCTGGACC AGCTGGGACT GGGAAAAGCA TTACGACGGC
GGCTCCCGGG TAACACTGCC CGCCGAGATT GTCATCGTGG AGGGAGTGGG TGCCGCGGCC
GCCGCCGCCC GGCCGTTCCT GGGCGCGGCC ATCTGGGCGG ACTCCCCGGA GGACGTCCGG
CGCACGCGCG CGCTGCAGCG GGACGGCGAA ACCTACGAGC CCTACTGGGA CCAGTGGGCG
GCCCAGGAGT CCGAATGGCT GGCCGGGGAC GATGTTCCCG GCGCGGCGGA CCTGCACATC
AGGAACGTGG CAGACGGCAG CGCACCGGAG GACGTGCTGC AACTCCTGCC GTACCTCCCG
GCGCTGGCAC AGGTCCTCGC CCCGGAGCTG TCCGCCCGGC GGGGCCTGAG CCTCCGTGCC
GAACGGCTCG ACGCACGCCC GCAGGCCGCC GAACTTTTCC ATGCCCTGTA CGGCACGTCC
GCCAACGCCG TCTGGCTCGA CTCGTCCAAC GCGGGAGCCG CGGATCCCGG CAGTGCGGAT
CCGAGCACCC GGACGGCCGC CGAACGCAGC CGGTTCAGCA TTCTGGCGGA CGACGGCGGC
ACGTTCGGCC AGTCGGTCAT GCACCGGTCC GGAGCCAGCC ACATCAGCGC CGGCTCCGTC
ACCGCCACCG TGGACGGACC GTTCTTCCGT TGGCTGGATA CCGTGTGGGG CCGCCGGGCG
GTCCGCGCCC CGGAGGGCTA CCCCGGAGAG TTCACCCTGG GGTGGCTGGG CTACCTGGGC
TACGAGCTGA AACGCGAAAC CGGCGGCACC GACGTCTCAG CGGCCACGCC GGACGCCGCA
CTGATCTTCG CCGGACGGGC GGTGGTCCTG GACCACGCGG AAGGCACCGC CTGGCTCCTG
GCACTGGACG CCCCGGACGC CGCCGAATGG CTGGACGCCG CACGGGCAGC CGTGCAGCGT
GCAGCGGGTG CGCCCGCGGC GTTGGACAGC GGGGACGGCA ACGAACCGGC CGGCTCCGGC
ATTGATGGCG GCAGCACCGC CGTCGGCAGC GCGTCCGTGC CCGTGTTCGC AAGCCGGGAC
AGCGGAGTGA CCTACCGCGA AAAGATCACC AAGTCCCAGC GTGAAATCGC CGAGGGAAAC
ACCTACGAAG TTTGCCTGAC CACCACCCTC GCGGCCCGCG TCCCGGGAGG CACCGTCGAC
CCGTGGCATA CGTACCTGGC ACTGCGCCGC CGGAATCCGG CACCGTTCGC CAGCTACCTG
GCCTTCGACG GGTTGGCGGT TGCCAGCACG TCGCCGGAAC GTTTCCTGCG GATAGCGTCC
GACGGCGGCA TGCGCGCCGA GCCGATCAAG GGCACCCGCC GCCGGGCTTC CGGCGCCGCT
GAGGATGCCG CCCTCCGGAC GGAGCTTGCC ACCTCGCTGA AGGACCGTGC CGAGAACATC
ATGATCGTTG ACCTGCTGCG GAATGACCTG AGCCATTTCG CTGTCCCCGG CTCCGTGACG
GTGAGCCGGC TGTGCGCCAT CGAAAGCTAC GCCACCGTGC ATCAGATGGT CAGCACTATC
GATGCCTCCC TGCCGCCGGG TTCCCCGCGG GCCGAAGCCG TGGCTGCCTG CTTCCCTGCC
GGTTCGATGA CAGGGGCACC GAAGATCAGC ACCATGGCCA TCCTCGACCA GCTGGAAGCC
GGCCCCCGAG GAATCTATTC GGGGGCCATC GGATACTTCT CGCTGAACGG TGCCACGGAC
CTGGCCGTCG CCATCCGGAC CCTGGTGATC CGCCGGGATG GTGACGGTAC TGCGGAACTG
AGTCTCGGCG TCGGCGGCGC CATCACGTCC GATTCCGTGC CGGACGAGGA ATACGACGAA
ATCCGCACCA AGGCCTACGG AGTCCTCTCG ACGCTCGGCG CAACTTTTCC GGACGCCTGA
 
Protein sequence
MHTWRGRRSE NGTLYSMSPA PVIIALDGRS GAGKTTLAVE LAARLRARHK VSLFHLEDIY 
PGWNGLAAGI ERYVSTVLGP LSRGEAATWT SWDWEKHYDG GSRVTLPAEI VIVEGVGAAA
AAARPFLGAA IWADSPEDVR RTRALQRDGE TYEPYWDQWA AQESEWLAGD DVPGAADLHI
RNVADGSAPE DVLQLLPYLP ALAQVLAPEL SARRGLSLRA ERLDARPQAA ELFHALYGTS
ANAVWLDSSN AGAADPGSAD PSTRTAAERS RFSILADDGG TFGQSVMHRS GASHISAGSV
TATVDGPFFR WLDTVWGRRA VRAPEGYPGE FTLGWLGYLG YELKRETGGT DVSAATPDAA
LIFAGRAVVL DHAEGTAWLL ALDAPDAAEW LDAARAAVQR AAGAPAALDS GDGNEPAGSG
IDGGSTAVGS ASVPVFASRD SGVTYREKIT KSQREIAEGN TYEVCLTTTL AARVPGGTVD
PWHTYLALRR RNPAPFASYL AFDGLAVAST SPERFLRIAS DGGMRAEPIK GTRRRASGAA
EDAALRTELA TSLKDRAENI MIVDLLRNDL SHFAVPGSVT VSRLCAIESY ATVHQMVSTI
DASLPPGSPR AEAVAACFPA GSMTGAPKIS TMAILDQLEA GPRGIYSGAI GYFSLNGATD
LAVAIRTLVI RRDGDGTAEL SLGVGGAITS DSVPDEEYDE IRTKAYGVLS TLGATFPDA
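
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction helper and a check of the first and last codons of the gene sequence. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers written for this illustration, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a space- or newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGCACACCT GGCGCGGCAG GCGCTCTGAA AACGGTACGC TCTACTCCAT GAGCCCAGCC"
last_line = "ATCCGCACCA AGGCCTACGG AGTCCTCTCG ACGCTCGGCG CAACTTTTCC GGACGCCTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

start_codon = head[:3]   # first codon of the coding sequence
stop_codon = tail[-3:]   # last codon of the coding sequence

print(len(head), start_codon, stop_codon)
print(round(gc_fraction(head + tail), 2))
```

The gene begins with GTG, which translation table 11 accepts as an alternative start codon read as methionine, matching the leading M of the protein sequence, and it ends with the stop codon TGA. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content listed in the record.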