Gene Information |
Locus tag | Xaut_3582 |
Symbol | |
ID | 5421497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 4004419 |
End bp | 4005843 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640882836 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001418467 |
Protein GI | 154247509 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00051468 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTGCCCAG CGTTTCGAAT CGGTCGCCCT GCCGTGCTTA TCGTTGAAAT TCCGTACCGC GACCCGTTCG CCGTCCTCCA CGCCTTCGCC GATGCGCCCT ATTGCGCCTT CCTCGACAGC GCCGCGGAGG GGGACGCGCG GGCACGCTGG TCCTATCTGG GGGCCGATCC GTTCCAGGTC ATCACCAGCG CGGCCGATGG CGTGCGCGTG GACGGACGGC TCGTGCCCGG CACTCCGTTC GAGGTTCTGT CTGTGGCGCT GGCCGCCAAT GCGGCGCCGG CCGAACCCTC CCCGGTGCCG TTTCGCGGCG GCGCCATGGG GTATCTCGGC TATGAGCTGG GCCGCCATCT GGAGCGCCTG CCCGCCCCCC GCCCGGCAGC CCCGGCCATT CCCGAGCTGG TTCTCGGCCT CTACGACACG GTGCTCGCCT TCGACCGTCT GGAGCGCCGC GCCTACATCG TTTCCACCGG CCGGCCGGAG CATGGCGCCG CCGCCAAGGC GCGCGCGCAG GTCCGGGCCG AGGCGATCCG CCGCCGGCTG GAGACGGCCC CCGCCATCGC GCCGGCGCCG GATTTTTCCC GCACCGGCCG CTTCGTGCCG GAGCAGCCGC GCGCCCAGGT GGAAGCCGCC ATCGCGCGGG TGATCGAATA CATCCGCGCC GGCGACATCT TCCAGGCCAA CCTCACCCAG CAGATGCGCG CGGCGCGGCC CCAGGGGCTC AGCGATCTAG CCCTCTACAC CCGCCTGCGC GCGCTCTCCC CCTCCCCCTT CGCCGCCTTC CTGCGCGCCG GACCGGAGCT TGCCGTGCTC AGCGCCTCAC CGGAGCGCTT CCTCTCCCTC GATCCCGACG GGCGGGTGGA GACGCGGCCC ATCAAGGGCA CCCGCCGCCG CAGCCCGGAC CCGCAGGAGG ACGCGCGGCT TGCCGCCGCC CTCCTCGCCT CGCCCAAGGA CCGGGCGGAA AACCTGATGA TCGTGGACCT GCTGCGCAAC GACCTCTCGC GGGTCTGCAA GGTGGGCAGC GTGAAGGTGC CGGCCCTGTG CGCGCTGGAG ACCTTCGCCA GTGTGCATCA CCTGGTCTCG GTGGTGGAAG GCCGGCTGAA AGACGGCCTC GGCCCGGTGG ACCTGCTCAC CGCCTGCTTT CCCGGCGGGT CCATCACGGG AGCGCCGAAG ATCCGCGCCA TGGAGATCAT CCACGAGCTG GAACCGGTGC CGCGCGGGGT CTATTGCGGC TCGGTGTGCT GGATCGGCTT CGACGGGGCG ATGGATTCCT CCATCGTCAT CCGCACCATA ACCCGGGCCG GGGAGACGCT GCTGGCCCAG GCGGGCGGCG GCATCGTGGC GGATTCCGAT CCGGCGGACG AATATGAGGA GAGCCTGGTG AAGCTCTCGC CCATGCTGCG GGCCTTGTCG GGGGAGACAT CGTGA
|
Protein sequence | MCPAFRIGRP AVLIVEIPYR DPFAVLHAFA DAPYCAFLDS AAEGDARARW SYLGADPFQV ITSAADGVRV DGRLVPGTPF EVLSVALAAN AAPAEPSPVP FRGGAMGYLG YELGRHLERL PAPRPAAPAI PELVLGLYDT VLAFDRLERR AYIVSTGRPE HGAAAKARAQ VRAEAIRRRL ETAPAIAPAP DFSRTGRFVP EQPRAQVEAA IARVIEYIRA GDIFQANLTQ QMRAARPQGL SDLALYTRLR ALSPSPFAAF LRAGPELAVL SASPERFLSL DPDGRVETRP IKGTRRRSPD PQEDARLAAA LLASPKDRAE NLMIVDLLRN DLSRVCKVGS VKVPALCALE TFASVHHLVS VVEGRLKDGL GPVDLLTACF PGGSITGAPK IRAMEIIHEL EPVPRGVYCG SVCWIGFDGA MDSSIVIRTI TRAGETLLAQ AGGGIVADSD PADEYEESLV KLSPMLRALS GETS
|
| |
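The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the listed sequence ends in TGA). Under translation table 11, the TTG at the start of the sequence is an alternative initiation codon read as methionine, which matches the leading M of the protein.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above (Xaut_3582).
length_bp = gene_length(4004419, 4005843)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

With the coordinates above, the two results agree with the Gene Length (1425 bp) and Protein Length (474 aa) fields. `gc_content` can be applied to the Gene sequence field to compare against the reported GC content.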