Gene Apar_0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0459 
Symbol 
ID8413308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp528756 
End bp530111 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content45% 
IMG OID645022027 
Productaminodeoxychorismate lyase 
Protein accessionYP_003179481 
Protein GI257784264 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACTC CATCTAATGA TTCTTCTCCG CGTCGTCAGG GCGCACATTT TTCTTCTTCT 
GTATCAGAGG GACAGGCCCA GGAAGAACAG ATACAGGAAA AACAAACTCA ACAAGACTCA
GCGCACTATA ACGCTGGTTC GCAAGAGGCG CTTCAGCAGC CTCCTGTTGA AGCTACGGGA
TCGCTGCCTG CACTTACTGG CGGTAAGCTT TCATCAAGAA GTGCTCAAGT TACGCATAAG
GCAAAAAATA AGCAGGTCAA GCATAGAGAC AGGAAGGCTT CAAAGTGCTC TCGTATTTTT
GCAACGCTTA TCGCTTTTGT TATGGTTGCA GCTCTTGGCA TTTTTGTGTG GAAAGTTGCA
CTTCCAGAGC TTTCTCGCAC TAACTCTGAT ACACAAGAAA TTACCGCTGG TCAGCAAGTT
ACTGTTACCA TTCCAGATGG TGCTGGTGCG CAGGAAGTTG CAAAGATTCT TTTTGAGAAC
AAGATTATTG CTACTAAGAG CGAGTTCTTA GATCAGGTAA AGCGCCAGGA TGCTGAGCAG
AAAATCAAGA GTGGTAGTTA TGTTATTACC ACTGGTACCA AGCCAGCAGA CATCGTACAC
CTTCTTGTTT CTGGTCCAAA TGCCCCTGGC AGTGGCTTCG TCGTACCAGA AGGCTATACC
GTTTCTCAAG TTGCTGATTT GGCTCAGAAC TACTTTGGCA TTTCTCGCGA TGATTTCTTA
AATCAGGCAA AAGCATCCAA TTATGTTGCT GACTATCCGT TCCTTGCCGG TGCGGTAGAT
GCTAACGATT CTCTTGAAGG TTATTTGTTC CCTAAGACGT ATACCTTTAC GGAAAGCAAC
GTAACCGCTG ATACTGTCAT TCGTGCCATG CTTGATCAAT TTAAGGCAGA GACGGCCAAT
CTTAACTTGG ACGCTGCTCG TATTACGCTC AACAAACGTT ACAACTTGAA TCTTACTAAC
GAGCAAATCA TTACCATGGC ATCAATTATT GAGCGAGAAG CTCTGACTGA TGAGGATCGT
CCTAAGGTTG CGTCTGTTTT CTACAACCGC CTGTATGATG ATATGTATCT ACAAAGTGAT
GCAACTCTTG CATATTCCTT GGGTAGAGAA GCTACTGCTG AAGAGTTAAG CTCAATGACA
AGCGATCCGT ATAATACCTA CGCGTTCAAG GGCTTGACCC CTACGCCTAT TTGCTCTCCA
GGTTATGCCT CTATTAAGGC GGCAATGGAT CCAGCAGCAA CCAATTATTA CTACTTCTGG
ATTACCTCAG ATGAACATGT ATTCTCTGAG ACTTATGACG AGCATCAACA GGCTATTGAA
AACGCACGCG AGCGTGAAGC CGCAAGCAAA CAGTAA
 
Protein sequence
MPTPSNDSSP RRQGAHFSSS VSEGQAQEEQ IQEKQTQQDS AHYNAGSQEA LQQPPVEATG 
SLPALTGGKL SSRSAQVTHK AKNKQVKHRD RKASKCSRIF ATLIAFVMVA ALGIFVWKVA
LPELSRTNSD TQEITAGQQV TVTIPDGAGA QEVAKILFEN KIIATKSEFL DQVKRQDAEQ
KIKSGSYVIT TGTKPADIVH LLVSGPNAPG SGFVVPEGYT VSQVADLAQN YFGISRDDFL
NQAKASNYVA DYPFLAGAVD ANDSLEGYLF PKTYTFTESN VTADTVIRAM LDQFKAETAN
LNLDAARITL NKRYNLNLTN EQIITMASII EREALTDEDR PKVASVFYNR LYDDMYLQSD
ATLAYSLGRE ATAEELSSMT SDPYNTYAFK GLTPTPICSP GYASIKAAMD PAATNYYYFW
ITSDEHVFSE TYDEHQQAIE NAREREAASK Q