Gene Apar_1275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1275 
Symbol 
ID8414155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1432133 
End bp1433380 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content54% 
IMG OID645022867 
Productamidohydrolase 
Protein accessionYP_003180290 
Protein GI257785073 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGA TTGCTTTTGT AGGCGGAAAG CTCGTTGACG GCACTGGCGC CGCTCCTGTT 
GAGGATTCTC TTGTTCTCGT AGACGACAAG AAGATCGCTT ATGCTGGTCC TCGTACTGAG
GTTCCAGAGG GCTATGAGGT CCGTGACATT CAGGGCCGTA CCATTATGCC AGGTCTTATC
GATACCCACC TGCACTTCTC CGGCAACCTG ACCGACAACG ATAACGACTG GGTTATCGAG
TCTGTTGCTC AGAAGCAGGC ATGCGCAGTC AAGCAGTCCT GGGACGCTCT CTCCCACGGT
CTTACCACCG TCTGCGAGAT TGGCCGCAAT GGTATCGCAA TTCGTGACCT CGTTAACATG
GGTGTCATGG AAGGCCCACG TATTTACGCT ACCGGCCTTG GCTTCTGCCG CGTTGCTGGT
CATGGTGACT GCCACCAGCT CCCACAGGAG ATTTCCAAAA ATGGTCATCC ATGGGGCGAC
CAGGTAGATG GTCCTTGGGA TCTTCGTAAG GCAGTTCGTC GTCGCCTTCG TGAGAACCCA
GACGCAATTA AGATTTGGGC TACCGGTGGC GGAATTTGGC ACTGGGACTC CGGTCGAGAT
CAGCACTACT GCTACGAGGA GATCAAGGCA GTTTGCGACG AGGCAGCAAT GGTTGGCATT
CCTGTTTGGT CACACTCCTA TAACAGCTTC TCCGCAGCTT ATGACTCCGT CCGTTGCGGC
TGCGAGCAGA TGATTCACGG CTTCGAGCTT GATGCAAAGA CTCTTGATCT TATGTGCGAG
CAGGGCACCT TCTTCACTCC TACCATCGGC TTCCTGCCAA CTTGGTACGG AACTTATCCA
CCAGAGTGGA CTCCAGAGCT TGACGCATTC CCAGGTGATA CTGTTGTCGA GAAGGGCCTC
AACCGTACAT ACGAGAACCT CCGCAACGCT AACGAGCGCG GCGTTACTCT CACCATCGGT
TCCGACTCCT TCAGCTTCGT GACTCCTTAC GGCTGGGTTA CCATCGATGA GATGTATGAC
TTTGTTGAGA AGGCTGGCGT CACCATTCTT GAGACTATCA AGGCTGCTAC CCTTAACGGC
GCAAAGATGG TTCACCATGA GGATGAGTTT GGCTCCCTCG AGGCTGGTAA GCTTGCTGAC
CTCCTGGTTG TCAAGGGCGA CGTTGCTACT AACATCCGCG ACCTTACCCC AGATAACATG
GAAGTTATCA TGAAGGAAGG CAGCGAGGTT ATTCGCGGCA CCTTCTAA
 
Protein sequence
MSKIAFVGGK LVDGTGAAPV EDSLVLVDDK KIAYAGPRTE VPEGYEVRDI QGRTIMPGLI 
DTHLHFSGNL TDNDNDWVIE SVAQKQACAV KQSWDALSHG LTTVCEIGRN GIAIRDLVNM
GVMEGPRIYA TGLGFCRVAG HGDCHQLPQE ISKNGHPWGD QVDGPWDLRK AVRRRLRENP
DAIKIWATGG GIWHWDSGRD QHYCYEEIKA VCDEAAMVGI PVWSHSYNSF SAAYDSVRCG
CEQMIHGFEL DAKTLDLMCE QGTFFTPTIG FLPTWYGTYP PEWTPELDAF PGDTVVEKGL
NRTYENLRNA NERGVTLTIG SDSFSFVTPY GWVTIDEMYD FVEKAGVTIL ETIKAATLNG
AKMVHHEDEF GSLEAGKLAD LLVVKGDVAT NIRDLTPDNM EVIMKEGSEV IRGTF