Gene Apar_0293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0293 
Symbol 
ID8413141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp338589 
End bp339836 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content50% 
IMG OID645021860 
Productamidohydrolase 
Protein accessionYP_003179315 
Protein GI257784098 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.356604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGT ATGCTTTTGT AGGCGGCAAG CTCGTTGACG GTACTGGCTC TGCACCAGTT 
GAGGACTCTC TTGTCCTTAT TGATGACGAC AAGATTACCT ATGCTGGTCC TCGTAAAGAG
GTTCCAGAGG GATATGAGGT ACGTGACGCA TCCGGTTATA CCGTAATGCC TGGTCTTGTT
GATACACACC TACACTTCTC CGGTAACCTG ACCGACAACG ATAACGATTG GGTTATTGAG
TCCGTTGCTC AGAAGCAGGC ATGCGCAGTC AAACAGGCTT ATGACGCTCT TACCCACGGT
CTTACCACTG TTGTTGAGAT TGGTCGTAAT GGTATTGCTA TCCGTGACCT CGTTAACATG
GGCATTATGC AGGGTCCTCG TATCTTTGCT ACCGGTCTTG GTCTTTGCCG CGTTGCTGGT
CACGGTGACT CTCATCACCT GCCATTGCAG ATCTCCAAGG ACGGACACCC TTGGGGTGAC
CAGGTAGACG GTCCATGGGA GCTTCGTAAG GCAATTCGTC GTCGTCTTCG TGAAGATCCT
GATGGTATCA AGATTTGGGC AACTGGTGGC GGCATTTGGC GCTGGGACTC TGACCGTTTG
CAGCTCTTCT GCACCGAGGA AATTAAGGCA ATTGCTGACG AGTGCGCACT GGTAGGTATT
CCTCTTTACG CTCACTCTTA TAACAACTTT GACGCTGCGT ATGACTGCGT CCGCTTTGGC
TGCAAGCAGC TCATTCACGG CTTTGAGATT GACGAGCGCA CCATGAAGCT TATGGCTGAG
CAGGGTACCT TCTTTACCCC AACTATCGGC TTCTTGCCAA CTTGGTACGG AACTTATCCA
CCAGACTGGA CTCCAGAGCT TGATGCATTC CCAGGTGAGA CTGTTGTCGA GAAGGGTCTT
GCACGTACCT ATGATAACCT GCGTAAGGCA TATGATATGG GCATTACCAT TACCATTGGT
TCCGACTCCT TCAGTTTTGT TACTCCTTAC GGCTATGTCA CCATCGACGA GATGTATGAC
TTTGTCGAGA AGGTTGGCAT TTCTATTCTT GATACCGTTG CAGCTGCTAC TTACAACGGC
GCAAAGATGC TGGGCAAGGA GAACGAGTTT GGTGCTGTCA AGGAAGGCCT CTCTGCTGAT
ATCCTTGTAG TTAAGGGCGA CGTTGCTAAT AACATCCGCG ACCTCACGCC TGAGAACATG
GACGTCATCA TGAAGGAAGG TAAGATTATC GATCGCGGTA GCTTCTAA
 
Protein sequence
MSKYAFVGGK LVDGTGSAPV EDSLVLIDDD KITYAGPRKE VPEGYEVRDA SGYTVMPGLV 
DTHLHFSGNL TDNDNDWVIE SVAQKQACAV KQAYDALTHG LTTVVEIGRN GIAIRDLVNM
GIMQGPRIFA TGLGLCRVAG HGDSHHLPLQ ISKDGHPWGD QVDGPWELRK AIRRRLREDP
DGIKIWATGG GIWRWDSDRL QLFCTEEIKA IADECALVGI PLYAHSYNNF DAAYDCVRFG
CKQLIHGFEI DERTMKLMAE QGTFFTPTIG FLPTWYGTYP PDWTPELDAF PGETVVEKGL
ARTYDNLRKA YDMGITITIG SDSFSFVTPY GYVTIDEMYD FVEKVGISIL DTVAAATYNG
AKMLGKENEF GAVKEGLSAD ILVVKGDVAN NIRDLTPENM DVIMKEGKII DRGSF