Gene Apar_0033 details


Gene Information

Locus tag: Apar_0033
Symbol:
ID: 8412875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 40777
End bp: 42270
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 49%
IMG OID: 645021602
Product: amidophosphoribosyltransferase
Protein accession: YP_003179063
Protein GI: 257783846
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase
TIGRFAM ID: [TIGR01134] amidophosphoribosyltransferase
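
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(40777, 42270)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```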


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTTC ACGAGGAATG TGGCGTCTTT GGTGTATGGG CGCCTGACCG TGACGTTGCT 
CGACTCACTT ATTTTGCGCT GCATGCGCTG CAGCATCGCG GGCAAGATTC CGCAGGTATA
GCCGTTGGTG ATGGGCAGAC CGTGCTTATC AGAAAAGACA CGGGCCTTGT AACTGAAGTT
TTCAACAACG ATGATCTCAA CGCAATGCCA GGTAAAGTGG CTATCGGTCA CTGTCGCTAT
GGTACCGCGG GCGCCAAAGG CTGGGAATCG GCTCAGCCGC ACATGTCGTC CATCGATGAG
ACCCTCATTG CTCTGGCGCA CAACGGCACT CTTGTTAACT TTGACTCCTT GCGGGAGGAA
CTATCTTCTC GCCAGATTTC CTTTAGATCA AACACGGATT CCGAGGTAGC GGCCCAGCTT
ATTGGCTACT TCACACATCA AACGCATCGA CTGCGTACCG GCATTGCCGC AACTATGAAC
CTTATCGAGG GCGGCTATGC TATGGTGCTT ATTCGTGAGA ACGCTCTCTA CGCGTTTAGG
GATCCTAACG GTATTCGTCC GCTGGTACTT GGCTATATTG GCGAGAAGAA CCAGAATAAC
TGGGTGGTTG CATCTGAGAC ATGTGCGCTG GATATTGTGG GTGCAACGTA TGTACGTGAG
GTTGCCCCTG GCGAAATTAT TCGCATATCA GACAACGGTC TGTCTTCTGA GATGGGACTT
TCTCCTCGTC AGCAGGCAGA TTGTATTTTT GAACAGGTGT ATTTCTCGCG CCCTGATTCT
ATTATTAGCG GTCGCTCAGT GTATTCTGTT CGTCATCAGA TGGGTCGACA ACTGGCAAAA
GAAACACCAG CAAATGTTGA TCTGGTAATT GGTGTACCGG ATTCTGGCGT TCCTGCAGCC
GAAGGTTTTG CACAAGAGTT GGACGTGTCA TTTGCTACCG GCTTAATTAA AAACCGCTAC
GTTGCAAGGA CGTTTATTCA ACCTACACAG CAGCTTCGTG AGCTGGGTGT TCGACTCAAG
CTCAATGCAC TGTCCGACGT TGTTGCAGAT AAGCGCATTG TGATGGTGGA TGATTCGGTT
GTTCGCGGCA CTACCTCAAA GCAGATTGTT CAGCTGCTCA GAGATGCAGG AGCTACAGAA
GTGCACGTCA GAAGCGCTTC TCCTAAGGTT GCGTGGCCAT GCTTCTACGG TATTGATACT
GCTGACCAAG ATCAGCTTGT TGCAGCAAAA ATGAGCACAG AGGAAATTTG CGAGTATATC
GGTGCAGACT CTTTGGGCTT CTTAAGCATT GAAGGTCTGC TTGCATGCGT GCCATCAAGG
GGTTATTGCG AGTCATGTTT TAATGGCAGG TATCCTGTAG CAATTCCAAA AGACTTCCAT
GGAAGGTTTT TGCCAGAAAA CACACCAGAC AATCTGAATC CGTCTTTTGC ACTAAAATTT
GACCAGGTAG AACAGCAGTT AATTGACGAG GGTCTTATGC CCTCAGAGCA GTAA
 
Protein sequence
MELHEECGVF GVWAPDRDVA RLTYFALHAL QHRGQDSAGI AVGDGQTVLI RKDTGLVTEV 
FNNDDLNAMP GKVAIGHCRY GTAGAKGWES AQPHMSSIDE TLIALAHNGT LVNFDSLREE
LSSRQISFRS NTDSEVAAQL IGYFTHQTHR LRTGIAATMN LIEGGYAMVL IRENALYAFR
DPNGIRPLVL GYIGEKNQNN WVVASETCAL DIVGATYVRE VAPGEIIRIS DNGLSSEMGL
SPRQQADCIF EQVYFSRPDS IISGRSVYSV RHQMGRQLAK ETPANVDLVI GVPDSGVPAA
EGFAQELDVS FATGLIKNRY VARTFIQPTQ QLRELGVRLK LNALSDVVAD KRIVMVDDSV
VRGTTSKQIV QLLRDAGATE VHVRSASPKV AWPCFYGIDT ADQDQLVAAK MSTEEICEYI
GADSLGFLSI EGLLACVPSR GYCESCFNGR YPVAIPKDFH GRFLPENTPD NLNPSFALKF
DQVEQQLIDE GLMPSEQ
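
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are illustrative, and the example translates only the first 30 bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGAACTTC ACGAGGAATG TGGCGTCTTT"))  # MELHEECGVF
```

The output matches the first block of the protein sequence listed above.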