Gene Apar_0111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0111 
Symbol 
ID8412954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp125674 
End bp127146 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content52% 
IMG OID645021678 
Productdihydropteroate synthase 
Protein accessionYP_003179138 
Protein GI257783921 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR01496] dihydropteroate synthase
[TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.119932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGTC AAAGCTACAA CACCTGGCAA TGTGGAGCTC ACACTATCTC GTTGGCTCGT 
CCTAGAATCA TGGGCGTTCT CAATGTCACT CCAGATTCAT TTTCTGATGG CGGAAAGAAC
TTTGATCCAG AGGCAGCAAT TGCTCGCGGA TTGCAGATGC TAGATGAGGG CGCTGACATT
ATTGATGTTG GTGGAGAGTC TACTCGTCCA GGTCACACGC CTGTTCTACC TACAGAAGAG
GCTGAACGCG TAGTACCTGT GGTGCGTGCG TTGGTAGCTG CTGGCGCAAT TGTTTCCATT
GACACGCGCC ATGCTGAGGT GGCTAAGATG TGCGTCCGTC TTGGAGTATC AATCATCAAC
GACGTCACGG GCTTCACTGA TCCAGAGATG GTTGCTGTAG CTGCGGAATC TGATTGTGGC
TGCATTGTGA TGCACTGGAA CAAAGAAGGC CTTGGTGCTC GTGTTGAGCG CAAGCAGGTT
CAGTTGGACG ATATGCGTCC TTCTCGCCCT ACTGGCAGTG CAGCTGCTCG AGCTGCAGCT
ACAGGTACTG AAACACGCAC ATTAAGTTCG CAACGTCGTT TTACCTTGCC TGAAGAGGCA
CCTATCATGC GTCAGATTAT GGGATTTTTG GGTGATCAGG CTCGCGGTTT GATGCGTGCT
GGTGTATCCA AGGACCGCAT CTGCATTGAC CCAGGTCCAG GTTTTGACAA GTTTGCCGAC
GAAGATGTTG TTATTCAGCG CGCTACTCGT TCTATGGTTT CTATGGGATA TCCGCTGCTC
TGTGCGGTGT CTAGAAAGCG CTTTGTTGGC GCTGTTTCTG GTGTAACTGA GACTACGCAG
CGTGATGCAG CTACGCACGC AATTTGCATT GCTGCAATTA CCAACGGTGC CCGTATCCTG
CGTGTTCACG ACGTTGCAGG AACCGCTCAA GCCATCAATG CCTATTGGGC TATGACTGAG
CACGACCCTC GTCAGGGATT TGTAGCGCTT GGCTCTAACG TGGGTGACCG CGTAGGCTAT
CTGGCTCGAG CTACACAGCT CATCGATGAA ATTCCACTGA CCTGCGTTGT TTCGGTCAGC
CACGCGTATG AGACCGAGCC TGCGTATGGT ATTGCCACGC CTGTGGTAAA TGCTGTTGCC
GAGATTAGAA CTGAGCTCCA TCCGCTTGTG CTTATGGATA AGCTGCTTGA GGTAGAGAAC
GCTTTGAACC GCACACGCAA GAAGGGCGAA GAGGGCCACG GTCCTCGTAC TATCGACTGC
GATCTTCTGT GGGTTGAGGG CGAACAACAC GCAGGTAAAC ACCTCACTCT GCCTCATCCA
CGCATGGGCG AACGTGATTA CGTACTTGTT CCTATGGAGG ATCTGATGCA TGATCCTGTA
CGCTTCTTCT CACATGGTGA CGTAGATATT GTCCCTCCTG ATCAGCGCGT CGGTCACGTA
ACCGAGGACT TGGGAGCCAT TACGTGGGAG TAG
 
Protein sequence
MLRQSYNTWQ CGAHTISLAR PRIMGVLNVT PDSFSDGGKN FDPEAAIARG LQMLDEGADI 
IDVGGESTRP GHTPVLPTEE AERVVPVVRA LVAAGAIVSI DTRHAEVAKM CVRLGVSIIN
DVTGFTDPEM VAVAAESDCG CIVMHWNKEG LGARVERKQV QLDDMRPSRP TGSAAARAAA
TGTETRTLSS QRRFTLPEEA PIMRQIMGFL GDQARGLMRA GVSKDRICID PGPGFDKFAD
EDVVIQRATR SMVSMGYPLL CAVSRKRFVG AVSGVTETTQ RDAATHAICI AAITNGARIL
RVHDVAGTAQ AINAYWAMTE HDPRQGFVAL GSNVGDRVGY LARATQLIDE IPLTCVVSVS
HAYETEPAYG IATPVVNAVA EIRTELHPLV LMDKLLEVEN ALNRTRKKGE EGHGPRTIDC
DLLWVEGEQH AGKHLTLPHP RMGERDYVLV PMEDLMHDPV RFFSHGDVDI VPPDQRVGHV
TEDLGAITWE