Gene Apar_1330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1330 
Symbol 
ID8414215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1496311 
End bp1497900 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content45% 
IMG OID645022927 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_003180345 
Protein GI257785128 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00539756 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000346935 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCTGAGG CCCCTATCAA ACGTGCGCTA ATTTCGGTAA CAGATAAAAC GGGTATTGTT 
GAATTTGCAC AAACTCTTAC TAAAGAGTTT GGTGTTGAAG TAATTTCAAC AGGTGGAACC
GCAAAAACCC TTGAAGAGGC TGGTGTCCCC GTAGTTCCTA TTGAGTCTTA TACCGGATTT
CCAGAAATGA TGGACGGTCG TGTTAAGACG CTGCATCCTC GTGTTCATGG TGGTCTTTTA
TGCCGTAGAG ATAATTCCGG TCATGTCGCA GACGCAGAGA ATAATGGTAT TGGCATGATT
GACCTGGTCT GCGTTAATCT CTATGAGTTT GAGAAGACTG TAGCTGATCC ATCAGTAACT
CTTGAAAATG CAATTGAGCA TATCGATATC GGTGGACCTT CAATGCTCCG CTCTGCTGCA
AAGAATAATG ATTTTGTTAC GGTTGTTGTT GATCCGGCAG ATTATGGTCG TGTTCTTGAT
GAGATGCGTG TCCATGACGG TGCAACTACA AGGGCTTCTC GCCAGCAGCT GGCTTTGAAG
GTATTTAAGA CAACGGCTGC ATACGATGGC GCTATTGCCG CATACCTTTC TGGTGTTGTT
GAAGCAGAGC AAAGTAAATT CCCAGAGACT TTGCTGGTAA AGGCAACAAA GGAGCAAGAT
CTTCGTTACG GAGAAAATCC TCAGCAGTCC GCAGCGTTTT ACAAGATGCC TGGCGCTCCT
GCACACTCCC TAGCAAATGC TCAGCAACTT CAGGGTAAGC CTTTGTCTTA CAACAATTTG
TTGGATACCG ATGCAGCTTG GGCGGCTGTT CGTGAGTTTG ATGATCCATC AGTCATTATT
TTGAAGCATC AGAATCCTTG TGGTTCTGCA ACAGCAGAAA ATGTTATTGA GGCATATGAC
CGGGCATTTG CTTGTGATCC TCTTTCTGCA TTTGGTGGAA TTATTGCAGT GAACAGAGAA
GTTCCACTGG AGTTTGTGGA GCATTTTGCA GATATCAATA AGCAGTTTGT TGAGGTTCTT
ATTGCATCAA GTTTCACGGA AGAGGCTCTT GAGCGACTGT CAAAGAGAAC AAATCTTCGC
GTATTAGCTA CGGGCGGAAT CGATAGAAGT CGTGAGCTCG AAATGAGAAC TGTTGATGGT
GGTCTTTTAG TGCAAGACCT TGATCATGCT GATGAAACTG CGGATAGCTT TGAGGTTGTC
ACAAAGCGTC AACCAACTTC AGAAGAGTTG TCTGATTTGG TATTTGCTTG GAAGGTCTGT
AAGACCGTTA AGTCTAATGC AATTCTGGTC GCAAAAGATC AGGCTGGAAT TGGTATGGGA
CCAGGTCAGC CCAACCGTGT TGATGCTGCT CTTCTCGCAT GTGAGCGTGC TGAAGCAGCT
TGCGAGCGTA TGGGAATTGA TTCAAAGAAC CTTGTGGCTG CATCTGACGC ATTCTTCCCA
TTCCGAGATA ACGTTGACAC GCTGGCAGCT CATGGCGTAA CAGCTATTAT TCAGCCAGGT
GGATCAGTTA GAGACGATGA ATCCATTGCT GCTTGTGATG AATATGGTAT TGCAATGGTG
TTTACGGGAA AGCGACACTT TAGGCACTAA
 
Protein sequence
MAEAPIKRAL ISVTDKTGIV EFAQTLTKEF GVEVISTGGT AKTLEEAGVP VVPIESYTGF 
PEMMDGRVKT LHPRVHGGLL CRRDNSGHVA DAENNGIGMI DLVCVNLYEF EKTVADPSVT
LENAIEHIDI GGPSMLRSAA KNNDFVTVVV DPADYGRVLD EMRVHDGATT RASRQQLALK
VFKTTAAYDG AIAAYLSGVV EAEQSKFPET LLVKATKEQD LRYGENPQQS AAFYKMPGAP
AHSLANAQQL QGKPLSYNNL LDTDAAWAAV REFDDPSVII LKHQNPCGSA TAENVIEAYD
RAFACDPLSA FGGIIAVNRE VPLEFVEHFA DINKQFVEVL IASSFTEEAL ERLSKRTNLR
VLATGGIDRS RELEMRTVDG GLLVQDLDHA DETADSFEVV TKRQPTSEEL SDLVFAWKVC
KTVKSNAILV AKDQAGIGMG PGQPNRVDAA LLACERAEAA CERMGIDSKN LVAASDAFFP
FRDNVDTLAA HGVTAIIQPG GSVRDDESIA ACDEYGIAMV FTGKRHFRH