Gene Apar_0974 details


Gene Information

Locus tag: Apar_0974
Symbol:
ID: 8413845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1099151
End bp: 1102018
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 48%
IMG OID: 645022562
Product: Phosphoenolpyruvate carboxylase
Protein accession: YP_003179994
Protein GI: 257784777
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
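
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table.
START_BP = 1099151
END_BP = 1102018

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2868
print(protein_length(length_bp))  # 955
```

Under these assumptions the results match the listed Gene Length (2868 bp) and Protein Length (955 aa).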


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 5.78394e-05
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGCA GTTCTCAAGA TTCAAATCTA GAGTCCAGCA ATGCTTCTTT TGACGAGAAA 
CCCATCGATA GTGTAGCAAC TTCTGAGGTA GCTCAGTCGC TTCTTGAGCG ACGTGGAAAT
GAGATTGCTG CTGCTCGCAC CCTTATTCAG GCGCTTAAAG CACCAGCTCC ACTGCGTGAT
AACCTTACCT TCTTTTTACG CTTGGTAAGA AAAGTGCTTG CTGAGTACAA TCCAGATCTT
CTTACTAGTT TTGATACGTT GCTGGTTGAG GCAATTAAGG CTGGACCAGA TGATTTGTCT
ACAACTCGTG CGCCCTTGTT ACTTGAGGCA ATTAATGCTG TGCTCAAGGT AGATTCAGAG
AAGGAATCGC TTAACGCTTT CCTTCGTTTT GCCTCTACCA TCGATAGCTT GAGCATGGAA
GACTCCGTGC TTCTTATGCG TGCGTTTGTA ACTTTTTTCC ACTTGGCAAA TCTCTGTGAA
GAAAATTATC GCGTTACCAG CTTGCGTTCT CGTGAGGCTT CTGTTGATTC ATCTTCTGCA
GAAGATCCAA TCAACGAGAT AACTGTTGCA TATAGGCAGC TCATTGACGA GTGTGGTGAA
GAAGAGGCAA AGGCACGTCT TAATCGTCTT GAGTTCCATC CTGTTTTTAC TGCTCATCCA
ACTGAGGCTC GCCGTAAAAA TGTTGAACGT CGAATCCGCA TTATTTCAGA GCTTCTTGAT
GAACGCCAAA GACTTGGCGG TCCTGCTCGT GTGGAAAATG AGCGTCGCAT GCTGCAGGAA
ATTGACGGCC TGTTCCGTAC ATCTCCTATC GGTCATAAAA AGCCAACTCC TTTGGAGGAA
GCAGATACCG TCCTTAATAT CTTTGACACT ACTCTCTTTG AGATGGTTCC TTCGGTGTAT
CGTCGCTTTG ATAACTGGGC GCTGGGAGAA AACGCTGGTT GTGTACCACC TGTCTGTCCG
CCGTTCTTTC GCCCAGGCAG CTGGATTGGC TCCGATCGCG ATGGTAACCC TAACGTCACT
GCTTTAGTTT CTCGCCAAGT TGCCGAGAAG TACCGTGTAC ACGTACTACA AGCACTGGCT
GAAGCTACTA AGGAGGTCAG TCGAGGTCTG ACGCTTGATG GTATTTCAAC CCCAGCTTCA
CCCGCACTTG CTAATCTTTG GGCGCAGCAG GTAGAGATGA GCCAGGCTCT GACGTCTCGC
GCTGTTGATA AGGCTGGGTC TGAGTTGCAT CGTGCGGCTA TGCGCGTAAT TTCTGGTCGA
CTCAGCGCCA CCATTGAACG CAATGCTGAC CTTATGTATC AAAACGCCGA AGAGTTTATT
GCTGACCTGC GTGTTATTCA GGATTCGCTT GTTCAGGCAG GAGCTCGTCG TATTGCCTAC
GGTCCTATTC AGAGGCTTAT CTGGCAGGCT CAGACCTTTG GTTTTCACTT GGTAGAGATG
GAGTTCCGTC AGCACTCTTT GGTACACAAA CGTGCACTTG CTGATCTTGA GGCACATCCA
TCAACTGCCG AGAAACCCGC AAAGCTTGAT GCTATGACGC AGGAGGTGCT TGATACCTTC
CGATCTATTG GCTCAATCCA AAAGAAGAAC GGTATTAACG CTGCTCGTCG TTATATTATT
TCGTTTACAC AGTCTGCTCA GGATGTTGAG AATGTTTACA AACTGGCACG CCTTGCGTTT
GCAAATGAAG AAGATGTTCC CGTACTAGAC GTTATTCCGT TGTTTGAACA GATTGAAGAT
CTTGAGAACG CTGTTACCAC ACTTGATCAG GTAATTCAGA TTCCTGAGGT ACAGGAGCGT
CTTACTCAGA CAGACCGCAA GCTTGAGGTT ATGCTAGGCT ACTCAGATTC TTCAAAGGAT
GAAGGTCCAA CTACTGCAAC ACTGGTACTG CATAAAACTC AAGCCGCTTT GGCGGAGTGG
GCAGAGAAGA ACTCCATTGA TCTTATCCTG ATGCACGGCC GCGGTGGTGC TGTTGGTCGT
GGCGGTGGTC CTGCAAACCG CGCTGTTTTG TCTCAGCCTA AGGGATCTGT TAACGGCCGC
TTTAAACTTA CCGAGCAAGG AGAGGTTATC TTTGCTCGTT ATGGAGACCC AACGCTTGCT
CGCCGTCACG TAGAGTCTGT TGCAGGAGCA ACGCTGCTTC AGATGGCACC TTCTCTTGAG
CAGAAGAATA CTCATGCAGA TGTGAAGTTT GCTTCATTAG CTTCTGAGCT TGATAAGGCT
TCCAAACAGC GCTTCTTAGA ACTCATTCAC TCTGATGGTT TTGCCGAGTG GTTCTCGGTT
GTTACCCCTC TGACTGAGAT TGGTCTTCTC CCTATTGGTT CAAGACCTGC AAAACGTGGT
CTTGGTGCAA AGTCTCTTGA TGATTTACGC GCAATCCCAT GGATTTTCTC GTGGTCACAG
GCTCGTATCA ACCTAGCTGC TTGGTACGGA TTGGGTAGTG CATGTGAGGC TGTGGGAGAT
ATTGAGCGTC TTCGTGAGGC ATACAAGGAG TGGCCGCTGT TCACTACGTT TATTGACAAC
ATCGAGATGT CAATCTCGAA GGTTGATGCT CGTATCGCAA GACTTTACCT GGCTTTGGGA
GATCGCCCTG AGCTTTCAGA GATGGTTCTT TCTGAGATGT CGTTGACACG TAAGTGGGTC
CTTGCTATTA CCGGTAACAA GTGGCCTCTG GAGAACCGTC GCGTGCTAGG ACCTGTTATT
CGTCTGCGTC TACCATTTGT TAACATTCTC TCGGTTACGC AGGTACATGC CCTTTCTGAG
CTTCGTACAA GGGACGACAT GCTTACTCCA GAGGAGCGTG CAAATATCAC GTATCTGATT
TTGTGCACTG TGTCTGGTGT TGCTGCGGGT CTGCAGAATA CGGGCTGA
 
Protein sequence
MAGSSQDSNL ESSNASFDEK PIDSVATSEV AQSLLERRGN EIAAARTLIQ ALKAPAPLRD 
NLTFFLRLVR KVLAEYNPDL LTSFDTLLVE AIKAGPDDLS TTRAPLLLEA INAVLKVDSE
KESLNAFLRF ASTIDSLSME DSVLLMRAFV TFFHLANLCE ENYRVTSLRS REASVDSSSA
EDPINEITVA YRQLIDECGE EEAKARLNRL EFHPVFTAHP TEARRKNVER RIRIISELLD
ERQRLGGPAR VENERRMLQE IDGLFRTSPI GHKKPTPLEE ADTVLNIFDT TLFEMVPSVY
RRFDNWALGE NAGCVPPVCP PFFRPGSWIG SDRDGNPNVT ALVSRQVAEK YRVHVLQALA
EATKEVSRGL TLDGISTPAS PALANLWAQQ VEMSQALTSR AVDKAGSELH RAAMRVISGR
LSATIERNAD LMYQNAEEFI ADLRVIQDSL VQAGARRIAY GPIQRLIWQA QTFGFHLVEM
EFRQHSLVHK RALADLEAHP STAEKPAKLD AMTQEVLDTF RSIGSIQKKN GINAARRYII
SFTQSAQDVE NVYKLARLAF ANEEDVPVLD VIPLFEQIED LENAVTTLDQ VIQIPEVQER
LTQTDRKLEV MLGYSDSSKD EGPTTATLVL HKTQAALAEW AEKNSIDLIL MHGRGGAVGR
GGGPANRAVL SQPKGSVNGR FKLTEQGEVI FARYGDPTLA RRHVESVAGA TLLQMAPSLE
QKNTHADVKF ASLASELDKA SKQRFLELIH SDGFAEWFSV VTPLTEIGLL PIGSRPAKRG
LGAKSLDDLR AIPWIFSWSQ ARINLAAWYG LGSACEAVGD IERLREAYKE WPLFTTFIDN
IEMSISKVDA RIARLYLALG DRPELSEMVL SEMSLTRKWV LAITGNKWPL ENRRVLGPVI
RLRLPFVNIL SVTQVHALSE LRTRDDMLTP EERANITYLI LCTVSGVAAG LQNTG
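
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as sample input; this is not the method the database used to derive its reported value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as displayed on the page.
first_row = "ATGGCAGGCA GTTCTCAAGA TTCAAATCTA GAGTCCAGCA ATGCTTCTTT TGACGAGAAA"

seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared against the GC content listed in the Gene Information table.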