Gene Information |
Locus tag | Apar_0974 |
Symbol | |
ID | 8413845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1099151 |
End bp | 1102018 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645022562 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_003179994 |
Protein GI | 257784777 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
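
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1099151
END_BP = 1102018
GENE_LENGTH_BP = 2868
PROTEIN_LENGTH_AA = 955


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

# Compare the computed values with the values reported in the record.
print(computed_gene == GENE_LENGTH_BP)        # True
print(computed_protein == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 1102018 - 1099151 + 1 gives 2868 bp, and 2868 / 3 - 1 gives 955 aa, matching the reported Gene Length and Protein Length.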
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000578394 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGCA GTTCTCAAGA TTCAAATCTA GAGTCCAGCA ATGCTTCTTT TGACGAGAAA CCCATCGATA GTGTAGCAAC TTCTGAGGTA GCTCAGTCGC TTCTTGAGCG ACGTGGAAAT GAGATTGCTG CTGCTCGCAC CCTTATTCAG GCGCTTAAAG CACCAGCTCC ACTGCGTGAT AACCTTACCT TCTTTTTACG CTTGGTAAGA AAAGTGCTTG CTGAGTACAA TCCAGATCTT CTTACTAGTT TTGATACGTT GCTGGTTGAG GCAATTAAGG CTGGACCAGA TGATTTGTCT ACAACTCGTG CGCCCTTGTT ACTTGAGGCA ATTAATGCTG TGCTCAAGGT AGATTCAGAG AAGGAATCGC TTAACGCTTT CCTTCGTTTT GCCTCTACCA TCGATAGCTT GAGCATGGAA GACTCCGTGC TTCTTATGCG TGCGTTTGTA ACTTTTTTCC ACTTGGCAAA TCTCTGTGAA GAAAATTATC GCGTTACCAG CTTGCGTTCT CGTGAGGCTT CTGTTGATTC ATCTTCTGCA GAAGATCCAA TCAACGAGAT AACTGTTGCA TATAGGCAGC TCATTGACGA GTGTGGTGAA GAAGAGGCAA AGGCACGTCT TAATCGTCTT GAGTTCCATC CTGTTTTTAC TGCTCATCCA ACTGAGGCTC GCCGTAAAAA TGTTGAACGT CGAATCCGCA TTATTTCAGA GCTTCTTGAT GAACGCCAAA GACTTGGCGG TCCTGCTCGT GTGGAAAATG AGCGTCGCAT GCTGCAGGAA ATTGACGGCC TGTTCCGTAC ATCTCCTATC GGTCATAAAA AGCCAACTCC TTTGGAGGAA GCAGATACCG TCCTTAATAT CTTTGACACT ACTCTCTTTG AGATGGTTCC TTCGGTGTAT CGTCGCTTTG ATAACTGGGC GCTGGGAGAA AACGCTGGTT GTGTACCACC TGTCTGTCCG CCGTTCTTTC GCCCAGGCAG CTGGATTGGC TCCGATCGCG ATGGTAACCC TAACGTCACT GCTTTAGTTT CTCGCCAAGT TGCCGAGAAG TACCGTGTAC ACGTACTACA AGCACTGGCT GAAGCTACTA AGGAGGTCAG TCGAGGTCTG ACGCTTGATG GTATTTCAAC CCCAGCTTCA CCCGCACTTG CTAATCTTTG GGCGCAGCAG GTAGAGATGA GCCAGGCTCT GACGTCTCGC GCTGTTGATA AGGCTGGGTC TGAGTTGCAT CGTGCGGCTA TGCGCGTAAT TTCTGGTCGA CTCAGCGCCA CCATTGAACG CAATGCTGAC CTTATGTATC AAAACGCCGA AGAGTTTATT GCTGACCTGC GTGTTATTCA GGATTCGCTT GTTCAGGCAG GAGCTCGTCG TATTGCCTAC GGTCCTATTC AGAGGCTTAT CTGGCAGGCT CAGACCTTTG GTTTTCACTT GGTAGAGATG GAGTTCCGTC AGCACTCTTT GGTACACAAA CGTGCACTTG CTGATCTTGA GGCACATCCA TCAACTGCCG AGAAACCCGC AAAGCTTGAT GCTATGACGC AGGAGGTGCT TGATACCTTC CGATCTATTG GCTCAATCCA AAAGAAGAAC GGTATTAACG CTGCTCGTCG TTATATTATT TCGTTTACAC AGTCTGCTCA GGATGTTGAG AATGTTTACA AACTGGCACG CCTTGCGTTT GCAAATGAAG AAGATGTTCC CGTACTAGAC GTTATTCCGT TGTTTGAACA GATTGAAGAT CTTGAGAACG CTGTTACCAC ACTTGATCAG GTAATTCAGA TTCCTGAGGT ACAGGAGCGT 
CTTACTCAGA CAGACCGCAA GCTTGAGGTT ATGCTAGGCT ACTCAGATTC TTCAAAGGAT GAAGGTCCAA CTACTGCAAC ACTGGTACTG CATAAAACTC AAGCCGCTTT GGCGGAGTGG GCAGAGAAGA ACTCCATTGA TCTTATCCTG ATGCACGGCC GCGGTGGTGC TGTTGGTCGT GGCGGTGGTC CTGCAAACCG CGCTGTTTTG TCTCAGCCTA AGGGATCTGT TAACGGCCGC TTTAAACTTA CCGAGCAAGG AGAGGTTATC TTTGCTCGTT ATGGAGACCC AACGCTTGCT CGCCGTCACG TAGAGTCTGT TGCAGGAGCA ACGCTGCTTC AGATGGCACC TTCTCTTGAG CAGAAGAATA CTCATGCAGA TGTGAAGTTT GCTTCATTAG CTTCTGAGCT TGATAAGGCT TCCAAACAGC GCTTCTTAGA ACTCATTCAC TCTGATGGTT TTGCCGAGTG GTTCTCGGTT GTTACCCCTC TGACTGAGAT TGGTCTTCTC CCTATTGGTT CAAGACCTGC AAAACGTGGT CTTGGTGCAA AGTCTCTTGA TGATTTACGC GCAATCCCAT GGATTTTCTC GTGGTCACAG GCTCGTATCA ACCTAGCTGC TTGGTACGGA TTGGGTAGTG CATGTGAGGC TGTGGGAGAT ATTGAGCGTC TTCGTGAGGC ATACAAGGAG TGGCCGCTGT TCACTACGTT TATTGACAAC ATCGAGATGT CAATCTCGAA GGTTGATGCT CGTATCGCAA GACTTTACCT GGCTTTGGGA GATCGCCCTG AGCTTTCAGA GATGGTTCTT TCTGAGATGT CGTTGACACG TAAGTGGGTC CTTGCTATTA CCGGTAACAA GTGGCCTCTG GAGAACCGTC GCGTGCTAGG ACCTGTTATT CGTCTGCGTC TACCATTTGT TAACATTCTC TCGGTTACGC AGGTACATGC CCTTTCTGAG CTTCGTACAA GGGACGACAT GCTTACTCCA GAGGAGCGTG CAAATATCAC GTATCTGATT TTGTGCACTG TGTCTGGTGT TGCTGCGGGT CTGCAGAATA CGGGCTGA
|
Protein sequence | MAGSSQDSNL ESSNASFDEK PIDSVATSEV AQSLLERRGN EIAAARTLIQ ALKAPAPLRD NLTFFLRLVR KVLAEYNPDL LTSFDTLLVE AIKAGPDDLS TTRAPLLLEA INAVLKVDSE KESLNAFLRF ASTIDSLSME DSVLLMRAFV TFFHLANLCE ENYRVTSLRS REASVDSSSA EDPINEITVA YRQLIDECGE EEAKARLNRL EFHPVFTAHP TEARRKNVER RIRIISELLD ERQRLGGPAR VENERRMLQE IDGLFRTSPI GHKKPTPLEE ADTVLNIFDT TLFEMVPSVY RRFDNWALGE NAGCVPPVCP PFFRPGSWIG SDRDGNPNVT ALVSRQVAEK YRVHVLQALA EATKEVSRGL TLDGISTPAS PALANLWAQQ VEMSQALTSR AVDKAGSELH RAAMRVISGR LSATIERNAD LMYQNAEEFI ADLRVIQDSL VQAGARRIAY GPIQRLIWQA QTFGFHLVEM EFRQHSLVHK RALADLEAHP STAEKPAKLD AMTQEVLDTF RSIGSIQKKN GINAARRYII SFTQSAQDVE NVYKLARLAF ANEEDVPVLD VIPLFEQIED LENAVTTLDQ VIQIPEVQER LTQTDRKLEV MLGYSDSSKD EGPTTATLVL HKTQAALAEW AEKNSIDLIL MHGRGGAVGR GGGPANRAVL SQPKGSVNGR FKLTEQGEVI FARYGDPTLA RRHVESVAGA TLLQMAPSLE QKNTHADVKF ASLASELDKA SKQRFLELIH SDGFAEWFSV VTPLTEIGLL PIGSRPAKRG LGAKSLDDLR AIPWIFSWSQ ARINLAAWYG LGSACEAVGD IERLREAYKE WPLFTTFIDN IEMSISKVDA RIARLYLALG DRPELSEMVL SEMSLTRKWV LAITGNKWPL ENRRVLGPVI RLRLPFVNIL SVTQVHALSE LRTRDDMLTP EERANITYLI LCTVSGVAAG LQNTG
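
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be cleaned before analysis, and how a GC fraction could be computed from it, follows. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGCAGGCA GTTCTCAAGA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.5
```

Applying the same two helpers to the full gene sequence would give a value to compare against the GC content field in the Gene Information table.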