Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0613 |
Symbol | |
ID | 8413472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 685719 |
End bp | 688940 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645022190 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003179634 |
Protein GI | 257784417 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.438109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.344128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAGC GTACTGATAT CAAAAAGATT TTGGTTATCG GTTCTGGTCC AATTGTTATT GGTCAGGCTT GTGAGTTTGA CTATTCCGGC ACACAGGCTT GCCGCGCCCT TCGCAAAGAG GGTTTTGAGG TTGTTTTGGT CAACTCAAAT CCAGCAACCA TCATGACTGA CCCAGAAACT GCAGATAGAA CGTACGTTGA GCCAATCACT GTTGAGTCCG TTACTCGTGT CATCGAGCGC GAGCGTCCCG ATGCACTTTT GCCAAATATG GGCGGTCAGA CTGCTTTGAA CTGCACCATT GGTCTTGGCG AGGCAGGTGT TCTTGACAAG TACAATATCG AGGTTATCGG CTGCAACCTG GATTCTATCC GTACCGGTGA AGACCGCGAG CTCTTTAGCG AGGCGGTCCA GGACATTGGC CTGGAGGTTG CTCGCGCAGA CATTGCTCAT TCTATGGAAG ACGCTCAGCG CATTGTTGCC GATTTGGGAT ATCCTGTGGT CATTCGTCCA AGCTTTACTC TTGGTGGCGC TGGCGGCGGT ATCGCGCACA CTCCAGAAGA GCTCGTAGAG ATTGTGGAGC AAGGCCTCTT GCTTTCTCCT GAGCACGAGG TGCTGGTAGA GGAGTCCATC GAGGGCTGGA AAGAGATTGA GATGGAGGTT ATGCGCGATA CCACAGGTAA CGGTATTGTT GTTTGCTCCA TCGAAAATCT TGATCCTATG GGTGTCCACA CCGGAGACTC TATCACCGTT GCTCCAGCTC AGACGCTTAC CGATAAGGAG CTCCAGAACC TCCGTGATTA CTCCATTGCT ATTCTCGAGC GTGTAGGTGT TGCCTGCGGT GGTTCCAACG TTCAGTTTGC GGTCAATCCA ACTAATGGCC GCGTCATCGT TATTGAGATG AATCCTCGCG TTAGCCGTTC TTCCGCGCTT GCTTCCAAGG CAACCGGTTT CCCAATTGCA AAGATGGCTG CGCTGCTCTC TGTTGGTTAT ACGCTTGATG AGATTACCAA CGACATCACC AAGGCAACTC CAGCAGCTTT TGAGCCTTCT ATAGACTACT GCGTGGTTAA GGTCCCTCGC TTTGCCTTCG TCAAGTTCAA GGGCACCAGC CGCGTTCTGA CAACCCGTAT GAAGTCGGTT GGCGAGGTCA TGGCTATGGG CAGAACCTTT GAGGAGGCCC TGCAGAAGGC GCTTCGTTCT TTGGAGCAAG ATCGCGCCGG TCTGGGTGCT GATGGTCACG ATGCCTTTGA CGAGAAGAAC TTTGATGAGC TGGTCAGCAG ACCAACACCA GAGCGTATTT TCTATGTTGC TGAAGCTCTG CGTAGAAACT GGAGCGTTGA GCGCATCCAC GATATGACCG GTATTGATCC TTGGTACCTG CATCGTATGG CTGGCATCAT TAACGCCGAG AAACACATCA AGAAACTGGG GCTTTCTGGT CTGACTACCC AAAATATGCT TGCTGCAAAA CAGCTGGGCT TTTCGGATGA GCAGCTTGCG TACCTTACTG GTACCAAGGC AGACGTTGTC CGTGCCGTTA GGGAAGTGCT GGGTGTGCGC CCACAGATTA AGACTGTAGA TACCTGCGCA GGTGAGTTTG GTGCAACTAC GCAGTACCAC TATGTCACTT ACGAGAAGGG CAACGCTACA GAGTACGTTA AGGCTGAAAA GCCACGCGTC ATGATCCTCT CCGCAGGTCC TAATCGCATT GGTCAAGGTA TTGAGTTTGA CTACTGCTGC GTACATGCTT CGTATGCTCT TCGTGAGCAG GGCTATGAGA CTGTCATGGT CAATTGCAAT CCAGAAACCG TATCCACAGA CTACGACACC TCTGACCGTC TGTATTTCCA GCCTTTGACT TTTGAAGATG TCATGGACGT CATTGAGGTT GAGAAGCCAG AAGGCGTTAT TGTCACCCTT GGTGGACAGA CCCCAATTAA GCTTGCGCGC GCTCTGAAGG ATGCCGGCGT TCCTATCATG GGTACCCAGC CAGAGGCTAT TGACCTGGCA GAGGACCGAG ACCGCTTCGC AGCCCTTCTA GACCGCCTCA ACATTGCCTG CCCGCCATCG GCAGTTGCAT CAACTATGGA CGAGGCAAGA GATGCCGCTC GCCGCATTGG TTACCCATTG ATAGTCCGCC CAAGCTATGT TCTTGGTGGT CGTGGTATGG CTATTGTCTA CGATGACTCT GACTTGGTTA CTTACATGAA GTCCGCTACA CACGTCACAC CAGATCGTCC GGTCTACTTG GATGCCTTCC TTGAGGACGC TATTGAGCTG GACGTTGATG CTCTTTGCGA CACCGAAGAG TGCTACGTAG GTTCTGTCCT GGAGCACATT GAGGAGTGTG GCATTCACTC TGGCGACTCT GCTTGCTGCT GGCCGCCCTT CTCGCTCTCT GAAAAGATTG TTGGCCAGAT TAGAGCTATC ACCAAAAAAT TGGCGCTTGC CTGCGACATC CGAGGTTTGC TGAATATCCA GTACGCTGTT CGTGACGAAC ATGTCTTTGT CATCGAGCTC AATCCTCGAG CTTCTAGAAC TGTGCCTTTC TCGTCTAAGG CAACTGGCGT CTCCCTGGCT AAGTTTGCAT CTCGTATCAT GGCTGGCGAG AAAATCAGTG AGCTTAAAGC ACAAGGTCTG CTCCCTGATG AGAATCGTAG CGTTGACTAC TATGCGGTTA AAGAGGCGGT TATGCCTTGG TCCAGGTTTC CTGGCGCCGA CTCAATCCTT GGTCCTGAGA TGAAGTCTAC TGGTGAGGTC ATGGGCATTG CTCGTACCTT CCCAGCAGCG TATGCAAAGA CTCGTGAGGC AGTTGAAAAT AAGCTTCCTG AGCAGGGCTC AGTCTTTATC AGTGTGTGCG ACCGCGATAA GCGTGCCATT GCTCCTGTTG CTATGGCTCT AGAGAACCTT GGTTACGGCA TCTACACTAC GGGTGGTACA GCAAAAACGC TGCGTGCGGC TGGTATCAAC TGTACTACTG TCAATCGTAT TTCCGATGGT CATCCAAACG TCGTTGACCT TATGCGCGAT AAGACCGTCA GCTTTATTAT CAATACGCCT CACGGTCACG AGGCCCACAG TGATGGCACC AAGATGCGTG CAGAGGCTGT CAGCCAGGGT ATTACCTGCG TTACTGCAAT GTCTGCAGCA ACTGCTCTTA TCCAAGCACT CGCGGCAGCA AGAAAGAGTA AGCCAGAGAC CTTTGCTCTG CAAGATCTTT AA
|
Protein sequence | MPKRTDIKKI LVIGSGPIVI GQACEFDYSG TQACRALRKE GFEVVLVNSN PATIMTDPET ADRTYVEPIT VESVTRVIER ERPDALLPNM GGQTALNCTI GLGEAGVLDK YNIEVIGCNL DSIRTGEDRE LFSEAVQDIG LEVARADIAH SMEDAQRIVA DLGYPVVIRP SFTLGGAGGG IAHTPEELVE IVEQGLLLSP EHEVLVEESI EGWKEIEMEV MRDTTGNGIV VCSIENLDPM GVHTGDSITV APAQTLTDKE LQNLRDYSIA ILERVGVACG GSNVQFAVNP TNGRVIVIEM NPRVSRSSAL ASKATGFPIA KMAALLSVGY TLDEITNDIT KATPAAFEPS IDYCVVKVPR FAFVKFKGTS RVLTTRMKSV GEVMAMGRTF EEALQKALRS LEQDRAGLGA DGHDAFDEKN FDELVSRPTP ERIFYVAEAL RRNWSVERIH DMTGIDPWYL HRMAGIINAE KHIKKLGLSG LTTQNMLAAK QLGFSDEQLA YLTGTKADVV RAVREVLGVR PQIKTVDTCA GEFGATTQYH YVTYEKGNAT EYVKAEKPRV MILSAGPNRI GQGIEFDYCC VHASYALREQ GYETVMVNCN PETVSTDYDT SDRLYFQPLT FEDVMDVIEV EKPEGVIVTL GGQTPIKLAR ALKDAGVPIM GTQPEAIDLA EDRDRFAALL DRLNIACPPS AVASTMDEAR DAARRIGYPL IVRPSYVLGG RGMAIVYDDS DLVTYMKSAT HVTPDRPVYL DAFLEDAIEL DVDALCDTEE CYVGSVLEHI EECGIHSGDS ACCWPPFSLS EKIVGQIRAI TKKLALACDI RGLLNIQYAV RDEHVFVIEL NPRASRTVPF SSKATGVSLA KFASRIMAGE KISELKAQGL LPDENRSVDY YAVKEAVMPW SRFPGADSIL GPEMKSTGEV MGIARTFPAA YAKTREAVEN KLPEQGSVFI SVCDRDKRAI APVAMALENL GYGIYTTGGT AKTLRAAGIN CTTVNRISDG HPNVVDLMRD KTVSFIINTP HGHEAHSDGT KMRAEAVSQG ITCVTAMSAA TALIQALAAA RKSKPETFAL QDL
|
| |