Gene Apar_0613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0613 
Symbol 
ID8413472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp685719 
End bp688940 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content52% 
IMG OID645022190 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003179634 
Protein GI257784417 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.438109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.344128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGC GTACTGATAT CAAAAAGATT TTGGTTATCG GTTCTGGTCC AATTGTTATT 
GGTCAGGCTT GTGAGTTTGA CTATTCCGGC ACACAGGCTT GCCGCGCCCT TCGCAAAGAG
GGTTTTGAGG TTGTTTTGGT CAACTCAAAT CCAGCAACCA TCATGACTGA CCCAGAAACT
GCAGATAGAA CGTACGTTGA GCCAATCACT GTTGAGTCCG TTACTCGTGT CATCGAGCGC
GAGCGTCCCG ATGCACTTTT GCCAAATATG GGCGGTCAGA CTGCTTTGAA CTGCACCATT
GGTCTTGGCG AGGCAGGTGT TCTTGACAAG TACAATATCG AGGTTATCGG CTGCAACCTG
GATTCTATCC GTACCGGTGA AGACCGCGAG CTCTTTAGCG AGGCGGTCCA GGACATTGGC
CTGGAGGTTG CTCGCGCAGA CATTGCTCAT TCTATGGAAG ACGCTCAGCG CATTGTTGCC
GATTTGGGAT ATCCTGTGGT CATTCGTCCA AGCTTTACTC TTGGTGGCGC TGGCGGCGGT
ATCGCGCACA CTCCAGAAGA GCTCGTAGAG ATTGTGGAGC AAGGCCTCTT GCTTTCTCCT
GAGCACGAGG TGCTGGTAGA GGAGTCCATC GAGGGCTGGA AAGAGATTGA GATGGAGGTT
ATGCGCGATA CCACAGGTAA CGGTATTGTT GTTTGCTCCA TCGAAAATCT TGATCCTATG
GGTGTCCACA CCGGAGACTC TATCACCGTT GCTCCAGCTC AGACGCTTAC CGATAAGGAG
CTCCAGAACC TCCGTGATTA CTCCATTGCT ATTCTCGAGC GTGTAGGTGT TGCCTGCGGT
GGTTCCAACG TTCAGTTTGC GGTCAATCCA ACTAATGGCC GCGTCATCGT TATTGAGATG
AATCCTCGCG TTAGCCGTTC TTCCGCGCTT GCTTCCAAGG CAACCGGTTT CCCAATTGCA
AAGATGGCTG CGCTGCTCTC TGTTGGTTAT ACGCTTGATG AGATTACCAA CGACATCACC
AAGGCAACTC CAGCAGCTTT TGAGCCTTCT ATAGACTACT GCGTGGTTAA GGTCCCTCGC
TTTGCCTTCG TCAAGTTCAA GGGCACCAGC CGCGTTCTGA CAACCCGTAT GAAGTCGGTT
GGCGAGGTCA TGGCTATGGG CAGAACCTTT GAGGAGGCCC TGCAGAAGGC GCTTCGTTCT
TTGGAGCAAG ATCGCGCCGG TCTGGGTGCT GATGGTCACG ATGCCTTTGA CGAGAAGAAC
TTTGATGAGC TGGTCAGCAG ACCAACACCA GAGCGTATTT TCTATGTTGC TGAAGCTCTG
CGTAGAAACT GGAGCGTTGA GCGCATCCAC GATATGACCG GTATTGATCC TTGGTACCTG
CATCGTATGG CTGGCATCAT TAACGCCGAG AAACACATCA AGAAACTGGG GCTTTCTGGT
CTGACTACCC AAAATATGCT TGCTGCAAAA CAGCTGGGCT TTTCGGATGA GCAGCTTGCG
TACCTTACTG GTACCAAGGC AGACGTTGTC CGTGCCGTTA GGGAAGTGCT GGGTGTGCGC
CCACAGATTA AGACTGTAGA TACCTGCGCA GGTGAGTTTG GTGCAACTAC GCAGTACCAC
TATGTCACTT ACGAGAAGGG CAACGCTACA GAGTACGTTA AGGCTGAAAA GCCACGCGTC
ATGATCCTCT CCGCAGGTCC TAATCGCATT GGTCAAGGTA TTGAGTTTGA CTACTGCTGC
GTACATGCTT CGTATGCTCT TCGTGAGCAG GGCTATGAGA CTGTCATGGT CAATTGCAAT
CCAGAAACCG TATCCACAGA CTACGACACC TCTGACCGTC TGTATTTCCA GCCTTTGACT
TTTGAAGATG TCATGGACGT CATTGAGGTT GAGAAGCCAG AAGGCGTTAT TGTCACCCTT
GGTGGACAGA CCCCAATTAA GCTTGCGCGC GCTCTGAAGG ATGCCGGCGT TCCTATCATG
GGTACCCAGC CAGAGGCTAT TGACCTGGCA GAGGACCGAG ACCGCTTCGC AGCCCTTCTA
GACCGCCTCA ACATTGCCTG CCCGCCATCG GCAGTTGCAT CAACTATGGA CGAGGCAAGA
GATGCCGCTC GCCGCATTGG TTACCCATTG ATAGTCCGCC CAAGCTATGT TCTTGGTGGT
CGTGGTATGG CTATTGTCTA CGATGACTCT GACTTGGTTA CTTACATGAA GTCCGCTACA
CACGTCACAC CAGATCGTCC GGTCTACTTG GATGCCTTCC TTGAGGACGC TATTGAGCTG
GACGTTGATG CTCTTTGCGA CACCGAAGAG TGCTACGTAG GTTCTGTCCT GGAGCACATT
GAGGAGTGTG GCATTCACTC TGGCGACTCT GCTTGCTGCT GGCCGCCCTT CTCGCTCTCT
GAAAAGATTG TTGGCCAGAT TAGAGCTATC ACCAAAAAAT TGGCGCTTGC CTGCGACATC
CGAGGTTTGC TGAATATCCA GTACGCTGTT CGTGACGAAC ATGTCTTTGT CATCGAGCTC
AATCCTCGAG CTTCTAGAAC TGTGCCTTTC TCGTCTAAGG CAACTGGCGT CTCCCTGGCT
AAGTTTGCAT CTCGTATCAT GGCTGGCGAG AAAATCAGTG AGCTTAAAGC ACAAGGTCTG
CTCCCTGATG AGAATCGTAG CGTTGACTAC TATGCGGTTA AAGAGGCGGT TATGCCTTGG
TCCAGGTTTC CTGGCGCCGA CTCAATCCTT GGTCCTGAGA TGAAGTCTAC TGGTGAGGTC
ATGGGCATTG CTCGTACCTT CCCAGCAGCG TATGCAAAGA CTCGTGAGGC AGTTGAAAAT
AAGCTTCCTG AGCAGGGCTC AGTCTTTATC AGTGTGTGCG ACCGCGATAA GCGTGCCATT
GCTCCTGTTG CTATGGCTCT AGAGAACCTT GGTTACGGCA TCTACACTAC GGGTGGTACA
GCAAAAACGC TGCGTGCGGC TGGTATCAAC TGTACTACTG TCAATCGTAT TTCCGATGGT
CATCCAAACG TCGTTGACCT TATGCGCGAT AAGACCGTCA GCTTTATTAT CAATACGCCT
CACGGTCACG AGGCCCACAG TGATGGCACC AAGATGCGTG CAGAGGCTGT CAGCCAGGGT
ATTACCTGCG TTACTGCAAT GTCTGCAGCA ACTGCTCTTA TCCAAGCACT CGCGGCAGCA
AGAAAGAGTA AGCCAGAGAC CTTTGCTCTG CAAGATCTTT AA
 
Protein sequence
MPKRTDIKKI LVIGSGPIVI GQACEFDYSG TQACRALRKE GFEVVLVNSN PATIMTDPET 
ADRTYVEPIT VESVTRVIER ERPDALLPNM GGQTALNCTI GLGEAGVLDK YNIEVIGCNL
DSIRTGEDRE LFSEAVQDIG LEVARADIAH SMEDAQRIVA DLGYPVVIRP SFTLGGAGGG
IAHTPEELVE IVEQGLLLSP EHEVLVEESI EGWKEIEMEV MRDTTGNGIV VCSIENLDPM
GVHTGDSITV APAQTLTDKE LQNLRDYSIA ILERVGVACG GSNVQFAVNP TNGRVIVIEM
NPRVSRSSAL ASKATGFPIA KMAALLSVGY TLDEITNDIT KATPAAFEPS IDYCVVKVPR
FAFVKFKGTS RVLTTRMKSV GEVMAMGRTF EEALQKALRS LEQDRAGLGA DGHDAFDEKN
FDELVSRPTP ERIFYVAEAL RRNWSVERIH DMTGIDPWYL HRMAGIINAE KHIKKLGLSG
LTTQNMLAAK QLGFSDEQLA YLTGTKADVV RAVREVLGVR PQIKTVDTCA GEFGATTQYH
YVTYEKGNAT EYVKAEKPRV MILSAGPNRI GQGIEFDYCC VHASYALREQ GYETVMVNCN
PETVSTDYDT SDRLYFQPLT FEDVMDVIEV EKPEGVIVTL GGQTPIKLAR ALKDAGVPIM
GTQPEAIDLA EDRDRFAALL DRLNIACPPS AVASTMDEAR DAARRIGYPL IVRPSYVLGG
RGMAIVYDDS DLVTYMKSAT HVTPDRPVYL DAFLEDAIEL DVDALCDTEE CYVGSVLEHI
EECGIHSGDS ACCWPPFSLS EKIVGQIRAI TKKLALACDI RGLLNIQYAV RDEHVFVIEL
NPRASRTVPF SSKATGVSLA KFASRIMAGE KISELKAQGL LPDENRSVDY YAVKEAVMPW
SRFPGADSIL GPEMKSTGEV MGIARTFPAA YAKTREAVEN KLPEQGSVFI SVCDRDKRAI
APVAMALENL GYGIYTTGGT AKTLRAAGIN CTTVNRISDG HPNVVDLMRD KTVSFIINTP
HGHEAHSDGT KMRAEAVSQG ITCVTAMSAA TALIQALAAA RKSKPETFAL QDL