Gene Information |
Locus tag | Amuc_1251 |
Symbol | |
ID | 6275403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1501602 |
End bp | 1504805 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613308 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_001877857 |
Protein GI | 187735745 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
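
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Amuc_1251.
length = gene_length_bp(1501602, 1504805)
print(length)                     # 3204
print(protein_length_aa(length))  # 1067
```

Under these assumptions both results agree with the Gene Length (3204 bp) and Protein Length (1067 aa) fields listed in the record.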
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAAG ACACCTCCAT CAAAAAGATT CTCGTGATCG GGTCCGGCCC GATCGTCATC GGCCAGGGCT GCGAATTCGA CTACTCCGGC ACCCAGGCCT GCAAAGCCCT CCGGGAAGAG GGTTATGAAG TAGTGCTGGT CAATTCCAAT CCCGCCACCA TCATGACGGA TCCGGAGACA GCTCCACGCA CTTATATTGA ACCGATTACT CCGGAGTTCG TGGAGAAAAT CATCATCCGG GAAAAACCGG ATGCCCTCCT CCCGACCCTG GGCGGGCAGA CAGCCCTGAA CTGCGCCATG GAACTGCACC GCAGCGGCGT GCTGGAAAGG GAAAAGGTGC GTATGATCGG CGCCAACGCA AACGCCATTG ACCGCGGGGA GGACCGCAAC CTGTTCAAGG AAGCCATGCT CCGCATTGGC CTGGACGTGC CGCGTTCCGG CATTGCGCAT ACCATGGAGG AAGCGCGCCG GGTGGCCGGG GATATTGGCA CCTTTCCCCT GATTGTCCGC CCCGCCTTTA CGCTGGGCGG CATGGGAGGC GGCGTGGCGT ACAACAAAAC GGAATTTGAA GAGATATGCG CCCGGGGGCT GGATCTTTCC CCCGTTTCCG AAATCCTGAT TGAAGAGTCC CTGATCGGCT GGAAAGAGTT TGAAATGGAG GTAATGAGGG ACCGCGCGGA CAATTGCGTC GTCATTTGCT CCATTGAGAA CATCGACCCC ATGGGCGTGC ATACCGGGGA TTCCATCACC GTAGCCCCCA TCCAGACACT CACGGACCGA GAATACCAGG CCATGAGGGA CGCTTCTTTT GCCGTCATCC GGGAGATCGG GGTGGAAACG GGCGGTTCCA ACATCCAGTT CGCCATTAAT CCGGCCAACG GCCGCATGAT TGTCATTGAA ATGAACCCCC GCGTTTCCCG CTCTTCCGCG CTGGCTTCCA AGGCCACAGG GTTCCCCATT GCCAAACTGG CCGCCAAACT GGCCATAGGC TATACGCTGG ACGAATTGCG GAATGACATC ACGCGGGAAA CGCCCGCCTG CTTTGAGCCC ACCATCGACT ATGTGGTTAC CAAGGTTCCC CGCTTCACGT TTGAAAAGTT CAAGCAGGCG GACGAACACC TGACCACGTC CATGAAATCC GTAGGGGAGG CCATGGCCAT CGGTCGTACT TTCAAGGAAT CCCTGCAAAA GGCCTTGCGT TCCCTGGAAA CGGGACGCTG GGGCTGGGGA TTTGACGCGA AGGCCCCGCA GGCTCCTTCC GCAGAAGAAA TCACGCGGAA ACTGGCTGTT CCCACGGCAG AACGCATTTT CTGGATTCAA ACGGCGTTCA GCAACGGATT TTCCCTGGAT GAAGTGCAGC AGCTTACCCA GATTGATCCC TGGTTCCTGG CACAAATGCA GGACCTGGCC AAGGCGGGAG ACAGCCTGGA CAAGCTGGAT CTGCGGGAAG CCAAGAAGCT GGGCTTTTCC GACAGGCAGA TCGCCCTGGC CCGGGGAACC ACGGAAGAAA ACATCCGGAA GGAACGCCTG GAACAGGGCA TTGTGCCCGG ATACCGGCTG GTGGATACCT GCGCCGCGGA ATTCGAAGCC TACACCCCCT ACTTTTATTC CACCTATGGA GATGAAAATG AAGCCCGCGA AACAGGCAGA AAAAAAATCC TCATCCTGGG CGGCGGCCCA AACCGCATTG GCCAGGGGAT TGAGTTCGAC TACTGCTGCG TGCACGCCTC CATGGCTCTG CGGGAAATGG GCTACGAAAC CATCATCGTG AACTCCAACC CGGAAACCGT CTCCACGGAC 
TATGATTCCT CCGACAAGCT TTTCTTCGAA CCCCTGACGC TGGAAGACGT GCTCCATATC TGTGAACAGG AGAAGCCGGA CGGCGTCATC GTCCAGTTCG GAGGACAGAC GCCGCTGAAC CTGGCCAATG CGCTGGAAGC CCACGGCGTG CCTATTATCG GCACCAGCCC CAGGGCCATT GACCTGGCGG AAGACCGCGA ACATTTTTCC GCCCTGTTGA AGGAACTGGG GCTGAAACAG GCGGAGGCAG GAACAGCCAC CAATGTGGAG GACGCCGCCG CCATTGCCGC GCGCATCGGT TACCCGGTGC TGCTGCGCCC CTCCTTCGTG CTGGGCGGAC GCGGCATGAT CATCGTATAT GAGGAAAAAG AACTGCGCCG GTACATGAAT GAAGCCGTGG AAGCTTCCGA GGAACGCCCC GTTCTGATTG ATCGCTTCCT GGAAAATGCC GTAGAGATTG ACGTGGATGT CATCGCGGAC AGGGAGCGCG CCGTCATCGG AGCCATCATG CAGCATGTGG AACCGGCAGG CATCCATTCA GGGGATTCCG CCAGCATGAT TCCCGCCATG GGAATTTCCA TGAAAATGCA CAAGGAAATC ACGCGGGCTT CCAAGGAACT GGCAAGCAAA CTGAACGTCT GCGGGCTGAT GAACATCCAG TTTGCCGTAA AAGACGAACA GCTTTACGTC ATTGAGGTGA ATCCCCGCGC CTCCCGCACC GTTCCGTTCG TCTCCAAATC CATCGGCAAG CCCCTCGCCA AGCTGGCCGC ACAGGTCATG GCCGGCAAAA CGCTGGCGGA ACTGGGCTTT ACCCGGGAAA TCACTCCGGA ATACTATTGC GTGAAGGAAG CCGTTTTCCC ATGGGGCCGC TTCCCGGGCA TCGACGTGGT GCTGGGGCCG GAAATGAAAT CCACCGGAGA GGTGATGGGC ATTGATCCGG ACCCGGATAT CGCTTTCGCA AAATCCCAGG TCAGTGCATT CAATCCCCTG CCGACGGAAG GCAAAGTCTT CATCTCCGTG AATGACCGGG ACAAGGAACG GGTGCTTCAC ATGGCCCGGC AGCTGGCGGA CATGGGATTC ACGCTGTGCG CCACGCGCGG CACGATGATT CACCTGCTCC AGCACGACAT TGAATGCGAA CGCGCCTACA AAGTCAACGA GGCGCGCCGC CCCAACATCG TGGACCATAT TAAAAACGGG GATATTGATT TCATCATCAA CACCCCCGGC TCCCACGACG CGCGGGCGGA CGACATCATC ATCCGTTCCT CCGCCATTGC CGCCAAAACC TCCTATTGCA CCAACCTGGC TTCCGCGCAG GCCTGCGTGA ATGCCATTGA GGCGCTGAAG AACAAAAATC TTCAGGTGTG CACTATTCAG GAGTACCACG CCCAAAACCT TTAA
|
Protein sequence | MPKDTSIKKI LVIGSGPIVI GQGCEFDYSG TQACKALREE GYEVVLVNSN PATIMTDPET APRTYIEPIT PEFVEKIIIR EKPDALLPTL GGQTALNCAM ELHRSGVLER EKVRMIGANA NAIDRGEDRN LFKEAMLRIG LDVPRSGIAH TMEEARRVAG DIGTFPLIVR PAFTLGGMGG GVAYNKTEFE EICARGLDLS PVSEILIEES LIGWKEFEME VMRDRADNCV VICSIENIDP MGVHTGDSIT VAPIQTLTDR EYQAMRDASF AVIREIGVET GGSNIQFAIN PANGRMIVIE MNPRVSRSSA LASKATGFPI AKLAAKLAIG YTLDELRNDI TRETPACFEP TIDYVVTKVP RFTFEKFKQA DEHLTTSMKS VGEAMAIGRT FKESLQKALR SLETGRWGWG FDAKAPQAPS AEEITRKLAV PTAERIFWIQ TAFSNGFSLD EVQQLTQIDP WFLAQMQDLA KAGDSLDKLD LREAKKLGFS DRQIALARGT TEENIRKERL EQGIVPGYRL VDTCAAEFEA YTPYFYSTYG DENEARETGR KKILILGGGP NRIGQGIEFD YCCVHASMAL REMGYETIIV NSNPETVSTD YDSSDKLFFE PLTLEDVLHI CEQEKPDGVI VQFGGQTPLN LANALEAHGV PIIGTSPRAI DLAEDREHFS ALLKELGLKQ AEAGTATNVE DAAAIAARIG YPVLLRPSFV LGGRGMIIVY EEKELRRYMN EAVEASEERP VLIDRFLENA VEIDVDVIAD RERAVIGAIM QHVEPAGIHS GDSASMIPAM GISMKMHKEI TRASKELASK LNVCGLMNIQ FAVKDEQLYV IEVNPRASRT VPFVSKSIGK PLAKLAAQVM AGKTLAELGF TREITPEYYC VKEAVFPWGR FPGIDVVLGP EMKSTGEVMG IDPDPDIAFA KSQVSAFNPL PTEGKVFISV NDRDKERVLH MARQLADMGF TLCATRGTMI HLLQHDIECE RAYKVNEARR PNIVDHIKNG DIDFIINTPG SHDARADDII IRSSAIAAKT SYCTNLASAQ ACVNAIEALK NKNLQVCTIQ EYHAQNL
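
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one possible way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown in the record.
first_blocks = "ATGCCTAAAG ACACCTCCAT CAAAAAGATT"
seq = clean_sequence(first_blocks)
print(len(seq))                   # number of bases after joining the blocks
print(round(gc_percent(seq), 1))  # GC percentage of this short prefix only
```

Applying the same two functions to the complete gene sequence would give a value to compare against the reported GC content. The short prefix used here is not expected to match that figure.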
|
| |