Gene Amuc_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1251 
Symbol 
ID6275403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1501602 
End bp1504805 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content58% 
IMG OID642613308 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001877857 
Protein GI187735745 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAG ACACCTCCAT CAAAAAGATT CTCGTGATCG GGTCCGGCCC GATCGTCATC 
GGCCAGGGCT GCGAATTCGA CTACTCCGGC ACCCAGGCCT GCAAAGCCCT CCGGGAAGAG
GGTTATGAAG TAGTGCTGGT CAATTCCAAT CCCGCCACCA TCATGACGGA TCCGGAGACA
GCTCCACGCA CTTATATTGA ACCGATTACT CCGGAGTTCG TGGAGAAAAT CATCATCCGG
GAAAAACCGG ATGCCCTCCT CCCGACCCTG GGCGGGCAGA CAGCCCTGAA CTGCGCCATG
GAACTGCACC GCAGCGGCGT GCTGGAAAGG GAAAAGGTGC GTATGATCGG CGCCAACGCA
AACGCCATTG ACCGCGGGGA GGACCGCAAC CTGTTCAAGG AAGCCATGCT CCGCATTGGC
CTGGACGTGC CGCGTTCCGG CATTGCGCAT ACCATGGAGG AAGCGCGCCG GGTGGCCGGG
GATATTGGCA CCTTTCCCCT GATTGTCCGC CCCGCCTTTA CGCTGGGCGG CATGGGAGGC
GGCGTGGCGT ACAACAAAAC GGAATTTGAA GAGATATGCG CCCGGGGGCT GGATCTTTCC
CCCGTTTCCG AAATCCTGAT TGAAGAGTCC CTGATCGGCT GGAAAGAGTT TGAAATGGAG
GTAATGAGGG ACCGCGCGGA CAATTGCGTC GTCATTTGCT CCATTGAGAA CATCGACCCC
ATGGGCGTGC ATACCGGGGA TTCCATCACC GTAGCCCCCA TCCAGACACT CACGGACCGA
GAATACCAGG CCATGAGGGA CGCTTCTTTT GCCGTCATCC GGGAGATCGG GGTGGAAACG
GGCGGTTCCA ACATCCAGTT CGCCATTAAT CCGGCCAACG GCCGCATGAT TGTCATTGAA
ATGAACCCCC GCGTTTCCCG CTCTTCCGCG CTGGCTTCCA AGGCCACAGG GTTCCCCATT
GCCAAACTGG CCGCCAAACT GGCCATAGGC TATACGCTGG ACGAATTGCG GAATGACATC
ACGCGGGAAA CGCCCGCCTG CTTTGAGCCC ACCATCGACT ATGTGGTTAC CAAGGTTCCC
CGCTTCACGT TTGAAAAGTT CAAGCAGGCG GACGAACACC TGACCACGTC CATGAAATCC
GTAGGGGAGG CCATGGCCAT CGGTCGTACT TTCAAGGAAT CCCTGCAAAA GGCCTTGCGT
TCCCTGGAAA CGGGACGCTG GGGCTGGGGA TTTGACGCGA AGGCCCCGCA GGCTCCTTCC
GCAGAAGAAA TCACGCGGAA ACTGGCTGTT CCCACGGCAG AACGCATTTT CTGGATTCAA
ACGGCGTTCA GCAACGGATT TTCCCTGGAT GAAGTGCAGC AGCTTACCCA GATTGATCCC
TGGTTCCTGG CACAAATGCA GGACCTGGCC AAGGCGGGAG ACAGCCTGGA CAAGCTGGAT
CTGCGGGAAG CCAAGAAGCT GGGCTTTTCC GACAGGCAGA TCGCCCTGGC CCGGGGAACC
ACGGAAGAAA ACATCCGGAA GGAACGCCTG GAACAGGGCA TTGTGCCCGG ATACCGGCTG
GTGGATACCT GCGCCGCGGA ATTCGAAGCC TACACCCCCT ACTTTTATTC CACCTATGGA
GATGAAAATG AAGCCCGCGA AACAGGCAGA AAAAAAATCC TCATCCTGGG CGGCGGCCCA
AACCGCATTG GCCAGGGGAT TGAGTTCGAC TACTGCTGCG TGCACGCCTC CATGGCTCTG
CGGGAAATGG GCTACGAAAC CATCATCGTG AACTCCAACC CGGAAACCGT CTCCACGGAC
TATGATTCCT CCGACAAGCT TTTCTTCGAA CCCCTGACGC TGGAAGACGT GCTCCATATC
TGTGAACAGG AGAAGCCGGA CGGCGTCATC GTCCAGTTCG GAGGACAGAC GCCGCTGAAC
CTGGCCAATG CGCTGGAAGC CCACGGCGTG CCTATTATCG GCACCAGCCC CAGGGCCATT
GACCTGGCGG AAGACCGCGA ACATTTTTCC GCCCTGTTGA AGGAACTGGG GCTGAAACAG
GCGGAGGCAG GAACAGCCAC CAATGTGGAG GACGCCGCCG CCATTGCCGC GCGCATCGGT
TACCCGGTGC TGCTGCGCCC CTCCTTCGTG CTGGGCGGAC GCGGCATGAT CATCGTATAT
GAGGAAAAAG AACTGCGCCG GTACATGAAT GAAGCCGTGG AAGCTTCCGA GGAACGCCCC
GTTCTGATTG ATCGCTTCCT GGAAAATGCC GTAGAGATTG ACGTGGATGT CATCGCGGAC
AGGGAGCGCG CCGTCATCGG AGCCATCATG CAGCATGTGG AACCGGCAGG CATCCATTCA
GGGGATTCCG CCAGCATGAT TCCCGCCATG GGAATTTCCA TGAAAATGCA CAAGGAAATC
ACGCGGGCTT CCAAGGAACT GGCAAGCAAA CTGAACGTCT GCGGGCTGAT GAACATCCAG
TTTGCCGTAA AAGACGAACA GCTTTACGTC ATTGAGGTGA ATCCCCGCGC CTCCCGCACC
GTTCCGTTCG TCTCCAAATC CATCGGCAAG CCCCTCGCCA AGCTGGCCGC ACAGGTCATG
GCCGGCAAAA CGCTGGCGGA ACTGGGCTTT ACCCGGGAAA TCACTCCGGA ATACTATTGC
GTGAAGGAAG CCGTTTTCCC ATGGGGCCGC TTCCCGGGCA TCGACGTGGT GCTGGGGCCG
GAAATGAAAT CCACCGGAGA GGTGATGGGC ATTGATCCGG ACCCGGATAT CGCTTTCGCA
AAATCCCAGG TCAGTGCATT CAATCCCCTG CCGACGGAAG GCAAAGTCTT CATCTCCGTG
AATGACCGGG ACAAGGAACG GGTGCTTCAC ATGGCCCGGC AGCTGGCGGA CATGGGATTC
ACGCTGTGCG CCACGCGCGG CACGATGATT CACCTGCTCC AGCACGACAT TGAATGCGAA
CGCGCCTACA AAGTCAACGA GGCGCGCCGC CCCAACATCG TGGACCATAT TAAAAACGGG
GATATTGATT TCATCATCAA CACCCCCGGC TCCCACGACG CGCGGGCGGA CGACATCATC
ATCCGTTCCT CCGCCATTGC CGCCAAAACC TCCTATTGCA CCAACCTGGC TTCCGCGCAG
GCCTGCGTGA ATGCCATTGA GGCGCTGAAG AACAAAAATC TTCAGGTGTG CACTATTCAG
GAGTACCACG CCCAAAACCT TTAA
 
Protein sequence
MPKDTSIKKI LVIGSGPIVI GQGCEFDYSG TQACKALREE GYEVVLVNSN PATIMTDPET 
APRTYIEPIT PEFVEKIIIR EKPDALLPTL GGQTALNCAM ELHRSGVLER EKVRMIGANA
NAIDRGEDRN LFKEAMLRIG LDVPRSGIAH TMEEARRVAG DIGTFPLIVR PAFTLGGMGG
GVAYNKTEFE EICARGLDLS PVSEILIEES LIGWKEFEME VMRDRADNCV VICSIENIDP
MGVHTGDSIT VAPIQTLTDR EYQAMRDASF AVIREIGVET GGSNIQFAIN PANGRMIVIE
MNPRVSRSSA LASKATGFPI AKLAAKLAIG YTLDELRNDI TRETPACFEP TIDYVVTKVP
RFTFEKFKQA DEHLTTSMKS VGEAMAIGRT FKESLQKALR SLETGRWGWG FDAKAPQAPS
AEEITRKLAV PTAERIFWIQ TAFSNGFSLD EVQQLTQIDP WFLAQMQDLA KAGDSLDKLD
LREAKKLGFS DRQIALARGT TEENIRKERL EQGIVPGYRL VDTCAAEFEA YTPYFYSTYG
DENEARETGR KKILILGGGP NRIGQGIEFD YCCVHASMAL REMGYETIIV NSNPETVSTD
YDSSDKLFFE PLTLEDVLHI CEQEKPDGVI VQFGGQTPLN LANALEAHGV PIIGTSPRAI
DLAEDREHFS ALLKELGLKQ AEAGTATNVE DAAAIAARIG YPVLLRPSFV LGGRGMIIVY
EEKELRRYMN EAVEASEERP VLIDRFLENA VEIDVDVIAD RERAVIGAIM QHVEPAGIHS
GDSASMIPAM GISMKMHKEI TRASKELASK LNVCGLMNIQ FAVKDEQLYV IEVNPRASRT
VPFVSKSIGK PLAKLAAQVM AGKTLAELGF TREITPEYYC VKEAVFPWGR FPGIDVVLGP
EMKSTGEVMG IDPDPDIAFA KSQVSAFNPL PTEGKVFISV NDRDKERVLH MARQLADMGF
TLCATRGTMI HLLQHDIECE RAYKVNEARR PNIVDHIKNG DIDFIINTPG SHDARADDII
IRSSAIAAKT SYCTNLASAQ ACVNAIEALK NKNLQVCTIQ EYHAQNL