Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2576 |
Symbol | carB |
ID | 4206128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2803122 |
End bp | 2806325 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642567126 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_699823 |
Protein GI | 110803800 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTAA ATAAAGATAT AAAAAAAGTT TTAGTAATAG GTTCAGGTCC AATAATAATA GGACAAGCAG CGGAGTTTGA TTACTCAGGT ACTCAAGCTT GCCAAGCTCT TAAAGAAGAA GGGATTGAAG TTGTACTTGT AAACTCAAAC CCAGCTACAA TAATGACTGA TAAGGAAATA GCAGATAAGG TTTATCTTGA ACCTTTAACA GTTGAATTCG TAGAAAAAGT AATAGAGAAA GAAAGACCAG ATTCTCTTTT AGCAGGAATG GGTGGACAAA CAGGGTTAAA TCTTGCTGTT GAGCTTTATG AAAAGGGAAT TTTAGATAAA TATAATGTAA AAGTAATAGG TACTTCCATA GAATCTATTA AGGAAGGGGA AGATAGAGAA TTATTCAGAG ATATGATGAA TAGAATCAAT CAACCTGTTA TTCAAAGTGA AATAATAACT GATTTAGATT CTGGTATTGC CTTTGCTAGA AAGATTGGAT ACCCAGTAAT AGTTAGACCC GCATATACTC TTGGAGGAAC TGGTGGAGGA ATAGCTAATA ATGAAGAAGA ATTAATAGAA ACATTAACAT CAGGATTACA ATTAAGTACA ATTGGACAGG TTCTTTTAGA GAAGAGTGTT AAAGGATGGA AAGAAATTGA GTACGAAGTA ATGAGAGACT CTTTTGGTAA TTGTATAACA GTTTGTAACA TGGAAAACAT TGACCCTGTA GGAATACATA CTGGGGATTC AATAGTTGTA GCTCCTAGCC AAACATTATC AGATAAAGAG TATCAAATGC TTAGAAGTGC ATCAATAGAT ATAATAAATG CTGTAGGAAT TAAAGGGGGA TGTAATGTTC AGTTTGCTTT AAATCCACAT TCATTTGAAT ATGCAGTAAT AGAAATCAAT CCAAGGGTTT CAAGATCTTC AGCTTTAGCT TCAAAGGCAA CTGGTTATCC AATAGCCAAG GTGGCAGCTA AAATAGCTTT AGGATATGGA TTAGATGAAA TAAAAAATGC AGTAACAGGA ATGACATATG CTTGTTTTGA GCCTTCATTA GACTATGTAG TTGTAAAAAT TCCTAAATGG CCTTTTGATA AATTCCAAGG AGCTGATAGG GTTTTAGGAA CTAAGATGAT GGCTACTGGA GAAATAATGG CCATAGGAAG TAATTTTGAG GCAGCATTTT TAAAGGGTAT AAGATCATTA GAAATAGGTA AGTATTCATT AGAACATAAA AAATTTAAAG ATCTTTCAAT GTATGAGTTA AGAGAAAGGG TAGTTTCTCC AGATGATGAG AGAATATTTG CTTTAGCTGA AATGCTAAGA AGAGGATACA GAATAGATAT GGTTTCTAAA ATAACTGGAA TAGATATATT CTTCTTAGAA AAATTCAGAT GGCTAGTTGA AGAAGAACAA AAGCTTAAGC AAAGTACAAT AGATGATTTA AATAGAGAGT GGTTATTAAA ATTAAAGAGA AGAGGTTTTT CAGATAAGGC AATAGCTGAT ATGCTAAAGG TTTCACCAGA TGAAATCTAC AGATTAAGAG ATATATGGCA TATAAAACCT TCTTATAAAA TGGTTGATAC TTGTGGTGGA GAGTTTGAGG CTTTATCACC ATACTATTAT TCAACATATG AACAATATGA CGAAGTAGTT GTTTCTGATA ATAAGAAAGT TGTAGTTATA GGATCAGGTC CTATAAGAAT AGGACAAGGG ATAGAATTTG ACTATGCTTC AGTTCACTGT GTAATGGCAC TTAGAAAGCA AGGAATTGAA ACAATAGTTA TAAACAATAA CCCAGAGACA GTAAGTACAG ACTTCAGTAT TTCAGATAAG CTTTACTTTG AACCATTAAC TGAAGAGGAC GTTTTAAACA TAATAGATAA AGAAAATCCA GATGGAGTTA TACTTCAATT TGGTGGTCAA ACAGCTATTA AGCTTGCTAA GTTCTTAAAG GAGAAGAATA TTCCTACATT AGGAACTACT TCAGATCAAA TAGACCTAGC TGAGGATAGA GAACAATTTG ATGATTTATT AGAAAGACTT AATATAGCTA GACCGAAGGG AAAAGGAGTT TGGAGCTTAG AAGAAGGATT AGAGGAAGCA AGAAGATTAG GATTCCCAAT CTTAGTTAGA CCTTCCTTCG TTTTAGGTGG TCAAGGTATG GAAATAACTC ATGATGAAGA AGAGTTAACA TACTATTTAA CAAATGCTTT TGAGAAAGAT TCCAAGAATC CAATACTTAT AGACAAATAC TTAATGGGTA GAGAAATAGA AGTTGATGCC ATATCTGACG GTGAAGATGT TTTAGTTCCA GGAATTATGG AGCATTTAGA AAGAGCAGGA GTTCACTCAG GAGATAGTAT TACTATGTAT CCAGCTCAAA ATATTTCAGA TAAGATAAAG GAAGATGTTT TAGATTACAC TAAGAAATTA GCTTTAAGCA TAGGAATAAA AGGAATGATA AACATTCAGT TTATTGAGTT TGAAGGAAAG CTTTATGTAA TAGAAGTTAA TCCAAGAGCT TCAAGAACAG TGCCTTATAT ATCAAAGGTA AGTGGAGTTC CAATAGTAGA TATAGCTACG AGAATAATGC TTGGAGAAAA ATTAAAAGAT TTAGGATATG GTACAGGAGT TTATAAGGAG CCTGAGTTAG TATCAGTTAA GGTTCCGGTA TTCTCAACTC AAAAGCTTCC TAATGTTGAA GTAAGTTTAG GACCAGAAAT GAGATCCACA GGAGAAGTTT TAGGAGTAGG AAGAAATGTT TTTGAAGCGT TATACAAAGG TTTTGTTGGG GCTTCAATGT ACACTGGAGA TAAGGGTAAA ACAATCTTAG CTACTATTAA GAAACATGAT AAAAAAGAGT TTATGGAACT TTCTAAGGAT TTAGATAAAC TAGGATATAA TTTCATAGCA ACAACAGGAA CAGCTAATGA ATTAAGAGAG GCTGGAATAG ATGCTAAGGA AGTTAGAAGA ATAGGAGAAG AATCTCCAAA CATTATGGAT TTAATAAAGA ATAAAGAAAT AGATTTAGTT GTAAATACTC CAACTAAAGC TAATGATTCT AAGAGAGATG GATTCCATAT AAGAAGAGCG GCAATAGAAA GAAATATAGG AGTTATGACT TCATTAGATA CTCTTAAAGC TCTAGTAGAG TTACAAAAAG AAGGAGCACA TAATAGAGAG TTAGAAGTAT TTAACTTAAT ATAA
|
Protein sequence | MPLNKDIKKV LVIGSGPIII GQAAEFDYSG TQACQALKEE GIEVVLVNSN PATIMTDKEI ADKVYLEPLT VEFVEKVIEK ERPDSLLAGM GGQTGLNLAV ELYEKGILDK YNVKVIGTSI ESIKEGEDRE LFRDMMNRIN QPVIQSEIIT DLDSGIAFAR KIGYPVIVRP AYTLGGTGGG IANNEEELIE TLTSGLQLST IGQVLLEKSV KGWKEIEYEV MRDSFGNCIT VCNMENIDPV GIHTGDSIVV APSQTLSDKE YQMLRSASID IINAVGIKGG CNVQFALNPH SFEYAVIEIN PRVSRSSALA SKATGYPIAK VAAKIALGYG LDEIKNAVTG MTYACFEPSL DYVVVKIPKW PFDKFQGADR VLGTKMMATG EIMAIGSNFE AAFLKGIRSL EIGKYSLEHK KFKDLSMYEL RERVVSPDDE RIFALAEMLR RGYRIDMVSK ITGIDIFFLE KFRWLVEEEQ KLKQSTIDDL NREWLLKLKR RGFSDKAIAD MLKVSPDEIY RLRDIWHIKP SYKMVDTCGG EFEALSPYYY STYEQYDEVV VSDNKKVVVI GSGPIRIGQG IEFDYASVHC VMALRKQGIE TIVINNNPET VSTDFSISDK LYFEPLTEED VLNIIDKENP DGVILQFGGQ TAIKLAKFLK EKNIPTLGTT SDQIDLAEDR EQFDDLLERL NIARPKGKGV WSLEEGLEEA RRLGFPILVR PSFVLGGQGM EITHDEEELT YYLTNAFEKD SKNPILIDKY LMGREIEVDA ISDGEDVLVP GIMEHLERAG VHSGDSITMY PAQNISDKIK EDVLDYTKKL ALSIGIKGMI NIQFIEFEGK LYVIEVNPRA SRTVPYISKV SGVPIVDIAT RIMLGEKLKD LGYGTGVYKE PELVSVKVPV FSTQKLPNVE VSLGPEMRST GEVLGVGRNV FEALYKGFVG ASMYTGDKGK TILATIKKHD KKEFMELSKD LDKLGYNFIA TTGTANELRE AGIDAKEVRR IGEESPNIMD LIKNKEIDLV VNTPTKANDS KRDGFHIRRA AIERNIGVMT SLDTLKALVE LQKEGAHNRE LEVFNLI
|
| |