Gene CPR_2576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2576 
SymbolcarB 
ID4206128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2803122 
End bp2806325 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content33% 
IMG OID642567126 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_699823 
Protein GI110803800 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase
[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTAA ATAAAGATAT AAAAAAAGTT TTAGTAATAG GTTCAGGTCC AATAATAATA 
GGACAAGCAG CGGAGTTTGA TTACTCAGGT ACTCAAGCTT GCCAAGCTCT TAAAGAAGAA
GGGATTGAAG TTGTACTTGT AAACTCAAAC CCAGCTACAA TAATGACTGA TAAGGAAATA
GCAGATAAGG TTTATCTTGA ACCTTTAACA GTTGAATTCG TAGAAAAAGT AATAGAGAAA
GAAAGACCAG ATTCTCTTTT AGCAGGAATG GGTGGACAAA CAGGGTTAAA TCTTGCTGTT
GAGCTTTATG AAAAGGGAAT TTTAGATAAA TATAATGTAA AAGTAATAGG TACTTCCATA
GAATCTATTA AGGAAGGGGA AGATAGAGAA TTATTCAGAG ATATGATGAA TAGAATCAAT
CAACCTGTTA TTCAAAGTGA AATAATAACT GATTTAGATT CTGGTATTGC CTTTGCTAGA
AAGATTGGAT ACCCAGTAAT AGTTAGACCC GCATATACTC TTGGAGGAAC TGGTGGAGGA
ATAGCTAATA ATGAAGAAGA ATTAATAGAA ACATTAACAT CAGGATTACA ATTAAGTACA
ATTGGACAGG TTCTTTTAGA GAAGAGTGTT AAAGGATGGA AAGAAATTGA GTACGAAGTA
ATGAGAGACT CTTTTGGTAA TTGTATAACA GTTTGTAACA TGGAAAACAT TGACCCTGTA
GGAATACATA CTGGGGATTC AATAGTTGTA GCTCCTAGCC AAACATTATC AGATAAAGAG
TATCAAATGC TTAGAAGTGC ATCAATAGAT ATAATAAATG CTGTAGGAAT TAAAGGGGGA
TGTAATGTTC AGTTTGCTTT AAATCCACAT TCATTTGAAT ATGCAGTAAT AGAAATCAAT
CCAAGGGTTT CAAGATCTTC AGCTTTAGCT TCAAAGGCAA CTGGTTATCC AATAGCCAAG
GTGGCAGCTA AAATAGCTTT AGGATATGGA TTAGATGAAA TAAAAAATGC AGTAACAGGA
ATGACATATG CTTGTTTTGA GCCTTCATTA GACTATGTAG TTGTAAAAAT TCCTAAATGG
CCTTTTGATA AATTCCAAGG AGCTGATAGG GTTTTAGGAA CTAAGATGAT GGCTACTGGA
GAAATAATGG CCATAGGAAG TAATTTTGAG GCAGCATTTT TAAAGGGTAT AAGATCATTA
GAAATAGGTA AGTATTCATT AGAACATAAA AAATTTAAAG ATCTTTCAAT GTATGAGTTA
AGAGAAAGGG TAGTTTCTCC AGATGATGAG AGAATATTTG CTTTAGCTGA AATGCTAAGA
AGAGGATACA GAATAGATAT GGTTTCTAAA ATAACTGGAA TAGATATATT CTTCTTAGAA
AAATTCAGAT GGCTAGTTGA AGAAGAACAA AAGCTTAAGC AAAGTACAAT AGATGATTTA
AATAGAGAGT GGTTATTAAA ATTAAAGAGA AGAGGTTTTT CAGATAAGGC AATAGCTGAT
ATGCTAAAGG TTTCACCAGA TGAAATCTAC AGATTAAGAG ATATATGGCA TATAAAACCT
TCTTATAAAA TGGTTGATAC TTGTGGTGGA GAGTTTGAGG CTTTATCACC ATACTATTAT
TCAACATATG AACAATATGA CGAAGTAGTT GTTTCTGATA ATAAGAAAGT TGTAGTTATA
GGATCAGGTC CTATAAGAAT AGGACAAGGG ATAGAATTTG ACTATGCTTC AGTTCACTGT
GTAATGGCAC TTAGAAAGCA AGGAATTGAA ACAATAGTTA TAAACAATAA CCCAGAGACA
GTAAGTACAG ACTTCAGTAT TTCAGATAAG CTTTACTTTG AACCATTAAC TGAAGAGGAC
GTTTTAAACA TAATAGATAA AGAAAATCCA GATGGAGTTA TACTTCAATT TGGTGGTCAA
ACAGCTATTA AGCTTGCTAA GTTCTTAAAG GAGAAGAATA TTCCTACATT AGGAACTACT
TCAGATCAAA TAGACCTAGC TGAGGATAGA GAACAATTTG ATGATTTATT AGAAAGACTT
AATATAGCTA GACCGAAGGG AAAAGGAGTT TGGAGCTTAG AAGAAGGATT AGAGGAAGCA
AGAAGATTAG GATTCCCAAT CTTAGTTAGA CCTTCCTTCG TTTTAGGTGG TCAAGGTATG
GAAATAACTC ATGATGAAGA AGAGTTAACA TACTATTTAA CAAATGCTTT TGAGAAAGAT
TCCAAGAATC CAATACTTAT AGACAAATAC TTAATGGGTA GAGAAATAGA AGTTGATGCC
ATATCTGACG GTGAAGATGT TTTAGTTCCA GGAATTATGG AGCATTTAGA AAGAGCAGGA
GTTCACTCAG GAGATAGTAT TACTATGTAT CCAGCTCAAA ATATTTCAGA TAAGATAAAG
GAAGATGTTT TAGATTACAC TAAGAAATTA GCTTTAAGCA TAGGAATAAA AGGAATGATA
AACATTCAGT TTATTGAGTT TGAAGGAAAG CTTTATGTAA TAGAAGTTAA TCCAAGAGCT
TCAAGAACAG TGCCTTATAT ATCAAAGGTA AGTGGAGTTC CAATAGTAGA TATAGCTACG
AGAATAATGC TTGGAGAAAA ATTAAAAGAT TTAGGATATG GTACAGGAGT TTATAAGGAG
CCTGAGTTAG TATCAGTTAA GGTTCCGGTA TTCTCAACTC AAAAGCTTCC TAATGTTGAA
GTAAGTTTAG GACCAGAAAT GAGATCCACA GGAGAAGTTT TAGGAGTAGG AAGAAATGTT
TTTGAAGCGT TATACAAAGG TTTTGTTGGG GCTTCAATGT ACACTGGAGA TAAGGGTAAA
ACAATCTTAG CTACTATTAA GAAACATGAT AAAAAAGAGT TTATGGAACT TTCTAAGGAT
TTAGATAAAC TAGGATATAA TTTCATAGCA ACAACAGGAA CAGCTAATGA ATTAAGAGAG
GCTGGAATAG ATGCTAAGGA AGTTAGAAGA ATAGGAGAAG AATCTCCAAA CATTATGGAT
TTAATAAAGA ATAAAGAAAT AGATTTAGTT GTAAATACTC CAACTAAAGC TAATGATTCT
AAGAGAGATG GATTCCATAT AAGAAGAGCG GCAATAGAAA GAAATATAGG AGTTATGACT
TCATTAGATA CTCTTAAAGC TCTAGTAGAG TTACAAAAAG AAGGAGCACA TAATAGAGAG
TTAGAAGTAT TTAACTTAAT ATAA
 
Protein sequence
MPLNKDIKKV LVIGSGPIII GQAAEFDYSG TQACQALKEE GIEVVLVNSN PATIMTDKEI 
ADKVYLEPLT VEFVEKVIEK ERPDSLLAGM GGQTGLNLAV ELYEKGILDK YNVKVIGTSI
ESIKEGEDRE LFRDMMNRIN QPVIQSEIIT DLDSGIAFAR KIGYPVIVRP AYTLGGTGGG
IANNEEELIE TLTSGLQLST IGQVLLEKSV KGWKEIEYEV MRDSFGNCIT VCNMENIDPV
GIHTGDSIVV APSQTLSDKE YQMLRSASID IINAVGIKGG CNVQFALNPH SFEYAVIEIN
PRVSRSSALA SKATGYPIAK VAAKIALGYG LDEIKNAVTG MTYACFEPSL DYVVVKIPKW
PFDKFQGADR VLGTKMMATG EIMAIGSNFE AAFLKGIRSL EIGKYSLEHK KFKDLSMYEL
RERVVSPDDE RIFALAEMLR RGYRIDMVSK ITGIDIFFLE KFRWLVEEEQ KLKQSTIDDL
NREWLLKLKR RGFSDKAIAD MLKVSPDEIY RLRDIWHIKP SYKMVDTCGG EFEALSPYYY
STYEQYDEVV VSDNKKVVVI GSGPIRIGQG IEFDYASVHC VMALRKQGIE TIVINNNPET
VSTDFSISDK LYFEPLTEED VLNIIDKENP DGVILQFGGQ TAIKLAKFLK EKNIPTLGTT
SDQIDLAEDR EQFDDLLERL NIARPKGKGV WSLEEGLEEA RRLGFPILVR PSFVLGGQGM
EITHDEEELT YYLTNAFEKD SKNPILIDKY LMGREIEVDA ISDGEDVLVP GIMEHLERAG
VHSGDSITMY PAQNISDKIK EDVLDYTKKL ALSIGIKGMI NIQFIEFEGK LYVIEVNPRA
SRTVPYISKV SGVPIVDIAT RIMLGEKLKD LGYGTGVYKE PELVSVKVPV FSTQKLPNVE
VSLGPEMRST GEVLGVGRNV FEALYKGFVG ASMYTGDKGK TILATIKKHD KKEFMELSKD
LDKLGYNFIA TTGTANELRE AGIDAKEVRR IGEESPNIMD LIKNKEIDLV VNTPTKANDS
KRDGFHIRRA AIERNIGVMT SLDTLKALVE LQKEGAHNRE LEVFNLI