Gene CPF_2897 details


Gene Information

Locus tag: CPF_2897
Symbol: carB
ID: 4201549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 3167173
End bp: 3170376
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 33%
IMG OID: 638083764
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_697261
Protein GI: 110800466
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
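
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3167173
END_BP = 3170376
GENE_LENGTH_BP = 3204
PROTEIN_LENGTH_AA = 1067


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span covers 3204 positions, which is 1068 codons, or 1067 residues plus one stop codon.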


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATTAA ATAAAGATAT AAAAAAAGTT TTAGTAATAG GTTCAGGTCC AATAATAATA 
GGACAAGCAG CGGAGTTTGA TTACTCAGGT ACTCAAGCTT GCCAAGCTCT TAAAGAAGAA
GGGATTGAAG TTGTACTTGT AAACTCAAAC CCAGCTACAA TAATGACTGA TAAGGAAATA
GCAGATAAGG TTTATCTTGA GCCTTTAACA GTTGAATTCG TAGAAAAAGT AATAGAGAAA
GAAAGACCAG ATTCTCTTTT AGCAGGAATG GGTGGACAAA CAGGGTTAAA CCTTGCTGTT
GAGCTTTATG AAAAGGGAAT TTTAGATAAA TATAATGTAA AAGTAATAGG TACTTCCATA
GAATCTATTA AGGAAGGGGA AGATAGAGAA CTATTCAGAG ATATGATGAA TAGAATTAAT
CAACCTGTTA TTCAAAGTGA AATAATAACT GATTTAGACG CTGGTATTGC CTTTGCTAGA
AAGATTGGAT ATCCAGTAAT AGTTAGACCA GCATATACTC TTGGAGGAAC TGGTGGAGGA
ATAGCTAATA ATGAAGAAGA ATTAATAGAA ACATTAACAT CAGGATTACA ATTAAGTACA
ATTGGACAGG TTCTTTTAGA GAAGAGTGTT AAAGGATGGA AAGAAATTGA GTACGAAGTA
ATGAGAGACT CTTTTGGTAA TTGTATCACA GTTTGTAACA TGGAAAACAT TGACCCTGTA
GGAATACACA CTGGGGATTC AATAGTTGTA GCTCCTAGCC AAACATTATC AGATAAAGAG
TATCAAATGC TTAGAAGTGC ATCAATAGAT ATAATAAATG CTGTAGGAAT TGAAGGAGGA
TGTAATGTTC AGTTTGCTTT AAATCCACAT TCCTTTGAAT ATGCAGTAAT AGAAATCAAT
CCAAGGGTTT CAAGATCTTC AGCTTTAGCT TCAAAGGCAA CTGGTTATCC AATAGCAAAG
GTTGCAGCTA AAATAGCTTT AGGATATGGA TTAGATGAAA TAAAAAATGC AGTAACAGGA
ATGACATATG CTTGCTTTGA GCCTTCATTA GACTATGTAG TTGTAAAAAT TCCTAAATGG
CCTTTTGATA AATTCCAAGG AGCTGATAGA GCTTTAGGAA CTAAAATGAT GGCTACTGGA
GAAATAATGG CCATAGGAAG TAATTTTGAG GCAGCATTTT TAAAGGGTAT AAGATCATTA
GAAATAGGTA AGTATTCATT AGAGCATAAA AAATTTAAAG ATCTTTCAAT GTATGAGTTA
AGAGAAAGAG TAGTTTCTCC AGATGATGAG AGAATATTTG CTTTAGCTGA AATGCTAAGA
AGAGGATACA GAATAGATAT GGTTTCAAAA ATAACTGGAA TAGATATATT CTTCTTAGAA
AAATTCAGAT GGTTAGTTGA AGAAGAACAA AAGCTTAAGC AAAGCACAAT AGATGATTTA
AATAGAGAGT GGTTATTAAA ATTAAAGAGA AGAGGTTTCT CAGATAAGGC AATAGCTGAT
ATGCTAAAGG TTTCACCAGA TGAAATCTAC AGATTAAGAG ATATATGGCA TATAAAGCCT
GCTTATAAAA TGGTTGATAC TTGTGGTGGA GAATTTGAAG CTTTATCACC ATACTATTAC
TCAACATATG AACAATATGA CGAAGTAGTT GTTTCTGATA ATAAGAAAGT TGTAGTTATA
GGATCAGGTC CTATAAGAAT AGGACAAGGG ATAGAGTTTG ACTATGCTTC AGTTCACTGT
GTAATGGCAC TTAGAAAGCA GGAAATTGAA ACAATAGTTA TAAACAATAA CCCAGAGACA
GTAAGTACAG ACTTCAGTAT TTCAGATAAG CTTTATTTTG AGCCATTAAC TGAAGAGGAT
GTTTTAAACA TCATAGATAA AGAAAAGCCA GATGGAGTTA TACTTCAATT TGGTGGTCAA
ACAGCTATTA AACTTGCTAA GTTCTTAAAA GAGAAGAACA TTCCTACCTT AGGAACTACT
TCAGATCAAA TAGACCTAGC TGAGGATAGA GAACAATTTG ATGATTTATT AGAAAGGCTT
AATATAGCTA GACCAAAGGG AAAAGGAGTT TGGAGCTTAG AAGAAGGCTT AGAGGAAGCA
AGAAGATTAG GATTCCCAAT CTTAGTTAGA CCTTCCTTCG TTTTAGGTGG TCAAGGTATG
GAAATAACTC ATGATGAAGA AGAGTTAACA TACTATTTAA CAAATGCTTT TGAGAAAGAT
TCCAAGAATC CAATACTTAT AGACAAATAC TTAATGGGTA GAGAAATAGA AGTTGATGCC
ATATCTGATG GTGAAGATGT TCTAGTTCCA GGAATTATGG AGCATTTAGA AAGAGCAGGA
GTTCACTCAG GAGATAGTAT TACTATGTAC CCAGCTCAAA ATATTTCAGA TAAGATAAAG
GAAGATGTTC TAGATTACAC TAAGAAATTA GCCTTAAGCA TAGGAATAAA GGGAATGATA
AACATTCAGT TTATTGAGTT TGAAGGAAAG CTTTATGTAA TAGAGGTTAA TCCAAGAGCT
TCAAGAACAG TGCCTTATAT ATCAAAGGTA AGTGGAGTTC CAATAGTAGA TATAGCTACG
AGAATAATGC TTGGAGAAAG ATTAAAAGAT TTAGGATATG GAACAGGAGT TTATAAGGAG
CCAGATTTAG TATCAGTTAA GGTTCCAGTA TTCTCAACTC AAAAACTTCC TAATGTTGAA
GTAAGTTTAG GACCAGAAAT GAGATCCACA GGAGAAGTTT TAGGAGTAGG AAGAAATGTT
TTTGAAGCGT TATACAAAGG CTTTGTTGGG GCTTCTATGT ACACTGGAGA TAAGGGTAAA
ACTATCTTAG CTACTATTAA GAAACATGAT AAAAAAGAAT TTATGGAACT TGCTAAGGAT
TTAGATAAAT TAGGATATAA TTTCATAGCA ACAACAGGAA CAGCTAAGGA ATTAAGAGAG
GCTGGAATAG ATGCTAAGGA AGTTAGAAGA ATAGGAGAAG AATCTCCAAA CATCATGGAT
TTAATAAAGA ATAAAGAAAT AGATTTAGTG GTAAATACTC CAACTAAAGC TAACGATTCT
AAGAGAGATG GATTCCATAT AAGAAGAGCG GCAATAGAAA GAAATATAGG AGTTATGACT
TCATTAGATA CTCTTAAAGC TCTAGTAGAA TTACAAAAAG AAGGAGCACA TAATAGAGAG
TTAGAAGTAT TTAACTTAAT ATAA
 
Protein sequence
MPLNKDIKKV LVIGSGPIII GQAAEFDYSG TQACQALKEE GIEVVLVNSN PATIMTDKEI 
ADKVYLEPLT VEFVEKVIEK ERPDSLLAGM GGQTGLNLAV ELYEKGILDK YNVKVIGTSI
ESIKEGEDRE LFRDMMNRIN QPVIQSEIIT DLDAGIAFAR KIGYPVIVRP AYTLGGTGGG
IANNEEELIE TLTSGLQLST IGQVLLEKSV KGWKEIEYEV MRDSFGNCIT VCNMENIDPV
GIHTGDSIVV APSQTLSDKE YQMLRSASID IINAVGIEGG CNVQFALNPH SFEYAVIEIN
PRVSRSSALA SKATGYPIAK VAAKIALGYG LDEIKNAVTG MTYACFEPSL DYVVVKIPKW
PFDKFQGADR ALGTKMMATG EIMAIGSNFE AAFLKGIRSL EIGKYSLEHK KFKDLSMYEL
RERVVSPDDE RIFALAEMLR RGYRIDMVSK ITGIDIFFLE KFRWLVEEEQ KLKQSTIDDL
NREWLLKLKR RGFSDKAIAD MLKVSPDEIY RLRDIWHIKP AYKMVDTCGG EFEALSPYYY
STYEQYDEVV VSDNKKVVVI GSGPIRIGQG IEFDYASVHC VMALRKQEIE TIVINNNPET
VSTDFSISDK LYFEPLTEED VLNIIDKEKP DGVILQFGGQ TAIKLAKFLK EKNIPTLGTT
SDQIDLAEDR EQFDDLLERL NIARPKGKGV WSLEEGLEEA RRLGFPILVR PSFVLGGQGM
EITHDEEELT YYLTNAFEKD SKNPILIDKY LMGREIEVDA ISDGEDVLVP GIMEHLERAG
VHSGDSITMY PAQNISDKIK EDVLDYTKKL ALSIGIKGMI NIQFIEFEGK LYVIEVNPRA
SRTVPYISKV SGVPIVDIAT RIMLGERLKD LGYGTGVYKE PDLVSVKVPV FSTQKLPNVE
VSLGPEMRST GEVLGVGRNV FEALYKGFVG ASMYTGDKGK TILATIKKHD KKEFMELAKD
LDKLGYNFIA TTGTAKELRE AGIDAKEVRR IGEESPNIMD LIKNKEIDLV VNTPTKANDS
KRDGFHIRRA AIERNIGVMT SLDTLKALVE LQKEGAHNRE LEVFNLI
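
Both sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, that strips the spacing, computes a GC fraction, and translates DNA with the standard codon assignments (which translation table 11 shares for elongation codons; alternative start codons are not handled here). All helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record.
first_row = clean_sequence(
    "ATGCCATTAA ATAAAGATAT AAAAAAAGTT TTAGTAATAG GTTCAGGTCC AATAATAATA"
)
last_row = clean_sequence("TTAGAAGTAT TTAACTTAAT ATAA")

print(translate(first_row))
print(translate(last_row))
```

The last row is in frame because the 3180 bases before it are a multiple of three. The two translations can be compared by eye with the start and end of the protein sequence listed above; applying `gc_fraction` to the full cleaned gene sequence gives a value to compare with the GC content field.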