Gene Caul_4285 details

Gene Information

Locus tag: Caul_4285
Symbol: carB
ID: 5901746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4655976
End bp: 4659293
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 68%
IMG OID: 641564804
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_001685904
Protein GI: 167648241
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
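
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4655976
END_BP = 4659293


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the computed values match the Gene Length (3318 bp) and Protein Length (1105 aa) fields.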


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.233689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTCTGC GCGCGAGTTT TCCAATGCCC AAAAGAACAG ACATCTCCTC GATCCTGATC 
ATCGGCGCCG GCCCGATCGT CATCGGCCAG GCGTGCGAGT TCGACTATTC GGGCGTCCAG
GCCTGCAAGG CGCTGCGGGC CGAGGGCTAC CGGATCATCC TGGTCAATTC CAATCCCGCC
ACGATCATGA CCGATCCCGA CGTGGCCGAC GCGACCTATA TCGAGCCGAT CACCCCCGAC
ATGGTCGCCA AGATCATCGC CAAGGAGCGG CCCGACGCCC TTCTGCCGAC CATGGGCGGC
CAGACGGCGC TGAACACCGC CCTGGCCCTG GAAGCCGACG GCACCCTGGC CAAGTACGGC
GTCGAGATGA TCGGGGCCAA GGCCGAAGTG ATCGACAAGG CCGAGGACCG CCAGAAGTTC
CGCGACGCCA TGGACAAGCT GGGCCTGGAA AGCCCCAAGT CCAAGGCCGC CCACAACATG
GACGAGGCCC GCGAAGGCCT GGCCTTCGTC GGCCTGCCCG CCATCATCCG CCCGTCCTTC
ACCCTGGCCG GCACCGGCGG CGGCATCGCC TACAATCTCG AGGAATTCGA GGAGATCGTC
GAACGCGGCC TGGACCTTTC GCCGACCACC GAGGTGCTGA TCGAAGAGAG CGTCCTGGGC
TGGAAGGAAT ACGAGATGGA GGTGGTCCGC GACACGGCGG ACAACTGCAT CATCGTCTGC
TCGATCGAGA ACATCGACCC GATGGGCGTC CACACGGGCG ACTCGATCAC CGTCGCCCCG
GCCCTGACCC TGACGGACAA GGAATACCAG TGGATGCGCG CGGCCAGCAT CGCCGTGCTG
CGCGAGATCG GCGTCGAGAC CGGCGGGTCG AACGTGCAGT TCGCGGTCAA TCCGGCCGAC
GGCCGCATGG TGGTGATCGA GATGAACCCG CGCGTGTCGC GCTCGTCCGC GCTGGCCTCC
AAGGCCACCG GCTTCCCGAT CGCCAAGGTC GCCGCCCGCC TGGCCGTCGG CTACACGCTG
GATGAGCTGA AGAACGACAT CACCGGCGGC GCGACCCCGG CCTCGTTCGA GCCCAGCATC
GACTATGTGG TCACCAAGAT CCCGCGCTTC GCCTTCGAGA AGTATCCGGG CAGCGAGCCG
CTGCTGACTA CCGCCATGAA GTCGGTGGGC GAGGTGATGG CCATCGGCCG CACCTTCAAG
GAAAGCGTCC AGAAGGCCCT GCGCGGCCTG GAAACCGGCC TCAACGGCTT CGACGAGATC
GAGATCCACG GCGCCGATGA TCCCGACACC GGCCGGGCCG CGGTGATCCG CGCCCTGGGC
ACGCCCACCC CCGACCGCAT CCGCGTCATC GCCCAGGCCT TCCGCCACGG CCTGACCGTG
GAAGAGGTCA ACGCCGCCTG TTCCTACGAG CCCTGGTTCC TGCGCCAGAT CGCCGAGCTG
GTCCGCCAGG AGGGCTGGGT CCGGGCCGGC GGCCTGCCGA CCGACGCGCA AGGCTTCCGC
GCCCTGAAGG CCCAGGGCTT CTCCGACGCC CGCCTGGCCA AGCTGGTGGG TTCGGACGAA
AAGACCGTGC GCCAGCAGCG CCAGGCCCTG AACGTGCGCC CGGTGTTCAA GCGCATCGAC
AGCTGCGCCG GCGAGTTCGC CGCCACCACG CCCTACATGT ATTCCACCTA CGAGACCGGC
GCCCTGGGCC AGGTCCCCGA GTGCGAAAGC CTGCCGACCA ACCGCAAGAA GGCGGTGATC
CTGGGCGGCG GTCCCAACCG GATCGGCCAG GGCATCGAGT TCGACTACTG CTGCTGCCAC
GCCGCGTTCG CCTTGGACCA GATTGGCGTT GAGTCGATCA TGGTCAACTG CAACCCCGAG
ACCGTCTCGA CCGACTACGA CACCTCCGAC CGCCTGTATT TCGAGCCGCT GACGGCCGAG
GACGTGCTGG AGCTGCTGGA CGTCGAGAAG AGCAACGGCA CGCTGGCCGG CGTCATCGTC
CAGTTCGGCG GCCAGACGCC CCTGAAGCTG GCCCAGGCGC TGCAGGACGC GGGCATTCCG
ATCCTGGGCA CCAGCCCTGA CGCCATCGAC CTGGCAGAGG ACCGCGAGCG CTTCCAGCAA
CTGCTGAACG GCCTGGACAT CGCCCAACCC GAGAACGCCA TCGCCCGCAC CTGGGACGAG
GCCCGCGCGA AGGGCGACGA GATCGGCTTC CCGTTCGTGA TGCGCCCGTC CTACGTGCTG
GGCGGCCGGG GCATGGAGAT CATCCGCGAT CACGAGCACC TGGAACGCTA CATCGCCAAC
ACCGGCGAGA TCTCGTTCGA GCACCCGATC CTGCTGGACC ACTATCTGAG CCGCGCCACC
GAGGTGGACG TCGACGCCCT GTGCGACGGG ACCGACGTGT TCGTGGCCGG CGTGCTGGAG
CATATCGAGG AAGCCGGCGT CCACTCGGGC GACAGCGCCT GCTCGATGCC GCCCTTCTCG
CTCAGCGCCG CCACCGTGGA GGAGCTGAAG CGCCAGACCG TCAAGATGGC CCTGGCCCTG
AACGTCCGCG GCCTGATGAA CGTGCAGTTC GCGATCGAGG AACCGCACAG CGACGCCCCG
CGCATCTATG TGCTGGAAGT GAACCCGCGC GCCTCGCGCA CGGTGCCGTT CGTGGCCAAG
ACCATCGGCC AGCCGGTGGC CGCCATCGCC GCCAAGATCA TGGCCGGCGA GACGCTGGCC
AGCTTCGGCC TCAAGGACGT TCCCTACGAC CACATCGCGG TCAAGGAAGC GGTGTTCCCG
TTCGCCCGCT TCGCCGGCGT CGACACGGTG CTGGGCCCGG AAATGCGCTC GACCGGCGAG
GTCATGGGCT TGGACTGGAT CCGCGAGGGC GAGAACGGCC TGGGTCCGGC CTTCGCCCGC
GCCTTCGCCA AGAGCCAGCT GGGCGGCGGC GTCACCCTGC CGACCACCGG CACGGCCTTC
GTCTCGGTCA AGGAAAGCGA CCGGCCGTGG ATCGTCGAGC CGGTGAAGCT CTTGCAGGCG
GCCGGCTTCA AGGTGCTGTC GACGGTCGGC ACGCGCGGCT ATCTGGCCGA GCAGGGCGTC
GAGGTCGAGT TGGTCAAGAA GGTGCTGGAA GGCCGTCCGC ACATCGTCGA CGTGATGAAG
AACGGCGGCG TGCAGCTGGT GTTCAACACC ACCGAGGGCA AGCAGGCCCT GGAAGACAGC
TTCGAAATCC GCCGCACGGC CCTGATGATG AAGGTGCCCT ACTACACCAC CTCGGCCGGC
GCCCTCGCCG CCGCCCAGGC CATCTCCTCG GCCCCCGCCG AGGCGCTGGA AGTGCGGCCG
CTGCAGAGCT ATCAGTAG
 
Protein sequence
MSLRASFPMP KRTDISSILI IGAGPIVIGQ ACEFDYSGVQ ACKALRAEGY RIILVNSNPA 
TIMTDPDVAD ATYIEPITPD MVAKIIAKER PDALLPTMGG QTALNTALAL EADGTLAKYG
VEMIGAKAEV IDKAEDRQKF RDAMDKLGLE SPKSKAAHNM DEAREGLAFV GLPAIIRPSF
TLAGTGGGIA YNLEEFEEIV ERGLDLSPTT EVLIEESVLG WKEYEMEVVR DTADNCIIVC
SIENIDPMGV HTGDSITVAP ALTLTDKEYQ WMRAASIAVL REIGVETGGS NVQFAVNPAD
GRMVVIEMNP RVSRSSALAS KATGFPIAKV AARLAVGYTL DELKNDITGG ATPASFEPSI
DYVVTKIPRF AFEKYPGSEP LLTTAMKSVG EVMAIGRTFK ESVQKALRGL ETGLNGFDEI
EIHGADDPDT GRAAVIRALG TPTPDRIRVI AQAFRHGLTV EEVNAACSYE PWFLRQIAEL
VRQEGWVRAG GLPTDAQGFR ALKAQGFSDA RLAKLVGSDE KTVRQQRQAL NVRPVFKRID
SCAGEFAATT PYMYSTYETG ALGQVPECES LPTNRKKAVI LGGGPNRIGQ GIEFDYCCCH
AAFALDQIGV ESIMVNCNPE TVSTDYDTSD RLYFEPLTAE DVLELLDVEK SNGTLAGVIV
QFGGQTPLKL AQALQDAGIP ILGTSPDAID LAEDRERFQQ LLNGLDIAQP ENAIARTWDE
ARAKGDEIGF PFVMRPSYVL GGRGMEIIRD HEHLERYIAN TGEISFEHPI LLDHYLSRAT
EVDVDALCDG TDVFVAGVLE HIEEAGVHSG DSACSMPPFS LSAATVEELK RQTVKMALAL
NVRGLMNVQF AIEEPHSDAP RIYVLEVNPR ASRTVPFVAK TIGQPVAAIA AKIMAGETLA
SFGLKDVPYD HIAVKEAVFP FARFAGVDTV LGPEMRSTGE VMGLDWIREG ENGLGPAFAR
AFAKSQLGGG VTLPTTGTAF VSVKESDRPW IVEPVKLLQA AGFKVLSTVG TRGYLAEQGV
EVELVKKVLE GRPHIVDVMK NGGVQLVFNT TEGKQALEDS FEIRRTALMM KVPYYTTSAG
ALAAAQAISS APAEALEVRP LQSYQ
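
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the gene sequence could be cleaned and then checked for its GC fraction and terminal stop codon. It assumes the standard stop codons TAA, TAG and TGA, as used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two blocks and the final line of the gene sequence, for illustration.
    head = clean_sequence("GTGTCTCTGC GCGCGAGTTT")
    tail = clean_sequence("CTGCAGAGCT ATCAGTAG")
    print(gc_fraction(head))     # 0.6 for these 20 bases only
    print(ends_with_stop(tail))  # True: the gene ends in TAG
```

Running `gc_fraction` on the full cleaned gene sequence, rather than on the short excerpt used here, gives the value to compare against the GC content field of the record.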