Gene Caul_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1037 
Symbol 
ID5898492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1094552 
End bp1095718 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content70% 
IMG OID641561519 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_001682665 
Protein GI167645002 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAG ACCTTCTCCC CGGCGTTACC GGCGTTCTGG CCCTGGCCGA CGGCACGATC 
CTGCAAGGGG TCGGCTGCGG CGCGACCGGC GACGCGGTCG GCGAGGTGTG CTTCAACACC
GCCATGACCG GCTACCAGGA GATCCTCACC GATCCCTCCT ACATGGCCCA GATCGTCGCC
TTCACCTTCC CGCACGTCGG CAATGTGGGC ACGAACGTCG AGGACCTGGA ACAGATGGCC
GGCGGGGCCG AGACGGCGGC GCGCGGCGCG ATCTTCCGCG ATGCTCCCAC CCACCAGGCC
AACTGGCGCG CCGACAGCGA TTTCGACGGC TGGATGAAGC GCCGCAACGT CATCGGCCTG
GCCGGCGTCG ACACCCGCGC CCTGACCCGC AAGATCCGCG AGACAGGCAT GCCGCACGGT
GTGATCGCCC ACGCGCCGGA CGGCGTCTTC GACCTCCCCG CCCTGGTCGC CAAGGCCAAG
GCCTGGGCTG GCCTCGAGGG CCTGGACCTG GCCAAGGACG CCTCCACCAC CCAGACCTTC
ACCTGGGACG AGGGCCTGTG GTCGTGGCCG GAAGGCTACG CCAAGCTGGA CAAGCCCAAG
TACGAGGTCG TGGTCCTCGA CTACGGCGTC AAGCGCAACA TCCTGCGCGC CCTGGCCCAT
GTCGGCGCCC GCGCCACGGT GGTGCCGGCC GACACCTCGG CCGAGGCCAT TCTCGCGCGC
AACCCCGACG GCGTGCTGCT GTCCAACGGA CCGGGCGACC CGGCCGCCAC CGGCGTCTAC
GCGGTTCCGG TCATTCAGGC GCTGGTCGCC AGCGGCAAGC CGGTGTTCGG CATCTGCCTG
GGACACCAGA TGCTGGCCCT GGCGGTGGGC GCCCAGACCG TGAAGATGGA ACAGGGACAC
CACGGGGCCA ACCACCCGGT GAAGGACCTG ACGACCGGCA AGGTCGAGAT CGTCTCGATG
AACCACGGCT TCACGGTGGA CAGCGCGAGC CTGCCGGCCG CCGTCACCGA GACCCACGTC
TCGCTGTTCG ACGGCACCAA CGCCGGCATC GCCCTGGAGG GCAAGCCGGT GTTCTCGGTG
CAGCACCACC CCGAGGCGTC GCCTGGCCCG ACCGACAGCC TGTACCTGTT CGAGCGCTTC
GCGGGGCTGA TGGATGCGGC GAAGTAG
 
Protein sequence
MSQDLLPGVT GVLALADGTI LQGVGCGATG DAVGEVCFNT AMTGYQEILT DPSYMAQIVA 
FTFPHVGNVG TNVEDLEQMA GGAETAARGA IFRDAPTHQA NWRADSDFDG WMKRRNVIGL
AGVDTRALTR KIRETGMPHG VIAHAPDGVF DLPALVAKAK AWAGLEGLDL AKDASTTQTF
TWDEGLWSWP EGYAKLDKPK YEVVVLDYGV KRNILRALAH VGARATVVPA DTSAEAILAR
NPDGVLLSNG PGDPAATGVY AVPVIQALVA SGKPVFGICL GHQMLALAVG AQTVKMEQGH
HGANHPVKDL TTGKVEIVSM NHGFTVDSAS LPAAVTETHV SLFDGTNAGI ALEGKPVFSV
QHHPEASPGP TDSLYLFERF AGLMDAAK