Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1037 |
Symbol | |
ID | 5898492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1094552 |
End bp | 1095718 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561519 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_001682665 |
Protein GI | 167645002 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCAAG ACCTTCTCCC CGGCGTTACC GGCGTTCTGG CCCTGGCCGA CGGCACGATC CTGCAAGGGG TCGGCTGCGG CGCGACCGGC GACGCGGTCG GCGAGGTGTG CTTCAACACC GCCATGACCG GCTACCAGGA GATCCTCACC GATCCCTCCT ACATGGCCCA GATCGTCGCC TTCACCTTCC CGCACGTCGG CAATGTGGGC ACGAACGTCG AGGACCTGGA ACAGATGGCC GGCGGGGCCG AGACGGCGGC GCGCGGCGCG ATCTTCCGCG ATGCTCCCAC CCACCAGGCC AACTGGCGCG CCGACAGCGA TTTCGACGGC TGGATGAAGC GCCGCAACGT CATCGGCCTG GCCGGCGTCG ACACCCGCGC CCTGACCCGC AAGATCCGCG AGACAGGCAT GCCGCACGGT GTGATCGCCC ACGCGCCGGA CGGCGTCTTC GACCTCCCCG CCCTGGTCGC CAAGGCCAAG GCCTGGGCTG GCCTCGAGGG CCTGGACCTG GCCAAGGACG CCTCCACCAC CCAGACCTTC ACCTGGGACG AGGGCCTGTG GTCGTGGCCG GAAGGCTACG CCAAGCTGGA CAAGCCCAAG TACGAGGTCG TGGTCCTCGA CTACGGCGTC AAGCGCAACA TCCTGCGCGC CCTGGCCCAT GTCGGCGCCC GCGCCACGGT GGTGCCGGCC GACACCTCGG CCGAGGCCAT TCTCGCGCGC AACCCCGACG GCGTGCTGCT GTCCAACGGA CCGGGCGACC CGGCCGCCAC CGGCGTCTAC GCGGTTCCGG TCATTCAGGC GCTGGTCGCC AGCGGCAAGC CGGTGTTCGG CATCTGCCTG GGACACCAGA TGCTGGCCCT GGCGGTGGGC GCCCAGACCG TGAAGATGGA ACAGGGACAC CACGGGGCCA ACCACCCGGT GAAGGACCTG ACGACCGGCA AGGTCGAGAT CGTCTCGATG AACCACGGCT TCACGGTGGA CAGCGCGAGC CTGCCGGCCG CCGTCACCGA GACCCACGTC TCGCTGTTCG ACGGCACCAA CGCCGGCATC GCCCTGGAGG GCAAGCCGGT GTTCTCGGTG CAGCACCACC CCGAGGCGTC GCCTGGCCCG ACCGACAGCC TGTACCTGTT CGAGCGCTTC GCGGGGCTGA TGGATGCGGC GAAGTAG
|
Protein sequence | MSQDLLPGVT GVLALADGTI LQGVGCGATG DAVGEVCFNT AMTGYQEILT DPSYMAQIVA FTFPHVGNVG TNVEDLEQMA GGAETAARGA IFRDAPTHQA NWRADSDFDG WMKRRNVIGL AGVDTRALTR KIRETGMPHG VIAHAPDGVF DLPALVAKAK AWAGLEGLDL AKDASTTQTF TWDEGLWSWP EGYAKLDKPK YEVVVLDYGV KRNILRALAH VGARATVVPA DTSAEAILAR NPDGVLLSNG PGDPAATGVY AVPVIQALVA SGKPVFGICL GHQMLALAVG AQTVKMEQGH HGANHPVKDL TTGKVEIVSM NHGFTVDSAS LPAAVTETHV SLFDGTNAGI ALEGKPVFSV QHHPEASPGP TDSLYLFERF AGLMDAAK
|
| |