Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4449 |
Symbol | |
ID | 5901910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4813887 |
End bp | 4817603 |
Gene Length | 3717 bp |
Protein Length | 1238 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564968 |
Product | urea carboxylase |
Protein accession | YP_001686067 |
Protein GI | 167648404 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAACG AACCTCCCTT CGACAAGGTC CTGATCGCCA ACCGCGGCGC CATCGCCTGC CGGATCATCC GTACCCTGAA GGCCATGGGC GTGAAGTCGG TGGCGGTGTT CAGCGACGCC GACGCCGGAT CGCTGCACGT CAGCCTGGCC GACGAGGCGG TGCGGATCGG CCCCGCCCCC GCCGCCGAGA GCTACCTGCG CGGCGACCTG ATCCTGGCGG CGGCCAGGGC GACCGGCGCC CAGGCCATCC ATCCGGGCTA TGGCTTCCTC AGCGAGAACG CGGGCTTCGC GGAGGCCTGC GAAGCGGAAG GGATCGCCTT CATTGGTCCG ACAGCGGACA ATATCCGCGC CTTCGGGCTC AAGCACACGG CCCGGGACCT CGCCCAGGCC CACGGCGTGC CGCTGGCGCC GGGCACCGAC CTGCTGGTCG ATCCGGCGGC CGCGTTGGAA GCGGCCCAGC GCATCGGCTT CCCGGTCATC CTGAAGGCCA CGGCCGGCGG CGGCGGCATC GGCATGCGGG TCTGTGAAAC CGCCGAAGCC GTGAGCGAAG CCTTCGCGGC CGTCCAGCGG CTGGCGAACG GCCATTTCAG CGACGGCGGG GTGTTCCTCG AGCGCTATGT GCGCAAGGCG CGCCACGTCG AGGTGCAGAT GTTCGGCGAC GGGGCCGGCA AGGTCGTGGC CCTGGGCGAG CGCGACTGCT CGCTGCAGCG GCGCAACCAG AAGGTCGTCG AGGAGACCCC CGCCCCCGGC CTGCCCGCCG CCACCCGCGC GGCCCTGCTG GACGCCGCCG TGCGCCTGGC CTCGGCCGCC CACTACCGCT CGGCCGGCAC GGTGGAGTTC CTCTACGACG CCGAGCGCGA CGACTTCTTT TTCCTAGAGG TCAACACCCG GTTGCAGGTC GAGCACGGCG TCACCGAGCA GGTGACGGGC GTGGACCTGG TCGAGTGGAT GGTGCGCGGC GCGGCCGGCG ATTTTACCTT CCTCGATGTG GCGCCGCCCG AGCCGAAAGG CGCGTCGATC CAGGTGCGGC TCTATGCCGA GGACCCGGCC CAGGGCTATC GGCCCAGCTC CGGCGTGCTG ACCCAGGTTT CGTTCCCGCC AAGCGTCCGC GCCGATGGCT GGGTGGTCGA TGGAACCGAG GTCAGCGCCT TCTACGACCC GCTGCTGGCC AAGCTGATCG TCACGGCCGC TGATCGCCCC GCCGCCGTCG CGGCCTTGCA GGCTGCGCTG GACGACACCC GCCTGGCCGG GATCGAGACC AATCTGGACT GGTTGCGCAC GGTCGTTCGC TCCGAGGCCT TCGTCAGCGG CGAGGTCTCG ACCCGGGCGC TGGAGAGCGT GACCTGGCGG CCCGACACCC TCCAGGTGCT CAGTGGCGGT CCGGCCACCA CCGTCCAGGA CTGGCCGGGC CGCCAAGGCT ACTGGGACGT CGGCGTGCCC CCCTCCGGCC CGATGGACGC CTTCGCCTTC CGGCTGGGCA ACCGCCTGCT GGGCAATGGC GAGGACGCGG CCGGGCTTGA GATCACCGCG CTTGGCCCGA CGCTGAAGTT CAATCGCCCC GCCGTGGTCT GCCTGACCGG CGCGGCGTTC GACGCCAAGC TCGATGGCGC GCCCTTCGAC GCCTATGCCC CGATCGCCGT CGCCGCCGGC CAGACCCTGA AGATCGGCCG GGTGCTCGGC GCCGGCCTGC GCGGTTACCT GCTGTTCCAG GGCGGGCTGG ACGTACCGGT CTATCTGGGC AGCCGCTCGA CCTTCACCCT GGGCGGCTTC GGCGGCCACG CCGGCCGCAA CCTGACCGCC GGAGACGTGC TGCGGTTGGC CTCCGATGGG GACAGGCGCA TGCGCCTGTC CCCAACCGCG CCCCTGCCCC TCAACCTACG CCCCGCCACC ACCAAGGCCT GGACCATCCG CGTCCTGCCC GGCCCGCACG GCGCGCCGGA CTTCTTCACC CCGGCCGACG TCGCCATGAT CGGCGGCGTG GATTGGAAGG CGCACTACAA TTCCAACCGC ACTGGCGTGC GGCTGGTCGG TCCCAAGCCC GAATGGGCGC GGCGCGACGG CGGCGAGGCC GGGCTGCATC CGTCCAACAT CCACGACAAC GCCTACGCCA TCGGCGCGGT GGATTTCACC GGCGACATGC CGATCATCCT GGGTCCGGAC GGGCCGTCGC TGGGCGGGTT CGTGTGCCCG TTCGTGATCA TCCAGGCGGA TCTCTGGAAG GCCGGGCAGT TGGCGCCGGG GGATACGGTG CGGTTTGAGG TGGTGGGGGA TGCTGAGGCG GCTGCTGCTC TCGCCGAACA GGAGGCCTTG CTCGAAACGC TCAGCGACTC CCCCCACCGA CCCTTCGGGT CGCCTCCCCC ACGGGGGGAG GCTTTGAGCC AGGTCATGCC TCCTCCTGTG GGGGCGGTGT CGGCGCGGCC GACGGAGGGG GGAGAACCTG TCCTTCAGGA ACTCCCCGCC GACGGCCACC GCCCGCACGT CACCTACCGC CGCCAGGGCG ACCAGCACCT GCTCGTCGAA TACGGCCCCA TCGTGCTCGA CCTGGAACTG CGGCTGCGCA TCCACGCCCT ACACCTGGAC CTGCAGGCCC TCGCCCTGCC CGCCGTCATC GACCTGACCC CCGGCATCCG CTCGCTGCAG GTCCACTATG ACAGCCGCCG CCTGACCCAG GCCGACCTGC TGGCCGCGCT GACAGCCGCC GAGGAACGCC TGGGTGGCCT GGATGATTTC GAGATCCCGT CGCGCGTGGT CCACCTGCCG CTCAGTTGGA AGGACCCCGC GATCTACCAA ACCATCGACA AGTACATGCA AGCCGTCCGC GACGACGCGC CGTGGTGCCC GGACAACATC GAATTTATCC GCCGGGTCAA CGGCCTGGAC AGCATCGACG ACGTCCAGCG CATCGTCTTC GACGCCCGCT ACCTGGTGAT GGGCCTGGGC GACGTCTATC TGGGCGCGCC GGTCGCCACC CCGGTCGATC CGCGCCACCG CCTGGTGACC ACCAAGTACA ACCCCGCCCG CACCTGGACC CCGCCCAACG TGGTGGGCAT CGGCGGGGCC TATATGTGCA TCTACGGCAT GGAGGGACCG GGCGGCTACC AGCTGTTCGG CCGCACCATC CAGGTGTGGA ACACCTGGCG CCAGACCGAA GCCTTCACGG GCGGCAAGCC CTGGCTCTTG AGATTCTTCG ACCAGATCCG CTTCTTCCCG GTCAGCGCCG AGGAACTGGT GGAATGGCGA CGCGACTTCC CGCTGGGCCG TCGCCAGATC CGCATCGAGG AAGAGACCTT CCGGCTGTCG GACTATCGCA AGATGCTGGC CGACAACGCC GGCGGCATCG AAGCCTTCCA GACCACGCGC CAGGCCGCCT TCGACGCCGA GCGCGCCGAC TGGGAGGCCA GGGGCGAGTT CGCGCGGGTC GAAGCCTTGT CGAGCGTGGC CGACGATGGC GGCGAGGTCG CGGCGATCAT CGTCCCCGAC GGCAGCGACC TGGTCGAGGC CCCGCTGGGC GGCAATGTCT GGAAGGTGCT GGTCGAGCCC GGGCAACGGG TCGAGGCCGG GGCGGTGATC GCGGTGATCG AGGCCATGAA GGCCGAGTGC GACGTCAACA GCCCGACAGC CGGCGTCGTC ACCGCTGTCT ACGCCCAGCC GGGCGGGGCC ATCGCCGCCG GCGCGCCGAT CGTCGCCATC GCGCCCGATC TGGAGGCGGC GGCTTGA
|
Protein sequence | MLNEPPFDKV LIANRGAIAC RIIRTLKAMG VKSVAVFSDA DAGSLHVSLA DEAVRIGPAP AAESYLRGDL ILAAARATGA QAIHPGYGFL SENAGFAEAC EAEGIAFIGP TADNIRAFGL KHTARDLAQA HGVPLAPGTD LLVDPAAALE AAQRIGFPVI LKATAGGGGI GMRVCETAEA VSEAFAAVQR LANGHFSDGG VFLERYVRKA RHVEVQMFGD GAGKVVALGE RDCSLQRRNQ KVVEETPAPG LPAATRAALL DAAVRLASAA HYRSAGTVEF LYDAERDDFF FLEVNTRLQV EHGVTEQVTG VDLVEWMVRG AAGDFTFLDV APPEPKGASI QVRLYAEDPA QGYRPSSGVL TQVSFPPSVR ADGWVVDGTE VSAFYDPLLA KLIVTAADRP AAVAALQAAL DDTRLAGIET NLDWLRTVVR SEAFVSGEVS TRALESVTWR PDTLQVLSGG PATTVQDWPG RQGYWDVGVP PSGPMDAFAF RLGNRLLGNG EDAAGLEITA LGPTLKFNRP AVVCLTGAAF DAKLDGAPFD AYAPIAVAAG QTLKIGRVLG AGLRGYLLFQ GGLDVPVYLG SRSTFTLGGF GGHAGRNLTA GDVLRLASDG DRRMRLSPTA PLPLNLRPAT TKAWTIRVLP GPHGAPDFFT PADVAMIGGV DWKAHYNSNR TGVRLVGPKP EWARRDGGEA GLHPSNIHDN AYAIGAVDFT GDMPIILGPD GPSLGGFVCP FVIIQADLWK AGQLAPGDTV RFEVVGDAEA AAALAEQEAL LETLSDSPHR PFGSPPPRGE ALSQVMPPPV GAVSARPTEG GEPVLQELPA DGHRPHVTYR RQGDQHLLVE YGPIVLDLEL RLRIHALHLD LQALALPAVI DLTPGIRSLQ VHYDSRRLTQ ADLLAALTAA EERLGGLDDF EIPSRVVHLP LSWKDPAIYQ TIDKYMQAVR DDAPWCPDNI EFIRRVNGLD SIDDVQRIVF DARYLVMGLG DVYLGAPVAT PVDPRHRLVT TKYNPARTWT PPNVVGIGGA YMCIYGMEGP GGYQLFGRTI QVWNTWRQTE AFTGGKPWLL RFFDQIRFFP VSAEELVEWR RDFPLGRRQI RIEEETFRLS DYRKMLADNA GGIEAFQTTR QAAFDAERAD WEARGEFARV EALSSVADDG GEVAAIIVPD GSDLVEAPLG GNVWKVLVEP GQRVEAGAVI AVIEAMKAEC DVNSPTAGVV TAVYAQPGGA IAAGAPIVAI APDLEAAA
|
| |