Gene Caul_4449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4449 
Symbol 
ID5901910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4813887 
End bp4817603 
Gene Length3717 bp 
Protein Length1238 aa 
Translation table11 
GC content71% 
IMG OID641564968 
Producturea carboxylase 
Protein accessionYP_001686067 
Protein GI167648404 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2
[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAACG AACCTCCCTT CGACAAGGTC CTGATCGCCA ACCGCGGCGC CATCGCCTGC 
CGGATCATCC GTACCCTGAA GGCCATGGGC GTGAAGTCGG TGGCGGTGTT CAGCGACGCC
GACGCCGGAT CGCTGCACGT CAGCCTGGCC GACGAGGCGG TGCGGATCGG CCCCGCCCCC
GCCGCCGAGA GCTACCTGCG CGGCGACCTG ATCCTGGCGG CGGCCAGGGC GACCGGCGCC
CAGGCCATCC ATCCGGGCTA TGGCTTCCTC AGCGAGAACG CGGGCTTCGC GGAGGCCTGC
GAAGCGGAAG GGATCGCCTT CATTGGTCCG ACAGCGGACA ATATCCGCGC CTTCGGGCTC
AAGCACACGG CCCGGGACCT CGCCCAGGCC CACGGCGTGC CGCTGGCGCC GGGCACCGAC
CTGCTGGTCG ATCCGGCGGC CGCGTTGGAA GCGGCCCAGC GCATCGGCTT CCCGGTCATC
CTGAAGGCCA CGGCCGGCGG CGGCGGCATC GGCATGCGGG TCTGTGAAAC CGCCGAAGCC
GTGAGCGAAG CCTTCGCGGC CGTCCAGCGG CTGGCGAACG GCCATTTCAG CGACGGCGGG
GTGTTCCTCG AGCGCTATGT GCGCAAGGCG CGCCACGTCG AGGTGCAGAT GTTCGGCGAC
GGGGCCGGCA AGGTCGTGGC CCTGGGCGAG CGCGACTGCT CGCTGCAGCG GCGCAACCAG
AAGGTCGTCG AGGAGACCCC CGCCCCCGGC CTGCCCGCCG CCACCCGCGC GGCCCTGCTG
GACGCCGCCG TGCGCCTGGC CTCGGCCGCC CACTACCGCT CGGCCGGCAC GGTGGAGTTC
CTCTACGACG CCGAGCGCGA CGACTTCTTT TTCCTAGAGG TCAACACCCG GTTGCAGGTC
GAGCACGGCG TCACCGAGCA GGTGACGGGC GTGGACCTGG TCGAGTGGAT GGTGCGCGGC
GCGGCCGGCG ATTTTACCTT CCTCGATGTG GCGCCGCCCG AGCCGAAAGG CGCGTCGATC
CAGGTGCGGC TCTATGCCGA GGACCCGGCC CAGGGCTATC GGCCCAGCTC CGGCGTGCTG
ACCCAGGTTT CGTTCCCGCC AAGCGTCCGC GCCGATGGCT GGGTGGTCGA TGGAACCGAG
GTCAGCGCCT TCTACGACCC GCTGCTGGCC AAGCTGATCG TCACGGCCGC TGATCGCCCC
GCCGCCGTCG CGGCCTTGCA GGCTGCGCTG GACGACACCC GCCTGGCCGG GATCGAGACC
AATCTGGACT GGTTGCGCAC GGTCGTTCGC TCCGAGGCCT TCGTCAGCGG CGAGGTCTCG
ACCCGGGCGC TGGAGAGCGT GACCTGGCGG CCCGACACCC TCCAGGTGCT CAGTGGCGGT
CCGGCCACCA CCGTCCAGGA CTGGCCGGGC CGCCAAGGCT ACTGGGACGT CGGCGTGCCC
CCCTCCGGCC CGATGGACGC CTTCGCCTTC CGGCTGGGCA ACCGCCTGCT GGGCAATGGC
GAGGACGCGG CCGGGCTTGA GATCACCGCG CTTGGCCCGA CGCTGAAGTT CAATCGCCCC
GCCGTGGTCT GCCTGACCGG CGCGGCGTTC GACGCCAAGC TCGATGGCGC GCCCTTCGAC
GCCTATGCCC CGATCGCCGT CGCCGCCGGC CAGACCCTGA AGATCGGCCG GGTGCTCGGC
GCCGGCCTGC GCGGTTACCT GCTGTTCCAG GGCGGGCTGG ACGTACCGGT CTATCTGGGC
AGCCGCTCGA CCTTCACCCT GGGCGGCTTC GGCGGCCACG CCGGCCGCAA CCTGACCGCC
GGAGACGTGC TGCGGTTGGC CTCCGATGGG GACAGGCGCA TGCGCCTGTC CCCAACCGCG
CCCCTGCCCC TCAACCTACG CCCCGCCACC ACCAAGGCCT GGACCATCCG CGTCCTGCCC
GGCCCGCACG GCGCGCCGGA CTTCTTCACC CCGGCCGACG TCGCCATGAT CGGCGGCGTG
GATTGGAAGG CGCACTACAA TTCCAACCGC ACTGGCGTGC GGCTGGTCGG TCCCAAGCCC
GAATGGGCGC GGCGCGACGG CGGCGAGGCC GGGCTGCATC CGTCCAACAT CCACGACAAC
GCCTACGCCA TCGGCGCGGT GGATTTCACC GGCGACATGC CGATCATCCT GGGTCCGGAC
GGGCCGTCGC TGGGCGGGTT CGTGTGCCCG TTCGTGATCA TCCAGGCGGA TCTCTGGAAG
GCCGGGCAGT TGGCGCCGGG GGATACGGTG CGGTTTGAGG TGGTGGGGGA TGCTGAGGCG
GCTGCTGCTC TCGCCGAACA GGAGGCCTTG CTCGAAACGC TCAGCGACTC CCCCCACCGA
CCCTTCGGGT CGCCTCCCCC ACGGGGGGAG GCTTTGAGCC AGGTCATGCC TCCTCCTGTG
GGGGCGGTGT CGGCGCGGCC GACGGAGGGG GGAGAACCTG TCCTTCAGGA ACTCCCCGCC
GACGGCCACC GCCCGCACGT CACCTACCGC CGCCAGGGCG ACCAGCACCT GCTCGTCGAA
TACGGCCCCA TCGTGCTCGA CCTGGAACTG CGGCTGCGCA TCCACGCCCT ACACCTGGAC
CTGCAGGCCC TCGCCCTGCC CGCCGTCATC GACCTGACCC CCGGCATCCG CTCGCTGCAG
GTCCACTATG ACAGCCGCCG CCTGACCCAG GCCGACCTGC TGGCCGCGCT GACAGCCGCC
GAGGAACGCC TGGGTGGCCT GGATGATTTC GAGATCCCGT CGCGCGTGGT CCACCTGCCG
CTCAGTTGGA AGGACCCCGC GATCTACCAA ACCATCGACA AGTACATGCA AGCCGTCCGC
GACGACGCGC CGTGGTGCCC GGACAACATC GAATTTATCC GCCGGGTCAA CGGCCTGGAC
AGCATCGACG ACGTCCAGCG CATCGTCTTC GACGCCCGCT ACCTGGTGAT GGGCCTGGGC
GACGTCTATC TGGGCGCGCC GGTCGCCACC CCGGTCGATC CGCGCCACCG CCTGGTGACC
ACCAAGTACA ACCCCGCCCG CACCTGGACC CCGCCCAACG TGGTGGGCAT CGGCGGGGCC
TATATGTGCA TCTACGGCAT GGAGGGACCG GGCGGCTACC AGCTGTTCGG CCGCACCATC
CAGGTGTGGA ACACCTGGCG CCAGACCGAA GCCTTCACGG GCGGCAAGCC CTGGCTCTTG
AGATTCTTCG ACCAGATCCG CTTCTTCCCG GTCAGCGCCG AGGAACTGGT GGAATGGCGA
CGCGACTTCC CGCTGGGCCG TCGCCAGATC CGCATCGAGG AAGAGACCTT CCGGCTGTCG
GACTATCGCA AGATGCTGGC CGACAACGCC GGCGGCATCG AAGCCTTCCA GACCACGCGC
CAGGCCGCCT TCGACGCCGA GCGCGCCGAC TGGGAGGCCA GGGGCGAGTT CGCGCGGGTC
GAAGCCTTGT CGAGCGTGGC CGACGATGGC GGCGAGGTCG CGGCGATCAT CGTCCCCGAC
GGCAGCGACC TGGTCGAGGC CCCGCTGGGC GGCAATGTCT GGAAGGTGCT GGTCGAGCCC
GGGCAACGGG TCGAGGCCGG GGCGGTGATC GCGGTGATCG AGGCCATGAA GGCCGAGTGC
GACGTCAACA GCCCGACAGC CGGCGTCGTC ACCGCTGTCT ACGCCCAGCC GGGCGGGGCC
ATCGCCGCCG GCGCGCCGAT CGTCGCCATC GCGCCCGATC TGGAGGCGGC GGCTTGA
 
Protein sequence
MLNEPPFDKV LIANRGAIAC RIIRTLKAMG VKSVAVFSDA DAGSLHVSLA DEAVRIGPAP 
AAESYLRGDL ILAAARATGA QAIHPGYGFL SENAGFAEAC EAEGIAFIGP TADNIRAFGL
KHTARDLAQA HGVPLAPGTD LLVDPAAALE AAQRIGFPVI LKATAGGGGI GMRVCETAEA
VSEAFAAVQR LANGHFSDGG VFLERYVRKA RHVEVQMFGD GAGKVVALGE RDCSLQRRNQ
KVVEETPAPG LPAATRAALL DAAVRLASAA HYRSAGTVEF LYDAERDDFF FLEVNTRLQV
EHGVTEQVTG VDLVEWMVRG AAGDFTFLDV APPEPKGASI QVRLYAEDPA QGYRPSSGVL
TQVSFPPSVR ADGWVVDGTE VSAFYDPLLA KLIVTAADRP AAVAALQAAL DDTRLAGIET
NLDWLRTVVR SEAFVSGEVS TRALESVTWR PDTLQVLSGG PATTVQDWPG RQGYWDVGVP
PSGPMDAFAF RLGNRLLGNG EDAAGLEITA LGPTLKFNRP AVVCLTGAAF DAKLDGAPFD
AYAPIAVAAG QTLKIGRVLG AGLRGYLLFQ GGLDVPVYLG SRSTFTLGGF GGHAGRNLTA
GDVLRLASDG DRRMRLSPTA PLPLNLRPAT TKAWTIRVLP GPHGAPDFFT PADVAMIGGV
DWKAHYNSNR TGVRLVGPKP EWARRDGGEA GLHPSNIHDN AYAIGAVDFT GDMPIILGPD
GPSLGGFVCP FVIIQADLWK AGQLAPGDTV RFEVVGDAEA AAALAEQEAL LETLSDSPHR
PFGSPPPRGE ALSQVMPPPV GAVSARPTEG GEPVLQELPA DGHRPHVTYR RQGDQHLLVE
YGPIVLDLEL RLRIHALHLD LQALALPAVI DLTPGIRSLQ VHYDSRRLTQ ADLLAALTAA
EERLGGLDDF EIPSRVVHLP LSWKDPAIYQ TIDKYMQAVR DDAPWCPDNI EFIRRVNGLD
SIDDVQRIVF DARYLVMGLG DVYLGAPVAT PVDPRHRLVT TKYNPARTWT PPNVVGIGGA
YMCIYGMEGP GGYQLFGRTI QVWNTWRQTE AFTGGKPWLL RFFDQIRFFP VSAEELVEWR
RDFPLGRRQI RIEEETFRLS DYRKMLADNA GGIEAFQTTR QAAFDAERAD WEARGEFARV
EALSSVADDG GEVAAIIVPD GSDLVEAPLG GNVWKVLVEP GQRVEAGAVI AVIEAMKAEC
DVNSPTAGVV TAVYAQPGGA IAAGAPIVAI APDLEAAA