Gene Caul_3151 details


Gene Information

Locus tag: Caul_3151
Symbol:
ID: 5900606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3414085
End bp: 3416058
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 70%
IMG OID: 641563654
Product: carbamoyl-phosphate synthase L chain ATP-binding
Protein accession: YP_001684776
Protein GI: 167647113
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
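
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3414085, 3416058)
print(length)                      # 1974
print(protein_length_aa(length))   # 657
```

Both results agree with the Gene Length (1974 bp) and Protein Length (657 aa) fields.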


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.111688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATCTCGT CCGTCCTCAT CGCCAATCGC GGCGAAATCG CCCGTCGGAT CATCCGCACC 
GCCCGCGAGT TGGGCGTGCG GACGATCGCG GTCTATTCCG AGGCGGACGC CGAGGCGCCG
TTCGTGATGG AGGCTGACGT CGCCATCCTG ATCGGCCCGG CCCCGGCGCG CGAGAGCTAC
CTGGTCCCCG AGAAGATCCT GGCCGCCGCC CGCCAGTCCG GCGCCGAGGC GATCCATCCC
GGCTACGGCT TCCTGTCCGA GAACGCCGAC TTCGCCCAGG CGGTGATCGA CGCCGGCCTG
ATCTGGATCG GCCCGCCGCC CTCGGCGATC CGCGCCATGG GCCTGAAGGA CGCGGCCAAG
AAGCTCATGA TCGCGGCGGG CGTGCCGACC ACCCCCGGCT ATCTGGGCGA GGATCAGTCG
GTCGAACGCC TGGCGGCCGA GGCGGCCAAG ATCGGCTTTC CAGTGCTGAT CAAGGCCGTG
GCGGGCGGCG GCGGCAAGGG CATGCGCAAG GTCGAGAAGG CGGAGGACTT CGCCGCCCTG
CTGGCCTCGT GCCAGCGCGA AGCCAGCGCC AGCTTCGGCG ACCAGCACGT GTTGCTGGAG
AAATACGTTA CCCGCCCGCG CCACATCGAG GTGCAGGTGT TCGGCGACAG CCACGGCAAC
GTCGTCCACC TGTTCGAGCG CGATTGCTCG CTGCAGCGTC GCCACCAGAA GGTCATCGAG
GAGGCGCCGG CCCCCGGCAT GGACGAGGCC ACTCGCGCCG CCGTCTGCGG GGCGGCGGTC
AAGGCCGCCC AGGCCGTGGG CTATGTCGGG GCCGGCACGG TGGAATTCAT CGCCGACGCT
TCCGAAGGCC TGCGCGCAGA CCGCATCTGG TTCATGGAGA TGAACACCCG GCTGCAGGTC
GAGCACCCGG TCACCGAGAT GGTCACCGGC CAGGACCTGG TCGAATGGCA GCTTCTGGTG
GCCTCGGGCG AGACCCTGCC GCTGGAGCAG GACGAGATCA CGCTGGACGG CTGGGCCATG
GAGGCCCGTC TCTACGCCGA GAACCCGGCC ACCGGCTTCC TGCCGAGCAT CGGCCCGCTG
ACCCACTTCC GCCTGCCCGA GGGCGATGTT CGGGTCGACA GCGCGGTGGA GGAGGGCGGC
GAGGTCACCC CGTTCTACGA CCCGATGATC GCCAAGCTGA TCGCCCACGG CGTCGACCGC
GAGGACGCCG CCGCCCGGCT GGCGCAGGCC TGCCGGCTGG TCGAGGTCTA TCCGGTCAAG
ACCAACGCCG CCTTCCTGGC CAAGTGCGCC AGCCACCCGG ACTTCATCGA CGGCGCCATC
GACACCGGCT TCATCGAAGC GCGGCTGGAG GAACTGACCG ACCGCGCCTT CACCGACGCC
CCGACCTTGG CGGCGATCGG CCAGCGGCTG GAGGCCTTCA TGGAGGCCGA CCAGCCGCGA
GCCGATGTCT GGGCCAGCGC GCCCTCGCGC CTGCTGGGCT TCCGAATGAA CGCGCCGCGC
GCGGCCATGA CCCTGCCGAT GAGCATCGAC GGCAAGGCCG TGCCGCTGCG CGTAGCCTTG
GCGGGTGGCG CGGGCGACGA CTGGTCATGG GATATCACCG TCGAGGACGG CCGACCGCTC
GATGACGTCG ACGTGTTGCC GTCGGCCTTT GGTGGCGACC CGCTCTACGT GTTCGAGGGC
GGAGACGTCC GCGAATTCGA TTTCGAGCCC AAGGTCGGCG CGGCCCATGT CGCCGCTTCG
GACGGGGCGA TCCTGTCGCC GATGCCGGGC AAGATCGTCT CCGTCTCGGT CGAGGCTGGT
CAGACGGTGG TCAGGGGCCA GACCCTGCTG ACCCTCGAGG CCATGAAGAT GGAGCACGCC
CTGGCCGCGC CGTTCGACGG CGTGGTGGCC GAGCTGTCGG CCGTCGCCGG CGGCCAGGTC
AGCGAAGGCG TGGTGCTGGC GCGGCTTGAG CCGGCGGTCG CCCTGGCGTC CTAG
 
Protein sequence
MISSVLIANR GEIARRIIRT ARELGVRTIA VYSEADAEAP FVMEADVAIL IGPAPARESY 
LVPEKILAAA RQSGAEAIHP GYGFLSENAD FAQAVIDAGL IWIGPPPSAI RAMGLKDAAK
KLMIAAGVPT TPGYLGEDQS VERLAAEAAK IGFPVLIKAV AGGGGKGMRK VEKAEDFAAL
LASCQREASA SFGDQHVLLE KYVTRPRHIE VQVFGDSHGN VVHLFERDCS LQRRHQKVIE
EAPAPGMDEA TRAAVCGAAV KAAQAVGYVG AGTVEFIADA SEGLRADRIW FMEMNTRLQV
EHPVTEMVTG QDLVEWQLLV ASGETLPLEQ DEITLDGWAM EARLYAENPA TGFLPSIGPL
THFRLPEGDV RVDSAVEEGG EVTPFYDPMI AKLIAHGVDR EDAAARLAQA CRLVEVYPVK
TNAAFLAKCA SHPDFIDGAI DTGFIEARLE ELTDRAFTDA PTLAAIGQRL EAFMEADQPR
ADVWASAPSR LLGFRMNAPR AAMTLPMSID GKAVPLRVAL AGGAGDDWSW DITVEDGRPL
DDVDVLPSAF GGDPLYVFEG GDVREFDFEP KVGAAHVAAS DGAILSPMPG KIVSVSVEAG
QTVVRGQTLL TLEAMKMEHA LAAPFDGVVA ELSAVAGGQV SEGVVLARLE PAVALAS
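
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a GC-fraction calculation and a codon split; the helper names are hypothetical, and the example input is only the first line of the gene sequence rather than the full record.

```python
def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First line of the gene sequence above, used purely as sample input.
first_line = "TTGATCTCGT CCGTCCTCAT CGCCAATCGC GGCGAAATCG CCCGTCGGAT CATCCGCACC"
dna = normalize(first_line)
print(len(dna))
print(round(gc_fraction(dna), 2))
print(codons(dna)[:3])
```

Applied to the full gene sequence, the same helpers can be used to compare against the reported Gene Length and GC content fields. Note that the gene begins with TTG while the protein begins with M, which is consistent with TTG acting as an alternative start codon under translation table 11.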