Gene Caul_3163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3163 
Symbol 
ID5900618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3429419 
End bp3431428 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content68% 
IMG OID641563667 
Productcarbamoyl-phosphate synthase L chain ATP-binding 
Protein accessionYP_001684788 
Protein GI167647125 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAGA AGCTGCTGAT CGCCAACCGA GGCGAGATCG CGGTTCGGGT GATCAAGACC 
TGCCGCCGGC TGGGCATCAG GACGGTGGTG GTGTTTTCGG ACGCCGACGC CGACAGCCTG
GCCGTGGAGA TGGCCGACGA GACCGTGCAC ATCGGCCCCG CGCCGGCCAA TCAGTCCTAT
CTGGTGGCCG ACAAGATCAT CGCCGCCTGC AAGCAGACCG GCGCCCAGGC CGTGCATCCG
GGCTTCGGCT TCCTGTCGGA GAACGCCAGC TTCGCCCAGC GCTGCGCCGA CGAGGGGATC
GTGTTCATCG GTCCCAACCC GGGCGCCATC TCGGCCATGG GCGACAAGAT CGAGAGCAAG
AAGTTCGCGC AGAAGGCCGG CGTGTCCTGC GTGCCCGGCC ATATCGGCGA GATCGACGAC
ACGGCCCACG CGGTGACCAT CTCCGAGCAG GTGGGCTATC CGGTCATGAT CAAGGCCAGC
GCCGGCGGCG GCGGCAAGGG CATCCGCGTG GCCTGGAACC GCCAGGACGT CGAGGAGGGC
TTTCCGGCCG TGCGGGCCGA GGCCAAGGCC AGCTTCGGCG ACGACCGGAT CTTCATCGAG
AAGTTCATCC AAAGCCCGCG CCACATCGAG ATCCAGGTGC TGGGCGACAA GCATGGCAAC
GTGGTTCACC TGTTCGAGCG CGAATGCTCG ATCCAGCGCC GTAACCAGAA GGTCATCGAG
GAAGCGCCCA GCCCGCTTCT CGACGAGGCC ACCCGCGCCG CCATGGGCGC CCAGGCCGTG
GCCCTGGCCA AGGCCGTGAA CTACGATAGC GCCGGCACGG TCGAGTTCGT GGCCGGCCAG
GACAAGAGCT TCTTCTTCCT GGAAATGAAC ACCCGCCTGC AAGTCGAGCA TCCGGTGACC
GAGCTGATCA CGGGGCTGGA CCTGGTCGAG CAAATGATCC GCTCGGCCTG GGGCGAGAAG
CTGGCCTTCG AACAGAAGGA CTTGAAGATC AACGGCTGGG CCATCGAGAG CCGGATCTAC
GCCGAGGACC CCTACCGCAA GTTCCTGCCG TCGATCGGGC GTCTCGTCCG GTACGATCCG
CCGGAGGAGG GCGAGCAAGA GGGTTACACC GTCCGCAACG ACGCCGGGGT GCGCGAGGGC
GACGAGATCT CGATGTACTA CGACCCGATG ATCTCCAAGC TCTGCACCTG GGCGCCGACC
CGCCTGGCGG CGATCGACGG CATGGGCCGG GCGCTGGAGG ACTTCCACAT CGAGGGCCCC
GCCCACAACA TCCCGTTCCT GGCCGCGGTG ATGGACCAGG ATCGGTTCCG CTCGGGCAGG
ATCTCGACCA ACTACATCAA GGACGAGTTC GCCGACGGCT TCAAAGGCGT CGCGCCCACG
CCCGAGCAGG TCGATGTGAT GACCGCCGTG GGCGCGGCCA TGCAGCGGGT CTACGCCGCC
CGGGCGCGGT CGATCCAGGC GGGGCTGAGC CATCCGATCC GCACCCAATG GGTGGTCGCC
GTCGGTCACG CCAAGCGGCG GGTCGACCTG TCGGGCGGCG CGTCGCTGGG CGAGGCCCCG
CTGACCGTCC AGTTGCTCGA CGAGGGGCGC ACCTTGGTGC TGCAGACCCT CGACTGGCGT
CCCGGCAAGC CGGTGTTCAG GGGCCGGCTG GACGGCAAGG CCTTCACCGT CCAGGTGACC
CCGGCCGCCG AGGGCTTCGT GATCCGCCAC CGGGCCGCCA AGGCCAAGGT GCTGGTCCTG
ACCCCGCGCT CGGCCGAGCT GCACGACAAG CTGCCCGAAA AGCAGGCCGC CGACACTTCC
AGGCTGGTGC TCTCGCCGAT GCCCGGCTTG GTGGTCAGCA TGGACGTCGC CACCGGCCAG
CAGGTCCGCG AGGGCGAGAT CGTCTGCGTG CTCGAGGCCA TGAAGATGCA GAACATCATC
CGCGCCGAGC GCGACGGCGT CGTCAAGGCC GTCAACGCCA AGAGCGGAGA CCCCGTCGCC
GCCGACGAGG TCCTCGTCGA GTTCGCGTGA
 
Protein sequence
MFEKLLIANR GEIAVRVIKT CRRLGIRTVV VFSDADADSL AVEMADETVH IGPAPANQSY 
LVADKIIAAC KQTGAQAVHP GFGFLSENAS FAQRCADEGI VFIGPNPGAI SAMGDKIESK
KFAQKAGVSC VPGHIGEIDD TAHAVTISEQ VGYPVMIKAS AGGGGKGIRV AWNRQDVEEG
FPAVRAEAKA SFGDDRIFIE KFIQSPRHIE IQVLGDKHGN VVHLFERECS IQRRNQKVIE
EAPSPLLDEA TRAAMGAQAV ALAKAVNYDS AGTVEFVAGQ DKSFFFLEMN TRLQVEHPVT
ELITGLDLVE QMIRSAWGEK LAFEQKDLKI NGWAIESRIY AEDPYRKFLP SIGRLVRYDP
PEEGEQEGYT VRNDAGVREG DEISMYYDPM ISKLCTWAPT RLAAIDGMGR ALEDFHIEGP
AHNIPFLAAV MDQDRFRSGR ISTNYIKDEF ADGFKGVAPT PEQVDVMTAV GAAMQRVYAA
RARSIQAGLS HPIRTQWVVA VGHAKRRVDL SGGASLGEAP LTVQLLDEGR TLVLQTLDWR
PGKPVFRGRL DGKAFTVQVT PAAEGFVIRH RAAKAKVLVL TPRSAELHDK LPEKQAADTS
RLVLSPMPGL VVSMDVATGQ QVREGEIVCV LEAMKMQNII RAERDGVVKA VNAKSGDPVA
ADEVLVEFA