Gene Caul_3000 details


Gene Information

Locus tag: Caul_3000
Symbol:
ID: 5900455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3264802
End bp: 3266811
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 68%
IMG OID: 641563497
Product: carbamoyl-phosphate synthase L chain ATP-binding
Protein accession: YP_001684625
Protein GI: 167646962
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
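
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon excluded from the protein length. The record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3264802
end_bp = 3266811

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (2010 bp) and Protein Length (669 aa).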


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGCA AAATTCTGAT CGCCAACCGA GGCGAGATCG CGGTTCGGGT GATCAAGACC 
TGTCGTCGGC TGGGCATCAG GACGGTGGTG GTGTTTTCGG ACGCCGACGC CGACAGCCTG
GCCGTGGAGA TGGCCGACGA GACCGTGCAC ATCGGCCCCG CGCCGGCCAA TCAGTCCTAT
CTGGTGGCCG ACAAGATCAT CGCCGCCTGC AAGCAGACCG GCGCCCAGGC CGTGCATCCG
GGCTTTGGCT TCCTGTCGGA GAACGCCAGC TTCGCCCAGC GCTGCGCCGA CGAGGGGATC
GTGTTCATCG GTCCCAACCC GGGCGCCATC TCGGCGATGG GCGACAAGAT CGAGAGCAAG
AAGTTCGCGC AGAAGGCGGG CGTGTCCTGC GTGCCGGGCC ATATCGGCGA GATCGACGAC
ACGGCCCATG CGGTGACCAT CGCCGAGCAG GTCGGCTATC CGGTGATGAT CAAGGCCAGC
GCCGGCGGCG GCGGCAAGGG CATCCGCGTG GCCTGGACCC GCCAGGACGT CGAGGAAGGC
TTCCCGGCCG TGCGGGCCGA GGCCAAGGCC AGCTTTGGCG ACGACCGCAT CTTCATCGAG
AAGTTTATCG AGAGCCCGCG CCACATCGAG ATCCAGGTGC TGGGCGACAA GCACGGCAAT
GTGGTCCACC TGTTCGAGCG CGAATGCTCG ATCCAGCGCC GCAACCAGAA GGTCATCGAG
GAAGCGCCCA GCCCGCTGCT CGACGAGGCC ACCCGGGCGG CCATGGGCGC CCAGGCCGTG
GCCCTGGCCA AGGCGGTGAA CTACGACAGC GCCGGCACGG TCGAGTTCGT GGCCGGCCAG
GACAAGAGCT TCTACTTCCT GGAGATGAAC ACCCGCCTGC AGGTCGAGCA CCCGGTCACC
GAGCTGATCA CCGGCCTGGA CCTGGTCGAA CAGATGATCC GCAGCGCCTG GGGCGAAAAG
CTGGCCTTCG AACAGCAGGA CCTGAAGATC AACGGCTGGG CCATCGAGAG CCGCATCTAC
GCCGAGGACC CCTACCGCAA GTTCCTGCCC AGCATCGGCC GGCTGGTCCG CTATGACCCG
CCGCAAGAGG GCGAGCACGA GGGCTACACC GTCCGCAACG ACGCCGGGGT GCGCGAGGGC
GACGAGATCT CGATGTACTA CGACCCGATG ATCTCCAAGC TCTGCACCTG GGCGCCGACC
CGCCTGGCGG CGATCGACGG CATGGGCCGG GCGCTGGAGG ACTTCCACAT CGAGGGCCCC
GCCCACAACA TCCCGTTCCT GGCCGCGGTG ATGGACCAGG ATCGGTTCCG CTCGGGCAAG
ATCTCGACCA ACTACATCAA GGACGAGTTC GCCGACGGCT TCAAAGGTGT CGCGCCCACG
CCCGAGCAGG TCGATGTGAT GACCGCCGTG GGCGCGGCCA TGCAGCGGGT CTACGCCGCC
CGGGCGCGGT CGATCCAGGC GGGGCTGAGC CATCCGATCC GCACCCAATG GGTGGTCGCC
GTCGGTCACG CCAAGCGGCG GGTCGACCTG TCGGGCGGCG CGTCGCTGGG CGAGGGGCCG
CTGACCGTCG AACTGCCGGA CGAAGGCCGC ACGATCAGCC TGCAGACCCT GGACTGGCGT
CCGGGCAAGC CGGTGTTCAG GGGCCGGCTG GACGGCAAGG CCTTCACCGT CCAGGTGACC
CCGGCCGCCG AGGGCTTCGT GATCCGCCAC CGGGCCGCCA AGGCCAAGGT GCTGGTCCTG
ACCCCGCGCT CGGCCGAACT GCACGACAAG CTGCCCGAAA AGCAGGCCGC CGACACTTCC
AGGCTGGTGC TCTCGCCGAT GCCCGGCTTG GTGGTCAGCA TGGACGTCGC CACCGGCCAG
CAGGTCCGCG AGGGCGAGAT CGTCTGCGTG CTCGAGGCCA TGAAGATGCA GAACATCATC
CGCGCCGAGC GCGACGGCGT CGTCAAGGCC GTCAACGCCA AGAGCGGAGA CCCCGTCGCC
GCCGACGAGG TCCTCGTCGA GTTCGCGTGA
 
Protein sequence
MFSKILIANR GEIAVRVIKT CRRLGIRTVV VFSDADADSL AVEMADETVH IGPAPANQSY 
LVADKIIAAC KQTGAQAVHP GFGFLSENAS FAQRCADEGI VFIGPNPGAI SAMGDKIESK
KFAQKAGVSC VPGHIGEIDD TAHAVTIAEQ VGYPVMIKAS AGGGGKGIRV AWTRQDVEEG
FPAVRAEAKA SFGDDRIFIE KFIESPRHIE IQVLGDKHGN VVHLFERECS IQRRNQKVIE
EAPSPLLDEA TRAAMGAQAV ALAKAVNYDS AGTVEFVAGQ DKSFYFLEMN TRLQVEHPVT
ELITGLDLVE QMIRSAWGEK LAFEQQDLKI NGWAIESRIY AEDPYRKFLP SIGRLVRYDP
PQEGEHEGYT VRNDAGVREG DEISMYYDPM ISKLCTWAPT RLAAIDGMGR ALEDFHIEGP
AHNIPFLAAV MDQDRFRSGK ISTNYIKDEF ADGFKGVAPT PEQVDVMTAV GAAMQRVYAA
RARSIQAGLS HPIRTQWVVA VGHAKRRVDL SGGASLGEGP LTVELPDEGR TISLQTLDWR
PGKPVFRGRL DGKAFTVQVT PAAEGFVIRH RAAKAKVLVL TPRSAELHDK LPEKQAADTS
RLVLSPMPGL VVSMDVATGQ QVREGEIVCV LEAMKMQNII RAERDGVVKA VNAKSGDPVA
ADEVLVEFA
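
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGTTCAGCA AAATTCTGAT CGCCAACCGA GGCGAGATCG CGGTTCGGGT GATCAAGACC"
last_row = "GCCGACGAGG TCCTCGTCGA GTTCGCGTGA"

print(translate(first_row))  # MFSKILIANRGEIAVRVIKT
print(translate(last_row))   # ADEVLVEFA
```

The two printed strings match the first 20 and the last 9 residues of the protein sequence listed above. Passing the full gene sequence to `translate` would be the natural way to check the remaining residues.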