Gene Caul_4737 details


Gene Information

Locus tag: Caul_4737
Symbol:
ID: 5902199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5122858
End bp: 5124486
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 67%
IMG OID: 641565256
Product: F0F1 ATP synthase subunit beta
Protein accession: YP_001686355
Protein GI: 167648692
COG category: [C] Energy production and conversion
COG ID: [COG0055] F0F1-type ATP synthase, beta subunit
TIGRFAM ID: [TIGR01039] ATP synthase, F1 beta subunit
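
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5122858, 5124486)
print(length)                  # 1629
print(protein_length(length))  # 542
```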


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.603739
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00210198
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage

Sequence

Gene sequence
ATGGCCAAGA CCCCCGCTAA GGCTCCCGCC GCCGCTGCCA AGCCGGCCGC CGTGAAGAAG 
CCCGCCGCTC CCAAGGCTGC TGCCGCCCCG AAGGCCGCCG CTGTCGCGAC CCCCGCCGCC
AAGAAGCCCG CCGCCCCCAA GGCCGCGCCG GTTTCGAAGG TCGCTGGCAC CCGCGAAAAG
CCCGACACCA TCCGCGCCGG CACCGGCAAG CTGGTGCAGG TCATCGGCGC CGTGGTCGAC
GTCGAGTTCA CCGGTGAGCT GCCGGCGATC CTCAACGCCC TGGAAACGGT CAACATCGCC
ACCGGCCAGC GCCTGGTGTT CGAAGTCGCC CAGCACCTGG GCCAGAACAC GGTCCGCGCC
ATCGCCATGG ACGCCACCGA AGGCCTGGTT CGCGGCCAGG AAGTGCGCGA CACCGGCCAA
TCGATCCGCG TCCCGGTCGG CCCCGGCACC CTGGGCCGCA TCATGAACGT CATCGGCGAG
CCGATCGACG AGCAGGGTCC GATCAAGTCG GACATCTTCC GCACGATCCA CCGCGACGCC
CCGACCTTCG CCGAGCAGAC CAACACCGCT GAAGTCCTGG TCACGGGCAT CAAGGTCATC
GACCTGATGT GCCCCTACAC CAAGGGCGGC AAGATCGGCC TGTTCGGCGG CGCCGGCGTC
GGCAAGACCG TGACGATGCA GGAACTGATC AACAACATCG CCAAGGCTTA CGGCGGTTAT
TCGGTTCTGG CCGGCGTGGG CGAACGCACC CGCGAAGGCA ACGACCTCTA TCACGAGATG
ATCGAGTCCA ACGTCAACGT GGACCCGAAG ATCAACGGTT CGACCGAAGG CAGCCGCTGC
GCCCTGGTCT ACGGCCAGAT GAACGAACCC CCCGGCGCCC GCGCCCGCGT GGCCCTGACC
GGCCTCTCCA TCGCTGAATA TTTCCGCGAT GAAGAAGGCA AGGACGTGCT GCTGTTCGTC
GACAACATCT TCCGCTTCAC CCAGGCCGGC GCCGAAGTGT CGGCTCTGCT GGGCCGCATC
CCCTCGGCCG TGGGCTATCA GCCCACCCTG GCCACCGAGA TGGGCAACCT GCAGGAGCGC
ATCACCTCGA CCAACAAGGG TTCGATCACC TCGGTCCAGG CCATCTACGT GCCCGCCGAC
GACCTGACCG ACCCGGCGCC CGCCGCCTCG TTCGCCCACT TGGACGCCAC CACCGTTCTG
AGCCGCGACA TCGCCGCCCA GGCCATCTTC CCGGCCGTCG ATCCGCTGGA CTCGACCTCG
CGGATCATGG ACCCGCTGAT CATCGGCCAG GAGCACTACG ACGTGGCCCG TTCGGTCCAG
GAAGTGCTTC AGCAGTACAA GTCGCTGAAG GACATCATCG CCATCCTGGG CATGGACGAG
CTGTCGGAAG AGGACAAGCT GACCGTCGCC CGGGCGCGCA AGATCTCGCG CTTCCTCAGC
CAGCCGTTCC ACGTCGCCGA GCAGTTCACC AACACCCCTG GTGCCTTCGT GTCGCTGAAG
GACACCATCC GCTCGTTCAA GGGCATCGTG GACGGCGAGT ACGACCACCT GCCGGAAGCC
GCCTTCTACA TGGTCGGCCC GATCGAGGAA GCGGTGGCCA AGGCTGAAAA GCTGGCTGGC
GAAGCCTGA
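
The gene sequence above is printed in 10-base blocks, so it has to be joined into one string before its length or GC content can be compared with the values in the Gene Information section. A minimal sketch, with hypothetical helper names and a short made-up input rather than the full sequence:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Short illustrative input, not the gene sequence from this record.
example = "ATGC GGCC\nTTAA"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give a value to compare with the listed GC content.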
 
Protein sequence
MAKTPAKAPA AAAKPAAVKK PAAPKAAAAP KAAAVATPAA KKPAAPKAAP VSKVAGTREK 
PDTIRAGTGK LVQVIGAVVD VEFTGELPAI LNALETVNIA TGQRLVFEVA QHLGQNTVRA
IAMDATEGLV RGQEVRDTGQ SIRVPVGPGT LGRIMNVIGE PIDEQGPIKS DIFRTIHRDA
PTFAEQTNTA EVLVTGIKVI DLMCPYTKGG KIGLFGGAGV GKTVTMQELI NNIAKAYGGY
SVLAGVGERT REGNDLYHEM IESNVNVDPK INGSTEGSRC ALVYGQMNEP PGARARVALT
GLSIAEYFRD EEGKDVLLFV DNIFRFTQAG AEVSALLGRI PSAVGYQPTL ATEMGNLQER
ITSTNKGSIT SVQAIYVPAD DLTDPAPAAS FAHLDATTVL SRDIAAQAIF PAVDPLDSTS
RIMDPLIIGQ EHYDVARSVQ EVLQQYKSLK DIIAILGMDE LSEEDKLTVA RARKISRFLS
QPFHVAEQFT NTPGAFVSLK DTIRSFKGIV DGEYDHLPEA AFYMVGPIEE AVAKAEKLAG
EA