Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4737 |
Symbol | |
ID | 5902199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5122858 |
End bp | 5124486 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565256 |
Product | F0F1 ATP synthase subunit beta |
Protein accession | YP_001686355 |
Protein GI | 167648692 |
COG category | [C] Energy production and conversion |
COG ID | [COG0055] F0F1-type ATP synthase, beta subunit |
TIGRFAM ID | [TIGR01039] ATP synthase, F1 beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.603739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00210198 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCAAGA CCCCCGCTAA GGCTCCCGCC GCCGCTGCCA AGCCGGCCGC CGTGAAGAAG CCCGCCGCTC CCAAGGCTGC TGCCGCCCCG AAGGCCGCCG CTGTCGCGAC CCCCGCCGCC AAGAAGCCCG CCGCCCCCAA GGCCGCGCCG GTTTCGAAGG TCGCTGGCAC CCGCGAAAAG CCCGACACCA TCCGCGCCGG CACCGGCAAG CTGGTGCAGG TCATCGGCGC CGTGGTCGAC GTCGAGTTCA CCGGTGAGCT GCCGGCGATC CTCAACGCCC TGGAAACGGT CAACATCGCC ACCGGCCAGC GCCTGGTGTT CGAAGTCGCC CAGCACCTGG GCCAGAACAC GGTCCGCGCC ATCGCCATGG ACGCCACCGA AGGCCTGGTT CGCGGCCAGG AAGTGCGCGA CACCGGCCAA TCGATCCGCG TCCCGGTCGG CCCCGGCACC CTGGGCCGCA TCATGAACGT CATCGGCGAG CCGATCGACG AGCAGGGTCC GATCAAGTCG GACATCTTCC GCACGATCCA CCGCGACGCC CCGACCTTCG CCGAGCAGAC CAACACCGCT GAAGTCCTGG TCACGGGCAT CAAGGTCATC GACCTGATGT GCCCCTACAC CAAGGGCGGC AAGATCGGCC TGTTCGGCGG CGCCGGCGTC GGCAAGACCG TGACGATGCA GGAACTGATC AACAACATCG CCAAGGCTTA CGGCGGTTAT TCGGTTCTGG CCGGCGTGGG CGAACGCACC CGCGAAGGCA ACGACCTCTA TCACGAGATG ATCGAGTCCA ACGTCAACGT GGACCCGAAG ATCAACGGTT CGACCGAAGG CAGCCGCTGC GCCCTGGTCT ACGGCCAGAT GAACGAACCC CCCGGCGCCC GCGCCCGCGT GGCCCTGACC GGCCTCTCCA TCGCTGAATA TTTCCGCGAT GAAGAAGGCA AGGACGTGCT GCTGTTCGTC GACAACATCT TCCGCTTCAC CCAGGCCGGC GCCGAAGTGT CGGCTCTGCT GGGCCGCATC CCCTCGGCCG TGGGCTATCA GCCCACCCTG GCCACCGAGA TGGGCAACCT GCAGGAGCGC ATCACCTCGA CCAACAAGGG TTCGATCACC TCGGTCCAGG CCATCTACGT GCCCGCCGAC GACCTGACCG ACCCGGCGCC CGCCGCCTCG TTCGCCCACT TGGACGCCAC CACCGTTCTG AGCCGCGACA TCGCCGCCCA GGCCATCTTC CCGGCCGTCG ATCCGCTGGA CTCGACCTCG CGGATCATGG ACCCGCTGAT CATCGGCCAG GAGCACTACG ACGTGGCCCG TTCGGTCCAG GAAGTGCTTC AGCAGTACAA GTCGCTGAAG GACATCATCG CCATCCTGGG CATGGACGAG CTGTCGGAAG AGGACAAGCT GACCGTCGCC CGGGCGCGCA AGATCTCGCG CTTCCTCAGC CAGCCGTTCC ACGTCGCCGA GCAGTTCACC AACACCCCTG GTGCCTTCGT GTCGCTGAAG GACACCATCC GCTCGTTCAA GGGCATCGTG GACGGCGAGT ACGACCACCT GCCGGAAGCC GCCTTCTACA TGGTCGGCCC GATCGAGGAA GCGGTGGCCA AGGCTGAAAA GCTGGCTGGC GAAGCCTGA
|
Protein sequence | MAKTPAKAPA AAAKPAAVKK PAAPKAAAAP KAAAVATPAA KKPAAPKAAP VSKVAGTREK PDTIRAGTGK LVQVIGAVVD VEFTGELPAI LNALETVNIA TGQRLVFEVA QHLGQNTVRA IAMDATEGLV RGQEVRDTGQ SIRVPVGPGT LGRIMNVIGE PIDEQGPIKS DIFRTIHRDA PTFAEQTNTA EVLVTGIKVI DLMCPYTKGG KIGLFGGAGV GKTVTMQELI NNIAKAYGGY SVLAGVGERT REGNDLYHEM IESNVNVDPK INGSTEGSRC ALVYGQMNEP PGARARVALT GLSIAEYFRD EEGKDVLLFV DNIFRFTQAG AEVSALLGRI PSAVGYQPTL ATEMGNLQER ITSTNKGSIT SVQAIYVPAD DLTDPAPAAS FAHLDATTVL SRDIAAQAIF PAVDPLDSTS RIMDPLIIGQ EHYDVARSVQ EVLQQYKSLK DIIAILGMDE LSEEDKLTVA RARKISRFLS QPFHVAEQFT NTPGAFVSLK DTIRSFKGIV DGEYDHLPEA AFYMVGPIEE AVAKAEKLAG EA
|
| |