Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4740 |
Symbol | |
ID | 5902202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5126073 |
End bp | 5127605 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641565259 |
Product | F0F1 ATP synthase subunit alpha |
Protein accession | YP_001686358 |
Protein GI | 167648695 |
COG category | [C] Energy production and conversion |
COG ID | [COG0056] F0F1-type ATP synthase, alpha subunit |
TIGRFAM ID | [TIGR00962] proton translocating ATP synthase, F1 alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.408023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00635659 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACATCC GCGCCGCCGA GATTTCGGCC ATCCTCAAGT CGCAGATCGC CAATTTCGGC GAAGAAGCCG CCGTCTCGGA CGTCGGTCAG GTGCTGTCCG TCGGTGACGG CATCGCTCGC ATCTATGGCT TGGACAACGT CCAGGCCGGC GAAATGGTCG AATTCCCGAA GGCCGGCGTG AAGGGCATGG CCCTGAACCT CGAGCGCGAC AATGTCGGCG CCGTGATCTT CGGCCAGGAC CAGGCCATCA AGGAAGGCGA CGAAGTCCGT CGTCTCGGCG AGATCGTCGA CGTTCCGGTC GGCCGCGGCC TGCTGGGCCG CGTCGTCAAC CCGCTGGGCG AGCCGATCGA CGGCAAGGGC CCGATCGTCT CGACCGAGCG TCGCCGCGTC GACGTCAAGG CGCCCGGCAT CATCCCGCGC AAGTCGGTGC ACGAGCCCGT GCAGACCGGC CTGAAGTCGA TCGACACCCT GATCCCCGTC GGCCGCGGCC AGCGCGAGCT GATCATCGGT GACCGTCAGA CCGGCAAGAC CGCCGTCGCC ATCGACACCA TCCTGAACCA GAAGGCCGCC AACGCCGGCA CGGACGAGAG CGCCAAGCTC TATTGCGTCT ATGTCGCCAT CGGCCAGAAG CGTTCGACCG TCGCCCAGAT CGTCAAGACG CTCGAAGAGC ACGGCGCTCT GGAATACACG ATCGTCGTCG TGGCCTCGGC TTCCGAGCCG GCCCCGCTGC AATACCTGGC CCCGTTCTCG GGCTGCGCCA TGGGCGAGTG GTTCCGCGAC AACGGTCTGC ACGGCCTGAT CATCTATGAC GACCTTTCCA AGCAAGCTGT CGCCTACCGC CAGATGTCGT TGCTGCTGCG CCGCCCGCCG GGCCGCGAAG CCTATCCGGG CGACGTCTTC TACCTGCACT CCCGCCTGCT GGAACGCGCC GCCAAGCTGA ACGAAGACAA CGGTTCGGGT TCGCTGACGG CGCTGCCGAT CATCGAAACC CAGGCCAACG ACGTTTCGGC CTACATCCCG ACCAACGTGA TCTCGATCAC CGACGGCCAG ATCTTCCTGG AAACCGACCT GTTCTATCAG GGCATTCGTC CCGCCGTGAA CGTCGGCATC TCGGTGTCGC GCGTCGGCTC GTCGGCCCAG ATCAAGGCCA TGAAGCAAGT CGCCGGCGCG ATTAAGGGCG AGTTGGCCCA GTATCGCGAA ATGGCCGCCT TCGCCAAGTT CGGCTCGGAC CTGGACGCCT CGACCCAAAA GCTGCTGGCC CGCGGCGAGC GTCTGACCGA GCTGCTCAAG CAGCCGCAAT ACGCGCCGCA GGCCGTCGAA GAGCAGGTCT GCGTGATCTA CGCCGGTACG CGCGGCTATC TGGACAACAT CCCGACCTCG TCGGTCCGCC GGTTCGAGAG CGAGCTGCTG GCCCGCCTGC ACAGCCAGCA CAAGGATCTG CTGGACAACA TTCGCACCAA GAAGGCCCTC GATAAGGACC TCGAGAACAC GCTCAAGAGC GTGCTCGACA ACTTCTCGGC GACCTTCGCC TAG
|
Protein sequence | MDIRAAEISA ILKSQIANFG EEAAVSDVGQ VLSVGDGIAR IYGLDNVQAG EMVEFPKAGV KGMALNLERD NVGAVIFGQD QAIKEGDEVR RLGEIVDVPV GRGLLGRVVN PLGEPIDGKG PIVSTERRRV DVKAPGIIPR KSVHEPVQTG LKSIDTLIPV GRGQRELIIG DRQTGKTAVA IDTILNQKAA NAGTDESAKL YCVYVAIGQK RSTVAQIVKT LEEHGALEYT IVVVASASEP APLQYLAPFS GCAMGEWFRD NGLHGLIIYD DLSKQAVAYR QMSLLLRRPP GREAYPGDVF YLHSRLLERA AKLNEDNGSG SLTALPIIET QANDVSAYIP TNVISITDGQ IFLETDLFYQ GIRPAVNVGI SVSRVGSSAQ IKAMKQVAGA IKGELAQYRE MAAFAKFGSD LDASTQKLLA RGERLTELLK QPQYAPQAVE EQVCVIYAGT RGYLDNIPTS SVRRFESELL ARLHSQHKDL LDNIRTKKAL DKDLENTLKS VLDNFSATFA
|
| |