Gene Caul_4740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4740 
Symbol 
ID5902202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5126073 
End bp5127605 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content65% 
IMG OID641565259 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001686358 
Protein GI167648695 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.408023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00635659 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACATCC GCGCCGCCGA GATTTCGGCC ATCCTCAAGT CGCAGATCGC CAATTTCGGC 
GAAGAAGCCG CCGTCTCGGA CGTCGGTCAG GTGCTGTCCG TCGGTGACGG CATCGCTCGC
ATCTATGGCT TGGACAACGT CCAGGCCGGC GAAATGGTCG AATTCCCGAA GGCCGGCGTG
AAGGGCATGG CCCTGAACCT CGAGCGCGAC AATGTCGGCG CCGTGATCTT CGGCCAGGAC
CAGGCCATCA AGGAAGGCGA CGAAGTCCGT CGTCTCGGCG AGATCGTCGA CGTTCCGGTC
GGCCGCGGCC TGCTGGGCCG CGTCGTCAAC CCGCTGGGCG AGCCGATCGA CGGCAAGGGC
CCGATCGTCT CGACCGAGCG TCGCCGCGTC GACGTCAAGG CGCCCGGCAT CATCCCGCGC
AAGTCGGTGC ACGAGCCCGT GCAGACCGGC CTGAAGTCGA TCGACACCCT GATCCCCGTC
GGCCGCGGCC AGCGCGAGCT GATCATCGGT GACCGTCAGA CCGGCAAGAC CGCCGTCGCC
ATCGACACCA TCCTGAACCA GAAGGCCGCC AACGCCGGCA CGGACGAGAG CGCCAAGCTC
TATTGCGTCT ATGTCGCCAT CGGCCAGAAG CGTTCGACCG TCGCCCAGAT CGTCAAGACG
CTCGAAGAGC ACGGCGCTCT GGAATACACG ATCGTCGTCG TGGCCTCGGC TTCCGAGCCG
GCCCCGCTGC AATACCTGGC CCCGTTCTCG GGCTGCGCCA TGGGCGAGTG GTTCCGCGAC
AACGGTCTGC ACGGCCTGAT CATCTATGAC GACCTTTCCA AGCAAGCTGT CGCCTACCGC
CAGATGTCGT TGCTGCTGCG CCGCCCGCCG GGCCGCGAAG CCTATCCGGG CGACGTCTTC
TACCTGCACT CCCGCCTGCT GGAACGCGCC GCCAAGCTGA ACGAAGACAA CGGTTCGGGT
TCGCTGACGG CGCTGCCGAT CATCGAAACC CAGGCCAACG ACGTTTCGGC CTACATCCCG
ACCAACGTGA TCTCGATCAC CGACGGCCAG ATCTTCCTGG AAACCGACCT GTTCTATCAG
GGCATTCGTC CCGCCGTGAA CGTCGGCATC TCGGTGTCGC GCGTCGGCTC GTCGGCCCAG
ATCAAGGCCA TGAAGCAAGT CGCCGGCGCG ATTAAGGGCG AGTTGGCCCA GTATCGCGAA
ATGGCCGCCT TCGCCAAGTT CGGCTCGGAC CTGGACGCCT CGACCCAAAA GCTGCTGGCC
CGCGGCGAGC GTCTGACCGA GCTGCTCAAG CAGCCGCAAT ACGCGCCGCA GGCCGTCGAA
GAGCAGGTCT GCGTGATCTA CGCCGGTACG CGCGGCTATC TGGACAACAT CCCGACCTCG
TCGGTCCGCC GGTTCGAGAG CGAGCTGCTG GCCCGCCTGC ACAGCCAGCA CAAGGATCTG
CTGGACAACA TTCGCACCAA GAAGGCCCTC GATAAGGACC TCGAGAACAC GCTCAAGAGC
GTGCTCGACA ACTTCTCGGC GACCTTCGCC TAG
 
Protein sequence
MDIRAAEISA ILKSQIANFG EEAAVSDVGQ VLSVGDGIAR IYGLDNVQAG EMVEFPKAGV 
KGMALNLERD NVGAVIFGQD QAIKEGDEVR RLGEIVDVPV GRGLLGRVVN PLGEPIDGKG
PIVSTERRRV DVKAPGIIPR KSVHEPVQTG LKSIDTLIPV GRGQRELIIG DRQTGKTAVA
IDTILNQKAA NAGTDESAKL YCVYVAIGQK RSTVAQIVKT LEEHGALEYT IVVVASASEP
APLQYLAPFS GCAMGEWFRD NGLHGLIIYD DLSKQAVAYR QMSLLLRRPP GREAYPGDVF
YLHSRLLERA AKLNEDNGSG SLTALPIIET QANDVSAYIP TNVISITDGQ IFLETDLFYQ
GIRPAVNVGI SVSRVGSSAQ IKAMKQVAGA IKGELAQYRE MAAFAKFGSD LDASTQKLLA
RGERLTELLK QPQYAPQAVE EQVCVIYAGT RGYLDNIPTS SVRRFESELL ARLHSQHKDL
LDNIRTKKAL DKDLENTLKS VLDNFSATFA