Gene Caul_0634 details


Gene Information

Locus tag: Caul_0634
Symbol:
ID: 5898089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 703547
End bp: 705127
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 65%
IMG OID: 641561116
Product: cytochrome bd ubiquinol oxidase subunit I
Protein accession: YP_001682265
Protein GI: 167644602
COG category: [C] Energy production and conversion
COG ID: [COG1271] Cytochrome bd-type quinol oxidase, subunit 1
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.361821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.168063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCCG CCGTCATCGA CCTTTCTCGG CTGCAGTTCG CCCTGACGGC GCTGTACCAT 
TTCCTGTTCG TGCCCCTGAC CCTGGGTCTC TCCTTCATGC TGGTGATCAT GGAGAGCGTC
TATGTGATGA CCAAGCGGCC GATCTGGAAA ACCACCACCC GGTTCTGGAG CACGCTGTTC
GGCATCAATT TCGTGCTGGG CGTCGCCACC GGCCTGACCA TGGAATTCCA GTTCGGCATG
AACTGGTCCT ACTATTCGCA CTATGTGGGC GACATCTTCG GCGCGCCCCT GGCCATCGAG
GGCCTGATGG CCTTCTTCCT CGAAGCCACC TTCGTCGGGC TGATGTTCTT CGGCTGGGAC
AAGCTCAAGC CCGTCACCCA CCTGTTCGTG ACCTTCCTCG TCGCCTTGGG CACCAACCTG
TCGGCCCTGT GGATCCTGGT CGCCAACGGC TGGATGCAGA ACCCGGTCGG CGCGGCCTTC
AATCCCGACA CGATGCGGAT GGAGGTGGTC GACTTCGGGG CGGTGGTCTT CAACCCGGTG
GCCCAGGCCA AGTTCGTGCA CACCGTCAGC GCTGGCTACA CCATCGCCGC GGTCTTCGTG
CTGGGGATCA GCGCCTTCTA CCTGCTGAAG GGTCGTTATG TCAGCGTCGC CAAGCGCTCG
CTGACCGTGG CCGCCGCCTT CGGCCTGGCC TCGTCCCTGT CGGTGGTCGT GCTGGGCGAC
GAGAGCGGCT ACGCCCTGAC CGACAACCAG AAAATGAAGC TCGCGGCCTT GGAGGCCATG
TGGGAGACCG AACCCGCGCC GGCCGGCCTG ACCGCTTTTG GCATTCCCGA TCTGAAGAAC
CGCACGACCC ATGCCGAGGT CAAGATTCCC TATGTTCTGG GCCTGATCTC GACCCGCAGC
CTGGACCGTC CGGTGGCCGG CATCTTCCAA CTGGTCGCCC AGGCGCAGAC CCGCATCGAG
AGCGGCGTCG TGGCCTATGA CGCGCTGGAA AAGCTGAAGG TCACCCCCAC CGATCTGGCG
GCGCGCGGCG TGTTCGAGAC CCACCGCCGC GATCTGGGCT ACGCGCTGCT GCTCAAGCGC
TATGTCGCTG ATCCCCGCCA GGCCGACGCG GCGCTGATCG CCAAGACCGC CTGGGACACC
GTGCCCAATG TGCCGGTGAT GTTCTGGGCG TTCCGGATCA TGGCCGGCAT CGGTTTCCTG
ATGATCGCCA TGTTCGCGAC CGCCTTCGTC CTGGTCACCC TGCGCAAGCA CAATACCCGC
TGGTTCCTGA TGATCGCGGT GGCGGCCATC CCCCTGCCGT GGATCTCGAC GGAGCTGGGC
TGGGTGCTGG CCGAGGTCGG ACGCCAGCCC TGGGCGGTCG AGGGCGTGCT GCCCACCTTC
CTGGCGCCGT CCAGCCTCAG CGTGGCCCAG GTCCTGACCA GCATCGTGAT CTTCACCCTG
CTCTATGGAT CGCTGGCGGT GGTCGAGGTC GGACTGATCC TCAAGACCAT CAAGAAGGGT
CCCTTCGCCG ACCAGGAGGC CTTCCCATCC GGCGCTCCAG GGCGTCTTGG GGCCGCCCCC
GCCGGCGAAG CCGTGGCCTA G
 
Protein sequence
MDPAVIDLSR LQFALTALYH FLFVPLTLGL SFMLVIMESV YVMTKRPIWK TTTRFWSTLF 
GINFVLGVAT GLTMEFQFGM NWSYYSHYVG DIFGAPLAIE GLMAFFLEAT FVGLMFFGWD
KLKPVTHLFV TFLVALGTNL SALWILVANG WMQNPVGAAF NPDTMRMEVV DFGAVVFNPV
AQAKFVHTVS AGYTIAAVFV LGISAFYLLK GRYVSVAKRS LTVAAAFGLA SSLSVVVLGD
ESGYALTDNQ KMKLAALEAM WETEPAPAGL TAFGIPDLKN RTTHAEVKIP YVLGLISTRS
LDRPVAGIFQ LVAQAQTRIE SGVVAYDALE KLKVTPTDLA ARGVFETHRR DLGYALLLKR
YVADPRQADA ALIAKTAWDT VPNVPVMFWA FRIMAGIGFL MIAMFATAFV LVTLRKHNTR
WFLMIAVAAI PLPWISTELG WVLAEVGRQP WAVEGVLPTF LAPSSLSVAQ VLTSIVIFTL
LYGSLAVVEV GLILKTIKKG PFADQEAFPS GAPGRLGAAP AGEAVA
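
Consistency check (illustrative)

The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is not part of the original record. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon, and the helper names (clean_sequence, gc_fraction, gene_length, protein_length) are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to display sequences in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Coordinates taken from the Gene Information section of this record.
length = gene_length(703547, 705127)
print(length)                  # 1581, matching "Gene Length"
print(protein_length(length))  # 526, matching "Protein Length"

# GC fraction of the first two displayed blocks only (not the whole gene).
first_blocks = clean_sequence("ATGGATCCCG CCGTCATCGA")
print(len(first_blocks), gc_fraction(first_blocks))
```

To check the reported GC content, pass the full gene sequence through clean_sequence before calling gc_fraction.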